PandaDoc

About PandaDoc

Streamlining document workflows for growing organizations

🏢 Retail, Tech👥 251-1K📅 Founded 2013📍 San Francisco, California, United States

Key Highlights

  • Over 35,000 customers including Cisco and HubSpot
  • Headquartered in San Francisco, California
  • Raised $50M+ from investors like Rembrandt Venture Partners
  • Offers unlimited PTO and flexible remote work options

PandaDoc is a document workflow automation platform headquartered in San Francisco, California, that serves over 35,000 organizations, including notable clients like Cisco and HubSpot. The platform streamlines the creation, management, and signing of digital documents such as proposals, quotes, and ...

🎁 Benefits

PandaDoc offers competitive salaries, equity options, unlimited PTO, and a flexible remote work policy, allowing employees to maintain a healthy work-...

🌟 Culture

PandaDoc fosters a remote-friendly culture that emphasizes collaboration and innovation, encouraging employees to contribute ideas and take ownership ...

PandaDoc

Site Reliability Engineer Senior

PandaDocPoland

Apply Now →

Overview

PandaDoc is seeking a Senior Site Reliability Engineer to ensure reliable service with minimal downtime. You'll manage incident processes, maintain observability tools, and contribute to service codebases using Python and Java. This role requires strong experience in AWS and Kubernetes.

Job Description

Who you are

You have solid programming experience, particularly with Python and Java, and are familiar with frameworks such as Django and Spring Boot. Your background includes maintaining observability tools, specifically LGTM, which encompasses Loki, Grafana, Tempo, and Mimir. You have a strong grasp of AWS and Kubernetes, ensuring that production applications run smoothly. Your experience extends to working with relational databases like PostgreSQL and messaging systems such as RabbitMQ, NATS, or Kafka. As an experienced on-call SRE engineer, you understand the importance of incident management and are adept at developing automations and tools to support platform reliability. You enjoy mentoring others and fostering SRE principles within your team.

What you'll do

In this role, you will own and influence the incident management process from start to finish, ensuring that incidents are handled efficiently and effectively. You will maintain and evolve the on-prem observability stack, keeping production applications running smoothly by participating in the on-call rotation. Your contributions will include developing automations and tools that enhance platform reliability and collaborating with product engineers to integrate SRE principles within the R&D organization. You will also be responsible for proactively preventing incidents and resolving performance bottlenecks by actively contributing to service codebases. Your role will be pivotal in driving efforts in observability, incident management, and capacity planning, ultimately ensuring the resiliency of production services.

What we offer

At PandaDoc, we value the contributions of our Site Reliability Engineers and provide an environment that fosters growth and collaboration. You will have the opportunity to work with a talented team dedicated to maintaining reliable operations and enhancing service performance. We encourage you to apply even if your experience doesn't match every requirement, as we believe in the potential of diverse backgrounds and perspectives. Join us in our mission to deliver exceptional service to our customers while enjoying a supportive and innovative workplace.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at PandaDoc.

Similar Jobs You Might Like

Based on your interests and this role

PandaDoc

Site Reliability Engineer

PandaDoc📍 Portugal

PandaDoc is hiring a Senior Site Reliability Engineer to ensure reliable service with minimal downtime. You'll manage incident processes and contribute to service codebases using Python and Java. This role requires strong experience with AWS and Kubernetes.

Senior
2w ago
PandaDoc

Site Reliability Engineer

PandaDoc📍 Ukraine - Remote

PandaDoc is hiring a Senior Site Reliability Engineer to ensure reliable service with minimal downtime. You'll work with Python, Java, AWS, and Kubernetes to manage incident processes and observability tools. This role requires solid programming experience and expertise in maintaining production services.

🏠 RemoteSenior
2w ago
PandaDoc

Site Reliability Engineer

PandaDoc📍 Remote (Europe) - Remote

PandaDoc is hiring a Senior Site Reliability Engineer to ensure reliable service with minimal downtime. You'll manage incident processes, observability tools, and contribute to service codebases using Python and Java. This role requires solid experience in AWS and Kubernetes.

🏠 RemoteSenior
1d ago
PandaDoc

Site Reliability Engineer

PandaDoc📍 Spain - Remote

PandaDoc is hiring a Senior Site Reliability Engineer to ensure reliable service with minimal downtime. You'll work with Python, Java, AWS, and Kubernetes to manage incident processes and observability tools. This role requires solid programming experience and expertise in maintaining production services.

🏠 RemoteSenior
1d ago
Affirm

Site Reliability Engineer

Affirm📍 Poland - Remote

Affirm is seeking a Senior Site Reliability Engineer to enhance the reliability of their cloud infrastructure. You'll work with Kubernetes and automation tools to support Affirm's engineering teams. This role requires strong cloud engineering skills and experience in operational excellence.

🏠 RemoteSenior
1 month ago