About PandaDoc

Streamlining document workflows for growing organizations

🏢 Retail, Tech • 👥 251-1K • 📅 Founded 2013 • 📍 San Francisco, California, United States

Key Highlights

Over 35,000 customers including Cisco and HubSpot
Headquartered in San Francisco, California
Raised $50M+ from investors like Rembrandt Venture Partners
Offers unlimited PTO and flexible remote work options

PandaDoc is a document workflow automation platform headquartered in San Francisco, California, that serves over 35,000 organizations, including notable clients like Cisco and HubSpot. The platform streamlines the creation, management, and signing of digital documents such as proposals, quotes, and ...

🎁 Benefits

PandaDoc offers competitive salaries, equity options, unlimited PTO, and a flexible remote work policy, allowing employees to maintain a healthy work-...

🌟 Culture

PandaDoc fosters a remote-friendly culture that emphasizes collaboration and innovation, encouraging employees to contribute ideas and take ownership ...


Site Reliability Engineer • Senior

PandaDoc • Portugal

Posted 2w ago • Senior Site Reliability Engineer • 📍 Portugal


Skills & Technologies

Python, Java, Spring Boot, AWS, Kubernetes, PostgreSQL, Grafana, RabbitMQ, NATS, Kafka

Overview

PandaDoc is hiring a Senior Site Reliability Engineer to ensure reliable service with minimal downtime. You'll manage incident processes and contribute to service codebases using Python and Java. This role requires strong experience with AWS and Kubernetes.

Job Description

Who you are

You have solid programming experience, particularly with Python and Java, and are familiar with frameworks such as Django and Spring Boot. You have maintained observability tooling, specifically the LGTM stack (Loki, Grafana, Tempo, and Mimir). You have a strong grasp of AWS and Kubernetes and know how to keep production applications running smoothly on them. You are proficient with relational databases, particularly PostgreSQL, and with messaging systems such as RabbitMQ, NATS, and Kafka. As an experienced on-call SRE, you understand the importance of reliability and performance in production environments.

Desirable

You enjoy mentoring others, fostering SRE principles within teams, and contributing to a culture of reliability and resilience. Your experience in developing automations and tools to support platform reliability is a plus, as is your ability to collaborate effectively with product engineers.

What you'll do

In this role, you will own and influence the incident management process from start to finish, ensuring that incidents are handled efficiently and effectively. You will maintain and evolve the on-prem observability stack, keeping a close eye on performance metrics and alerting systems. You will contribute to production services to improve their performance and resiliency, and you will take part in the on-call rotation to address issues as they arise. You will develop automations and tools that support platform reliability, streamline processes, and reduce downtime. Collaboration with product engineers will be key as you work to instill SRE principles within the R&D organization. You will also mentor junior SRE team members and product engineers, sharing your knowledge and expertise to help them grow.

What we offer

At PandaDoc, we value the contributions of our SRE team and offer a supportive environment where you can thrive. You will be part of a team that is essential to our success, ensuring that our customers receive a reliable service. We encourage you to apply even if your experience doesn't match every requirement, as we believe in the potential of our team members to grow and develop in their roles. Join us in making a significant impact on our production services and customer satisfaction.

