WorkOS

About WorkOS

APIs that empower startups for enterprise success

🏢 Tech👥 21-100 employees📅 Founded 2019📍 Financial District, San Francisco, CA💰 $95m4
B2BEnterpriseSaaSAPI

Key Highlights

  • Headquartered in San Francisco, CA, in the Financial District
  • $95 million raised in funding, including Series B
  • Offers essential enterprise features like SSO and Directory Sync
  • Supports 21-100 employees with a focus on B2B SaaS

WorkOS is a B2B SaaS company headquartered in the Financial District of San Francisco, CA, focused on helping startups bridge the gap to enterprise clients. With $95 million in funding, including a substantial Series B round, WorkOS provides a single API-driven platform that simplifies integration w...

🎁 Benefits

WorkOS offers a comprehensive benefits package, including a stipend for home office equipment, 90% premium coverage for employee and dependent health ...

🌟 Culture

WorkOS fosters a culture that prioritizes solving the challenges of scaling B2B SaaS startups. With a focus on providing essential tools for enterpris...

Overview

WorkOS is hiring a Site Reliability Engineer to ensure the platform remains fast, reliable, and resilient at scale. You'll work with AWS, Docker, and Kubernetes to build systems that handle hundreds of millions of requests. This role requires a strong understanding of complex systems and incident response.

Job Description

Who you are

You have a strong background in site reliability engineering and are excited to improve the reliability of complex systems. You enjoy digging into how things work and have experience with cloud infrastructure, particularly AWS. Your expertise in Docker and Kubernetes allows you to design scalable systems that can handle high traffic loads. You are comfortable working in a Linux environment and have a solid understanding of monitoring tools like Prometheus. You thrive in collaborative settings, working closely with engineers across disciplines to ensure production readiness and effective incident response.

You are proactive and take initiative, identifying reliability risks and driving improvements independently. Your problem-solving skills enable you to think through architectural trade-offs with reliability, simplicity, and maintainability in mind. You are passionate about uptime and performance, and you understand the importance of embedding reliability into everything you do. You are also open to mentoring others and sharing your knowledge with the team.

What you'll do

As a Site Reliability Engineer at WorkOS, you will be responsible for ensuring that our platform remains fast, reliable, and resilient at scale. You will build systems and practices that keep everything running smoothly, handling hundreds of millions of requests while minimizing downtime. Your role will involve collaborating closely with infrastructure and product engineering teams to embed reliability into our processes. You will lead incident response efforts and conduct postmortem reviews to continuously improve our service performance.

You will also work on designing scalable systems and improving observability across our platform. This includes implementing monitoring solutions and alerting mechanisms to proactively identify and address issues before they impact our customers. You will take part in capacity planning and performance tuning to ensure that our systems can handle growth effectively. Your contributions will directly impact the reliability of our services and the satisfaction of our customers.

What we offer

At WorkOS, we offer a fully distributed work environment with employees across North American time zones. We are well-funded and have a fast-growing customer base that includes rapidly growing SaaS companies. You will have the opportunity to work with cutting-edge technologies and make a lasting impact on our platform's reliability. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at WorkOS.

Similar Jobs You Might Like

Based on your interests and this role

Together AI

Site Reliability Engineer

Together AI📍 San Francisco

Together AI is hiring a Site Reliability Engineer to ensure the reliability and performance of user-facing services and production systems. You'll work with Ansible, Terraform, and Kubernetes to build and manage infrastructure. This role requires 2+ years of experience in SRE or a related field.

Mid-Level
2w ago
Mercor

Site Reliability Engineer

Mercor📍 San Francisco - On-Site

Mercor is seeking a Site Reliability Engineer to own production reliability across critical systems. You'll work with AWS, Kubernetes, and Terraform to build and improve high-availability systems in San Francisco.

🏛️ On-SiteMid-Level
1 month ago
Apple

Site Reliability Engineer

Apple📍 San Francisco - On-Site

Apple is seeking a Site Reliability Engineer to join their Services Engineering team. You'll be responsible for building secure, end-to-end solutions and managing the full infrastructure stack. This role requires expertise in solving complex problems at scale.

🏛️ On-Site
1 month ago
Braze

Site Reliability Engineer

Braze📍 San Francisco - On-Site

Braze is hiring a Senior Site Reliability Engineer to ensure the uptime of internal-facing services and platforms. You'll work with Linux, distributed systems, and automation to maintain high service availability. This position requires a strong background in system administration and software engineering.

🏛️ On-SiteSenior
1w ago
Stellar Development Foundation

Site Reliability Engineer

Stellar Development Foundation📍 San Francisco - On-Site

Stellar Development Foundation is hiring a Senior Site Reliability Engineer to enhance the reliability and scalability of their systems. You'll work with AWS, GCP, and Kubernetes to support the Stellar blockchain ecosystem. This role requires strong experience in infrastructure management and automation.

🏛️ On-SiteSenior
3w ago