Replit

About Replit

The coding platform that empowers everyone to learn

🏢 Tech | 👥 101-200 employees | 📅 Founded 2016 | 📍 SoMa, San Francisco, CA | 💰 $472.2m
B2C, B2B, Artificial Intelligence, Enterprise, Training, Learning, SaaS

Key Highlights

  • Raised $472.2 million in funding
  • Millions of users, including Google and Facebook employees
  • Supports popular languages like C++, JavaScript, and PHP
  • Remote-first culture with flexible work hours

Replit is a collaborative coding platform that simplifies programming for learners, educators, and developers. Based in SoMa, San Francisco, Replit has attracted millions of users, including employees from major tech companies like Google, Facebook, and Stripe. The company has raised $472.2 million ...

🎁 Benefits

Replit offers a remote-first work environment with flexible hours, equity options, and a home office setup stipend. Employees enjoy comprehensive heal...

🌟 Culture

Replit's culture is centered around accessibility in coding, allowing users to start programming without complex setups. The company values innovation...

Overview

Replit is hiring a Site Reliability Engineer to ensure the reliability and performance of its infrastructure. You'll work with tools like Terraform and Ansible to automate operational tasks and implement monitoring solutions. This role requires a passion for building resilient systems at scale.

Job Description

Who you are

You have a strong background in site reliability engineering, with experience in designing and implementing observability solutions that provide real-time visibility into system health and performance. You are skilled in creating dashboards and metrics that enable quick problem identification and resolution, ensuring high availability of services.

You are proficient in automation and infrastructure as code, having architected and implemented solutions using tools like Terraform, Ansible, or Pulumi. Your experience includes designing and maintaining CI/CD pipelines that facilitate reliable and consistent deployments, contributing to the overall efficiency of the development process.

You are passionate about building and maintaining resilient systems at scale, understanding the importance of self-healing systems that can automatically recover from failures. Your technical expertise is complemented by your ability to collaborate effectively with cross-functional teams, bridging the gap between development and operations.

What you'll do

In this role, you will join Replit's Site Reliability Engineering team, which is responsible for the reliability, scalability, and performance of the infrastructure that serves millions of developers worldwide. You will design and implement robust monitoring solutions and automate operational tasks to improve system reliability and performance.

You will develop comprehensive monitoring and alerting systems using modern observability tools, creating dashboards that provide insights into system health. Your responsibilities will include implementing logging strategies that enable quick problem identification and resolution, ensuring that the platform remains operational and efficient.

You will drive automation initiatives, architecting infrastructure automation solutions that streamline processes and reduce manual intervention. Your work will involve designing and maintaining CI/CD pipelines that support consistent and reliable deployments, contributing to the overall success of the engineering team.

What we offer

At Replit, we are committed to democratizing software development and making programming accessible to everyone. We offer a collaborative work environment where diverse perspectives are valued, and we encourage candidates from all backgrounds to apply. Join us in shaping the future of software creation and be part of a team that is making a significant impact in the tech industry.


Similar Jobs You Might Like

Based on your interests and this role

Replit

Site Reliability Engineer

Replit | 📍 Foster City - Hybrid

Replit is hiring a Staff Site Reliability Engineer to ensure the reliability and performance of their infrastructure. You'll work with AWS, Docker, and Kubernetes to implement automation and best practices. This role requires a strong background in SRE principles and experience in building resilient systems.

🏢 Hybrid | Staff
3 months ago
Zoox

Site Reliability Engineer

Zoox | 📍 Foster City - On-Site

Zoox is seeking a Site Reliability Engineer to ensure the availability and performance of services for autonomous vehicles. You'll work with systems processing massive data volumes and support compute-intensive pipelines. This role requires expertise in Linux, Docker, and Kubernetes.

🏛️ On-Site
1 month ago
ConductorOne

Site Reliability Engineer

ConductorOne | 📍 Portland - On-Site

ConductorOne is hiring a Site Reliability Engineer to design and operate highly reliable infrastructure across cloud environments. You'll work with AWS, GCP, and Azure while building automation and tooling to enhance system reliability. This position requires 3+ years of experience in SRE or DevOps.

🏛️ On-Site | Mid-Level
4 months ago
Together AI

Site Reliability Engineer

Together AI | 📍 San Francisco

Together AI is hiring a Site Reliability Engineer to ensure the reliability and performance of user-facing services and production systems. You'll work with Ansible, Terraform, and Kubernetes to build and manage infrastructure. This role requires 2+ years of experience in SRE or a related field.

Mid-Level
2 weeks ago
Five9

Site Reliability Engineer

Five9 | 📍 United States - Remote

Five9 is seeking a Site Reliability Engineer to build and maintain highly reliable, scalable systems. You'll work with AWS, Docker, and Linux to ensure service reliability and performance. This role requires a blend of software engineering and operational expertise.

🏠 Remote | Mid-Level
2 days ago