Netflix

About Netflix

The streaming service redefining entertainment worldwide

🏢 Tech, Media👥 10K-50K📅 Founded 1997📍 Los Gatos, California, United States

Key Highlights

  • Over 238 million subscribers across 190 countries
  • Headquartered in Los Gatos, California
  • Valued at over $150 billion
  • Offers a vast library of original content and films

Netflix, headquartered in Los Gatos, California, is a leading streaming service with over 238 million subscribers globally. The platform offers a vast library of movies, TV shows, and original content, including award-winning series like 'Stranger Things' and 'The Crown.' With a market valuation exc...

🎁 Benefits

Employees enjoy competitive salaries, stock options, unlimited PTO, and comprehensive health benefits. Netflix also offers a flexible remote work poli...

🌟 Culture

Netflix fosters a culture of freedom and responsibility, encouraging employees to take risks and make decisions independently. The company values tran...

Overview

Netflix is hiring a Senior Site Reliability Engineer to design and maintain scalable infrastructure for their streaming services. You'll work with technologies like AWS, Docker, and Kubernetes. This position requires a strong background in distributed systems and incident management.

Job Description

Who you are

You have 5+ years of experience in site reliability engineering or a related field, with a strong understanding of distributed systems and their complexities. You have a proven track record of designing and implementing scalable and reliable infrastructure, ensuring high availability and performance for critical applications.

Your expertise includes working with cloud platforms like AWS, and you are proficient in containerization technologies such as Docker and orchestration tools like Kubernetes. You have a solid foundation in Linux systems and are comfortable scripting in languages like Python to automate tasks and improve operational efficiency.

You thrive in collaborative environments, working closely with engineering and product teams to integrate observability, reliability, and security into the software development lifecycle. Your experience managing incidents and driving post-mortem analyses has equipped you with the skills to minimize impact and enhance system resilience.

You are passionate about leveraging technology to solve complex business problems and are committed to promoting reliability and resilience practices throughout the organization. You understand the importance of monitoring and alerting, and you have experience developing automation tools to streamline these processes.

Desirable

Experience with infrastructure as code tools like Terraform or CloudFormation is a plus. Familiarity with monitoring tools such as Prometheus or Grafana will help you excel in this role. You are also open to learning new technologies and methodologies to continuously improve your skills and the team's performance.

What you'll do

In this role, you will design, implement, and maintain scalable and reliable infrastructure to support Netflix's Streaming Suite. You will collaborate with engineering and product teams to integrate observability, reliability, and security considerations into the entire software development lifecycle. Your responsibilities will include developing and implementing automation tools for monitoring, deployment, and incident response, ensuring that our systems are resilient and performant.

You will lead efforts to improve system reliability and performance, conducting regular reviews and assessments to identify areas for enhancement. Your role will involve managing incidents when they occur, coordinating with cross-functional teams to resolve issues swiftly and effectively. You will also promote best practices in reliability engineering across the organization, mentoring junior engineers and sharing your knowledge with the team.

What we offer

Netflix offers a dynamic work environment where you can make a significant impact on the reliability of our services. We provide competitive compensation and benefits, along with opportunities for professional growth and development. You will be part of a diverse and inclusive team that values collaboration and innovation, working together to deliver exceptional experiences for our members worldwide.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Netflix.

Similar Jobs You Might Like

Based on your interests and this role

Netflix

Site Reliability Engineer

Netflix📍 Washington - Remote

Netflix is seeking a Senior Site Reliability Engineer to ensure the reliability and scalability of the Netflix Ad Suite. You'll work with technologies like AWS, Docker, and Kubernetes to build resilient systems. This role requires strong experience in distributed systems and incident management.

🏠 RemoteSenior
1 month ago
Five9

Site Reliability Engineer

Five9📍 United States - Remote

Five9 is seeking a Site Reliability Engineer to build and maintain highly reliable, scalable systems. You'll work with AWS, Docker, and Linux to ensure service reliability and performance. This role requires a blend of software engineering and operational expertise.

🏠 RemoteMid-Level
2d ago
Twilio

Staff Engineer

Twilio📍 United States - Remote

Twilio is seeking a Staff Engineer to join their Data Substrate team, responsible for architecting scalable data solutions and mentoring engineers. You'll work with distributed systems and data technologies in a fully remote role.

🏠 RemoteSenior
18h ago
Coinbase

Site Reliability Engineer

Coinbase📍 United States - Remote

Coinbase is hiring a Site Reliability Engineer to support and extend existing CI/CD frameworks for IT services. You'll work with technologies like AWS, Docker, and Kubernetes. This position requires a strong background in reliability engineering and cloud infrastructure.

🏠 Remote
3d ago
Netflix

Site Reliability Engineer

Netflix📍 United States - Remote

Netflix is seeking a Senior Site Reliability Engineer to support live streaming events by ensuring cloud infrastructure stability and reliability. You'll work with technologies like AWS, Docker, and Kubernetes to handle API traffic during high-demand events.

🏠 RemoteSenior
1 month ago