
About Google
Empowering the world through technology and information
Key Highlights
- Over 100,000 employees globally
- Headquartered in Mountain View, California
- Parent company Alphabet Inc. valued at $1.5 trillion
- Google Cloud Platform serves millions of customers
Google LLC, headquartered in Mountain View, California, is a global leader in internet-related services and products, including its flagship search engine, Google Search, and the Android operating system. With over 100,000 employees, Google also offers cloud computing services through Google Cloud P...
Benefits
Google offers competitive salaries, equity options, generous PTO policies, comprehensive health benefits, and a remote work policy that allows flexibi...
Culture
Google is known for its engineering-first culture, emphasizing innovation and collaboration. The company fosters a unique environment that encourages ...
Skills & Technologies
Overview
Google is seeking a Senior Site Reliability Engineer to design, build, and maintain large-scale distributed systems. You'll work with technologies like Java, Python, and AWS to ensure reliability and performance. This role requires 5+ years of experience in software development and systems engineering.
Job Description
Who you are
You have a Bachelor's degree in Computer Science or a related field, or equivalent practical experience. With 5 years of experience in software development across various programming languages, you are well-versed in designing, analyzing, and troubleshooting large-scale distributed systems. Your 2 years of experience leading projects and providing technical leadership have equipped you with the skills to drive initiatives effectively. You understand that engineering solutions to design, build, and maintain efficient large-scale systems is crucial for success. You embody the Site Reliability Engineering (SRE) mindset, combining software and systems engineering to build and run fault-tolerant systems. Your approach to operations problems is creative, and you are committed to optimizing existing systems through automation.
Desirable
A Master's degree in Computer Science or Engineering is preferred, as is experience with large-scale systems. You are familiar with practices such as blameless postmortems and sustainable incident response, which are essential for maintaining system health and reliability. Your ability to measure and monitor availability, latency, and overall system health is second nature to you, and you are always looking for ways to scale systems sustainably through automation.
What you'll do
As a Senior Site Reliability Engineer at Google, you will be responsible for ensuring that our services have the reliability and uptime that users expect. You will design and implement solutions that improve system performance while keeping an eye on capacity and performance metrics. Your role will involve maintaining services once they are live, measuring and monitoring their health, and scaling systems sustainably. You will lead initiatives to push for changes that enhance reliability and velocity, practicing sustainable incident response and conducting blameless postmortems to learn from incidents.
You will collaborate with cross-functional teams to build infrastructure that eliminates manual work through automation. Your technical leadership will guide the team in adopting best practices for system reliability and performance. You will also engage in capacity planning and launch reviews, ensuring that our systems can handle the demands placed upon them. Your contributions will directly impact the efficiency and effectiveness of our operations, making you a key player in our mission to deliver reliable services to our users.
What we offer
At Google, we offer a dynamic work environment where innovation thrives. You will have the opportunity to work with cutting-edge technologies and collaborate with some of the brightest minds in the industry. We provide competitive compensation and benefits, including opportunities for professional growth and development. Join us in shaping the future of technology and making a significant impact on how services are delivered at scale.