
About Google
Empowering the world through technology and information
Key Highlights
- Over 100,000 employees globally
- Headquartered in Mountain View, California
- Parent company Alphabet Inc. valued at $1.5 trillion
- Google Cloud Platform serves millions of customers
Google LLC, headquartered in Mountain View, California, is a global leader in internet-related services and products, including its flagship search engine, Google Search, and the Android operating system. With over 100,000 employees, Google also offers cloud computing services through Google Cloud P...
🎁 Benefits
Google offers competitive salaries, equity options, generous PTO policies, comprehensive health benefits, and a remote work policy that allows flexibi...
🌟 Culture
Google is known for its engineering-first culture, emphasizing innovation and collaboration. The company fosters a unique environment that encourages ...
Skills & Technologies
Overview
Google is hiring a Senior Site Reliability Engineer to ensure the reliability and performance of Google Cloud's services. You'll work with distributed systems and automation to optimize existing systems. This position requires 8 years of software development experience and expertise in large-scale systems.
Job Description
Who you are
You have a Bachelor's degree in Computer Science or a related field, along with 8 years of experience in software development across various programming languages. Your background includes at least 3 years of leading projects and designing, analyzing, and troubleshooting distributed systems. You possess a systematic problem-solving approach and have effective verbal and written communication skills, which are essential for collaborating with cross-functional teams.
Your expertise lies in working with computing, distributed systems, storage, or networking, and you have a strong ability to debug and optimize code. You understand the importance of automation in streamlining routine tasks and are committed to maintaining high reliability and performance in large-scale systems.
What you'll do
As a Senior Site Reliability Engineer at Google, you will combine software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. Your role will involve ensuring that Google Cloud's services meet reliability and uptime standards appropriate to customer needs while continuously improving performance. You will manage complex challenges unique to Google Cloud, leveraging your expertise in coding, algorithms, and large-scale system design.
You will be responsible for maintaining services once they are live by measuring and monitoring availability, latency, and overall system health. Your work will focus on optimizing existing systems, building infrastructure, and eliminating manual work through automation. You will practice sustainable incident response and conduct blameless postmortems to foster a culture of learning and improvement.
What we offer
At Google, you will be part of a culture that values intellectual curiosity and problem-solving. You will have the opportunity to work with cutting-edge technologies and contribute to projects that have a significant impact on the reliability of services used by millions. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds in our teams.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Google.
Similar Jobs You Might Like
Based on your interests and this role

Site Reliability Engineer
LearnUpon is hiring a Staff Site Reliability Engineer to enhance and scale their infrastructure. You'll work with AWS, Docker, and Linux to ensure performance and reliability. This role requires significant experience in site reliability engineering.

Site Reliability Engineer
Udemy is hiring a Staff Site Reliability Engineer to manage and evolve their infrastructure. You'll work with AWS, Kubernetes, and programming languages like Python and Golang. This role requires extensive knowledge of cloud technologies and infrastructure-as-code tools.