
About Google
Empowering the world through technology and information
Key Highlights
- Over 100,000 employees globally
- Headquartered in Mountain View, California
- Parent company Alphabet Inc. valued at $1.5 trillion
- Google Cloud Platform serves millions of customers
Google LLC, headquartered in Mountain View, California, is a global leader in internet-related services and products, including its flagship search engine, Google Search, and the Android operating system. With over 100,000 employees, Google also offers cloud computing services through Google Cloud P...
🎁 Benefits
Google offers competitive salaries, equity options, generous PTO policies, comprehensive health benefits, and a remote work policy that allows flexibi...
🌟 Culture
Google is known for its engineering-first culture, emphasizing innovation and collaboration. The company fosters a unique environment that encourages ...
Overview
Google is hiring a Site Reliability Engineer to ensure the reliability and uptime of Google Cloud services. You'll work with Java and Python to manage large-scale distributed systems. This position requires at least 2 years of experience in software development and systems engineering.
Job Description
Who you are
You have a Bachelor's degree in Computer Science or a related field, or equivalent practical experience. With at least 2 years of experience in software development, you are well-versed in data structures and algorithms, and you have a strong foundation in one or more programming languages, particularly Java and Python. Your experience includes working with distributed systems, storage, or networking, and you have a knack for designing, analyzing, and troubleshooting large-scale systems. You possess excellent problem-solving skills and can communicate effectively, both verbally and in writing.
You thrive in a culture that values intellectual curiosity and openness. You enjoy collaborating with diverse teams and are eager to tackle complex challenges unique to Google Cloud. Your ability to debug and optimize code, along with your experience in automating routine tasks, makes you a valuable asset to any team. You are committed to ensuring that systems are reliable and performant, and you take pride in your work.
What you'll do
As a Site Reliability Engineer at Google, you will combine software and systems engineering to build and run large-scale, fault-tolerant systems. Your primary responsibility will be to ensure that Google Cloud's services maintain the reliability and uptime that customers need. You will monitor system capacity and performance, and your software development efforts will focus on optimizing existing systems and building infrastructure. You will also automate routine tasks to eliminate unnecessary work, allowing your team to focus on more complex challenges.
You will participate in design reviews with peers and stakeholders, helping to decide among available technologies and ensuring that best practices are followed. You will triage product or system issues, then debug, track, and resolve them by analyzing their sources and their impact on hardware, network, or service operations. You will have the opportunity to manage the complexities of scale that are unique to Google Cloud, using your expertise in coding, algorithms, and large-scale system design to drive improvements.
What we offer
At Google, you will be part of a team that values collaboration and innovation. We encourage you to apply even if your experience doesn't match every requirement. You will have access to a wealth of resources and support to help you grow in your career. Our culture promotes continuous learning and development, and you will have the chance to work on impactful projects that shape the future of technology. Join us in our mission to make information universally accessible and useful.
Similar Jobs You Might Like

Site Reliability Engineer
Google is seeking a Site Reliability Engineer to ensure the reliability and performance of Google Cloud services. You'll work with Java, Python, and Linux to manage large-scale distributed systems. This role requires a Bachelor's degree and 2 years of software development experience.

Site Reliability Engineer
Google is hiring a Site Reliability Engineer to ensure the reliability and uptime of Google Cloud services. You'll work with distributed systems and automation tools to manage complex challenges at scale. This position requires 2 years of experience in software development and systems engineering.

Site Reliability Engineer
Google is hiring a Systems Engineer III for their Site Reliability Engineering team to ensure the reliability and performance of Google Cloud services. You'll work with Linux, Python, and networking concepts to manage large-scale distributed systems. This position requires 2+ years of relevant experience.

Site Reliability Engineer
Google is hiring a Staff Software Engineer for Site Reliability Engineering to ensure the reliability and performance of Google Cloud's services. You'll work with distributed systems and automation tools to optimize existing systems. This position requires 8 years of software development experience and expertise in troubleshooting large-scale systems.