
About Google
Empowering the world through technology and information
Key Highlights
- Over 100,000 employees globally
- Headquartered in Mountain View, California
- Parent company Alphabet Inc. valued at $1.5 trillion
- Google Cloud Platform serves millions of customers
Google LLC, headquartered in Mountain View, California, is a global leader in internet-related services and products, including its flagship search engine, Google Search, and the Android operating system. With over 100,000 employees, Google also offers cloud computing services through Google Cloud P...
Benefits
Google offers competitive salaries, equity options, generous PTO policies, comprehensive health benefits, and a remote work policy that allows flexibi...
Culture
Google is known for its engineering-first culture, emphasizing innovation and collaboration. The company fosters a unique environment that encourages ...
Overview
Google is hiring a Software Engineer specializing in GPU Performance to optimize high-performance GPU kernels and influence the technical direction of the GPU software ecosystem. You'll work with technologies like CUDA and Triton. The role requires 2 years of experience in software development.
Job Description
Who you are
You have a Bachelor's degree or equivalent practical experience, along with at least 2 years of experience in software development using one or more programming languages. Your background includes low-level GPU programming with technologies such as CUDA, Triton, and CUTLASS, as well as performance engineering techniques. You are familiar with modern GPU architectures, including NVIDIA and AMD, and understand memory hierarchies and performance bottlenecks. Ideally, you hold a Master's degree or PhD in Computer Science or a related technical field, and have 2 years of experience with data structures and algorithms in either an academic or industry setting.
You possess experience with compiler optimization, code generation, and runtime systems for GPU architectures, particularly with tools like OpenXLA and MLIR. Your understanding of modern Large Language Models (LLMs) and their deployment on AI accelerators is a significant asset. You thrive in collaborative environments and are eager to influence the deployment of Google's GPU fleet by working with various product teams across the organization.
What you'll do
As a Software Engineer focused on GPU Performance, you will work on critical projects that enhance Google's GPU software stack. Your responsibilities will include optimizing high-performance GPU kernels and contributing to the design of ML compiler cost models. You will collaborate with teams across Google, including ML, compiler design, and systems architecture, to influence the technical direction of the GPU software ecosystem. Your role will involve tackling performance bottlenecks and ensuring that Google's products can handle information at massive scale.
You will have opportunities to switch teams and projects, allowing you to grow your skills and impact across the organization. Your contributions will directly affect how billions of users connect, explore, and interact with information and one another. You will be part of a team that values fresh ideas from various areas, including information retrieval, distributed computing, and artificial intelligence.
What we offer
At Google, you will be part of a culture that encourages innovation and collaboration. We provide competitive compensation and benefits, along with opportunities for professional growth and development. You will work in an environment that values diversity and inclusion, and where your contributions will have a meaningful impact on the technology that shapes the future. We encourage you to apply even if your experience doesn't match every requirement, as we believe in the potential of diverse teams to build better products.
Similar Jobs You Might Like
Based on your interests and this role

Software Engineering
Google is seeking a Senior Software Engineer specializing in GPU Performance to tackle performance bottlenecks and optimize GPU software. You'll work with technologies like CUDA and Triton. The role requires 5+ years of software development experience.

Performance Engineer
Anthropic is hiring a Senior Performance Engineer specializing in GPU to architect and implement foundational systems for AI. You'll focus on maximizing GPU utilization and performance, requiring deep experience in GPU programming and optimization.

Software Engineering
Waymo is hiring a Software Engineer specializing in GPU development to enhance their autonomous driving technology. You'll work with C++, Python, and OpenGL to develop high-performance GPU primitives. This position requires experience in GPU programming and system-level architecture.

GPU Performance Engineer
Genmo is seeking a GPU Performance Engineer to optimize their H100 infrastructure for video generation. You'll leverage advanced profiling tools and write high-performance CUDA kernels to achieve significant speedups. This role requires 5+ years of systems programming experience.

AI Research Engineer
Together AI is hiring a Systems Research Engineer specialized in GPU Programming to develop and optimize GPU-accelerated kernels for ML/AI applications. You'll collaborate with cross-functional teams and leverage your expertise in GPU programming and parallel computing. This role requires a strong background in GPU programming techniques.