
About Amazon
The everything store and cloud computing leader
Key Highlights
- Headquartered in South Lake Union, Seattle, WA
- Over 1.5 million employees worldwide
- Leading cloud services through Amazon Web Services (AWS)
- Acquired Whole Foods, Twitch, and Ring
Amazon, headquartered in South Lake Union, Seattle, WA, is the world's largest online retailer and a leader in cloud computing through Amazon Web Services (AWS). With over 1.5 million employees globally, Amazon operates in various sectors, including AI with its Alexa devices and a vast marketplace k...
🎁 Benefits
Amazon offers competitive salaries, stock options, generous PTO policies, and comprehensive health benefits. Employees also have access to a learning ...
🌟 Culture
Amazon's culture is driven by customer obsession and a focus on innovation. The company encourages employees to think big and move fast, fostering an ...
Skills & Technologies
Overview
Amazon is hiring a Machine Learning - Compiler Engineer II to develop the Neuron compiler for optimizing ML models on AWS Inferentia and Trainium. You'll work with technologies like Python, TensorFlow, and PyTorch, and the role requires experience in compiler optimization.
Job Description
Who you are
You have a strong background in machine learning and compiler design, with experience in optimizing performance for complex ML models. You are proficient in Python and have hands-on experience with frameworks such as TensorFlow, PyTorch, and JAX. Your understanding of deep learning architectures enables you to make informed decisions on compiler optimizations. You possess excellent technical communication skills, allowing you to collaborate effectively with internal teams and external stakeholders.
You are familiar with the intricacies of large language models and have a passion for solving challenging optimization problems. Your experience in object-oriented programming languages equips you to tackle the complexities of compiler development. You thrive in a collaborative environment and are eager to contribute to innovative solutions that democratize access to AI technologies.
What you'll do
In this role, you will be responsible for building the next generation Neuron compiler, transforming ML models for deployment on AWS Inferentia and Trainium servers. You will engage in solving hard compiler optimization problems to achieve optimal performance across various ML model families. Your work will directly impact the efficiency and usability of the Neuron SDK, making it easier for developers to leverage AI hardware.
You will collaborate closely with cross-functional teams, including hardware engineers and product managers, to bring new features to market. Your role will involve pre-silicon design discussions and ensuring that the Neuron compiler meets performance benchmarks. You will also be involved in technical documentation and user support, helping to guide users in utilizing the Neuron SDK effectively.
What we offer
At Amazon, you will be part of a team that is at the forefront of the AI revolution. We offer competitive compensation packages, including equity and comprehensive benefits. You will have the opportunity to work in a dynamic environment that encourages innovation and professional growth. Join us in making deep learning accessible to developers everywhere.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Amazon.
Similar Jobs You Might Like
Based on your interests and this role

Machine Learning Engineer
Amazon is hiring a Machine Learning Engineer for the AWS Neuron team to build next-generation compilers for deep learning models. You'll work with technologies like TensorFlow and PyTorch to optimize performance on custom chips. This position requires experience in compiler optimization and machine learning frameworks.

Machine Learning Engineer
Amazon is hiring a Machine Learning Engineer II to work on AWS Machine Learning accelerators. You'll develop a deep learning compiler stack and optimize performance for complex neural net models. This position requires experience with AWS and popular ML frameworks.

Machine Learning Engineer
Amazon is hiring a Senior Machine Learning Engineer to work on AWS Neuron, focusing on optimizing ML performance on custom-built hardware. You'll utilize AWS tools and frameworks like TensorFlow and PyTorch. This role requires significant experience in machine learning and compiler development.

Machine Learning Engineer
Amazon is hiring a Senior Machine Learning Engineer to work on AWS Neuron, developing a deep learning compiler stack for optimizing neural network models. You'll utilize AWS tools and frameworks like TensorFlow and PyTorch. This position requires significant experience in machine learning and compiler development.

Ml Kernel Performance Engineer
Amazon is hiring an ML Kernel Performance Engineer to optimize performance for AWS's custom ML accelerators. You'll work with technologies like AWS, Python, TensorFlow, and PyTorch. This position requires expertise in machine learning and high-performance computing.