About Amazon

The everything store and cloud computing leader

🏢 Tech👥 1001+ employees📅 Founded 1995📍 South Lake Union, Seattle, WA⭐ 3.7

B2CB2BMarketplaceCloud ComputingeCommerce

Key Highlights

Headquartered in South Lake Union, Seattle, WA
Over 1.5 million employees worldwide
Leading cloud services through Amazon Web Services (AWS)
Acquired Whole Foods, Twitch, and Ring

Amazon, headquartered in South Lake Union, Seattle, WA, is the world's largest online retailer and a leader in cloud computing through Amazon Web Services (AWS). With over 1.5 million employees globally, Amazon operates in various sectors, including AI with its Alexa devices and a vast marketplace k...

🎁 Benefits

Amazon offers competitive salaries, stock options, generous PTO policies, and comprehensive health benefits. Employees also have access to a learning ...

🌟 Culture

Amazon's culture is driven by customer obsession and a focus on innovation. The company encourages employees to think big and move fast, fostering an ...

🌐 Website 💼 LinkedIn 𝕏 Twitter All 11027 jobs →

Machine Learning Engineer • Mid-Level

Amazon • Cupertino - On-Site

Posted 11 months ago🏛️ On-Site Mid-Level Machine Learning Engineer 📍 Cupertino💰 $129,300 - $129,300 / yearly

Apply Now →

Skills & Technologies

python tensorflow pytorch jax

Overview

Amazon is hiring a Machine Learning - Compiler Engineer II to develop the Neuron compiler for optimizing ML models on AWS Inferentia and Trainium. You'll work with technologies like Python, TensorFlow, and PyTorch, and the role requires experience in compiler optimization.

Job Description

Who you are

You have a strong background in machine learning and compiler design, with experience in optimizing performance for complex ML models. You are proficient in Python and have hands-on experience with frameworks such as TensorFlow, PyTorch, and JAX. Your understanding of deep learning architectures enables you to make informed decisions on compiler optimizations. You possess excellent technical communication skills, allowing you to collaborate effectively with internal teams and external stakeholders.

You are familiar with the intricacies of large language models and have a passion for solving challenging optimization problems. Your experience in object-oriented programming languages equips you to tackle the complexities of compiler development. You thrive in a collaborative environment and are eager to contribute to innovative solutions that democratize access to AI technologies.

What you'll do

In this role, you will be responsible for building the next generation Neuron compiler, transforming ML models for deployment on AWS Inferentia and Trainium servers. You will engage in solving hard compiler optimization problems to achieve optimal performance across various ML model families. Your work will directly impact the efficiency and usability of the Neuron SDK, making it easier for developers to leverage AI hardware.

You will collaborate closely with cross-functional teams, including hardware engineers and product managers, to bring new features to market. Your role will involve pre-silicon design discussions and ensuring that the Neuron compiler meets performance benchmarks. You will also be involved in technical documentation and user support, helping to guide users in utilizing the Neuron SDK effectively.

What we offer

At Amazon, you will be part of a team that is at the forefront of the AI revolution. We offer competitive compensation packages, including equity and comprehensive benefits. You will have the opportunity to work in a dynamic environment that encourages innovation and professional growth. Join us in making deep learning accessible to developers everywhere.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Amazon.

Amazon•📍 Cupertino - On-Site

Amazon is hiring an ML Kernel Performance Engineer to optimize performance for AWS's custom ML accelerators. You'll work with technologies like AWS, Python, TensorFlow, and PyTorch. This position requires expertise in machine learning and high-performance computing.

🏛️ On-SiteMid-Level

9 months ago

Browse all jobs →