
About Amazon
The everything store and cloud computing leader
Key Highlights
- Headquartered in South Lake Union, Seattle, WA
- Over 1.5 million employees worldwide
- Leading cloud services through Amazon Web Services (AWS)
- Acquired Whole Foods, Twitch, and Ring
Amazon, headquartered in South Lake Union, Seattle, WA, is the world's largest online retailer and a leader in cloud computing through Amazon Web Services (AWS). With over 1.5 million employees globally, Amazon operates in various sectors, including AI with its Alexa devices and a vast marketplace k...
🎁 Benefits
Amazon offers competitive salaries, stock options, generous PTO policies, and comprehensive health benefits. Employees also have access to a learning ...
🌟 Culture
Amazon's culture is driven by customer obsession and a focus on innovation. The company encourages employees to think big and move fast, fostering an ...
Overview
Amazon is hiring a Software Development Manager for the AWS Neuron Machine Learning Distributed Training team. You'll lead a team that designs and deploys machine learning products. The role requires expertise in AWS and distributed training frameworks.
Job Description
Who you are
You have a strong background in software development and machine learning, with at least 5 years of experience leading engineering teams. Your technical expertise allows you to navigate complex challenges in developing machine learning products, and you have a proven track record of delivering customer-facing solutions. You are motivated by results and have a passion for innovation in the field of machine learning.
Your experience includes working with frameworks such as PyTorch and JAX, and you are familiar with distributed training libraries like FSDP (Fully Sharded Data Parallel) and DDP (Distributed Data Parallel). You understand the intricacies of machine learning architectures, including Mixture-of-Experts (MoE) models, and are eager to enable models that leverage cutting-edge technology. You thrive in collaborative environments and enjoy solving challenging technical problems that have not been addressed before.
Desirable
Experience with cloud-scale machine learning accelerators and a deep understanding of the full development life cycle for integrations and extensions in machine learning are a plus. Familiarity with AWS services and a customer-centric approach to product development will set you apart.
What you'll do
As the Software Development Manager, you will lead a talented team of engineers and managers in the Machine Learning Distributed Training team. Your responsibilities will include designing, implementing, testing, and maintaining innovative software solutions that enhance service performance and durability. You will ensure that the right products are built and delivered to customers, collaborating across diverse teams and projects to have a significant impact on AWS's global customer base.
You will be involved in the full development life cycle, guiding your team through the complexities of machine learning product development. This includes enabling models using advanced architectures and ensuring support for key ML functionalities in a combined chip/software platform. You will also be responsible for mentoring your team members, fostering a culture of innovation and excellence.
Your role will require you to solve challenging technical problems at every layer of the stack, from design to deployment. You will work closely with cross-functional teams to ensure that the solutions you develop meet customer needs and drive business results. Your leadership will be crucial in navigating the evolving landscape of machine learning technologies and ensuring that your team remains at the forefront of innovation.
What we offer
At Amazon, we offer a competitive compensation package that includes equity, sign-on payments, and a comprehensive range of medical and financial benefits. You will have the opportunity to work in a dynamic environment where your contributions will directly impact the future of machine learning at AWS. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds in our teams.