
About Apple
The personal technology company redefining user experience
Key Highlights
- Market cap of $3 trillion as of 2022
- Over 1 billion active devices worldwide
- Comprehensive medical plans including mental healthcare
- Paid parental leave and gradual return-to-work program
Apple Inc. (NASDAQ: AAPL), headquartered in Cupertino, CA, is the world's most valuable company with a market capitalization of $3 trillion as of 2022. Known for its iconic products such as the iPhone, iPad, and Mac, Apple serves over 1 billion active devices globally. The company has a strong commi...
🎁 Benefits
Apple offers comprehensive medical plans covering physical and mental healthcare, paid parental leave, and a gradual return-to-work program. Employees...
🌟 Culture
Apple's culture emphasizes an obsessive focus on user experience and consumer privacy, setting it apart from competitors. The company promotes inclusi...
Skills & Technologies
Overview
Apple is hiring an ML Framework Engineer to work on high-performance, distributed inference of GenAI applications. You'll collaborate on GPU acceleration of ML training frameworks using PyTorch and JAX. This role requires a strong systems background and expertise in machine learning.
Job Description
Who you are
You have a strong systems background and a passion for building scalable, efficient, and production-grade solutions tailored for high-throughput GPU execution. Your experience includes working with machine learning frameworks and optimizing code for performance. You are familiar with advanced model optimization techniques such as speculation, quantization, and compression, which you apply to maximize throughput and minimize latency.
You possess deep knowledge of GPU programming and are excited about the opportunity to influence the design of compute and programming models in next-generation GPU architectures. Your collaborative spirit allows you to work closely with hardware, compiler, and systems teams to align efforts and achieve optimal performance across server hardware families.
Desirable
Experience with distributed compute strategies such as data, tensor, pipeline, and expert parallelism is a plus. Familiarity with custom-built server hardware and Apple silicon is also beneficial.
What you'll do
In this role, you will work on cutting-edge ML inference framework projects, optimizing code for efficient and scalable ML inference. You will develop kernel and compiler-level optimizations and perform in-depth analysis to ensure the best possible performance. Your responsibilities will include collaborating with various teams to align on project goals and deliver robust compute solutions for accelerating machine learning libraries on Apple Silicon.
You will have the opportunity to influence the design of compute and programming models in next-generation GPU architectures, ensuring that Apple remains at the forefront of machine learning technology. Your work will directly contribute to enabling Apple Intelligence through high-performance, distributed inference of GenAI applications on Private Cloud Compute.
What we offer
Apple provides a dynamic work environment where innovation thrives. You will be part of a team that is dedicated to pushing the boundaries of machine learning and GPU programming. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds. Join us in shaping the future of technology at Apple.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Apple.
Similar Jobs You Might Like
Based on your interests and this role

Machine Learning Engineer
Apple is hiring a Senior Machine Learning Engineer to lead the development of API design and performance analysis for Metal and Metal compute frameworks. You'll work with technologies like C, C++, and various GPU compute APIs. This position requires 5+ years of experience in related fields.

Ml Compute Acceleration Engineer
Apple is hiring an ML Compute Acceleration Engineer to develop high-performance compute solutions for machine learning networks on Apple Silicon. You'll work with GPU programming and optimization techniques in Cupertino.

Gpu Ml Engineer
Apple is hiring a GPU ML Engineer to develop high-performance data parallel algorithms for machine learning and image processing. You'll work with GPU programming and contribute to next-generation GPU architectures in Cupertino.

Machine Learning Engineer
Apple is hiring a Machine Learning Engineer to enhance software development productivity through AI-powered solutions. You'll work with NLP and computer vision technologies to create innovative testing tools. This role requires experience in machine learning and a collaborative mindset.

Machine Learning Engineer
Apple is hiring a Machine Learning Engineer to join their ML Research team. You'll work on advanced ML algorithms and systems, focusing on speech processing and dialog systems. This role requires expertise in Python and ML frameworks like PyTorch.