
About OpenAI
Empowering humanity through safe AI innovation
Key Highlights
- Headquartered in San Francisco, CA with 1,001+ employees
- $68.9 billion raised in funding from top investors
- Launched ChatGPT, gaining 1 million users in 5 days
- 20-week paid parental leave and unlimited PTO policy
OpenAI is a leading AI research and development company headquartered in the Mission District of San Francisco, CA. It has more than 1,000 employees, has raised $68.9 billion in funding, and is known for groundbreaking products like ChatGPT, which gained over 1 million users within just five day...
🎁 Benefits
OpenAI offers flexible work hours and unlimited paid time off, encouraging employees to take at least 4 weeks of vacation per year. Employees enjoy comprehensi...
🌟 Culture
OpenAI's culture is centered around its mission to ensure that AGI benefits all of humanity. The company values transparency and ethical consideration...
Overview
OpenAI is hiring a Distributed Training Engineer to enhance the training throughput of their internal framework for video models. You'll work with distributed systems and machine learning techniques in San Francisco. This role requires a strong background in optimizing performance and coding.
Job Description
Who you are
You have a solid background in machine learning and distributed systems, with experience optimizing the performance of complex models. You are detail-oriented and cannot stand having bugs in your code, which reflects your commitment to quality and reliability. You have a deep understanding of supercomputers and their performance metrics, and your passion for systems efficiency drives you to explore new solutions in AI. You enjoy collaborating with researchers to push the boundaries of video model capabilities and architectures, and your experience with multi-modal ML pipelines equips you to integrate various data types effectively.
What you'll do
In this role, you will collaborate closely with researchers to develop systems-efficient video models and architectures. You will apply the latest techniques to the internal training framework to achieve high hardware efficiency during training runs. Your responsibilities will include profiling and optimizing the framework, analyzing performance bottlenecks, and implementing solutions that improve throughput so that it meets the demands of cutting-edge research. Your work will directly contribute to the reliability and safety of OpenAI's video models and to the team's research and product goals. You will also have the opportunity to experiment with new ideas and technologies.
What we offer
OpenAI uses a hybrid work model with three days in the office per week and offers relocation assistance to new employees. You will join a team that values collaboration and innovation, working on projects with the potential to shape the future of technology. The company is committed to providing reasonable accommodations to applicants with disabilities as part of an inclusive work environment. Join us in our mission to use artificial intelligence to solve global challenges and share in the benefits of AI advancements.