
About OpenAI
Empowering humanity through safe AI innovation
Key Highlights
- Headquartered in San Francisco, CA with 1,001+ employees
- $68.9 billion raised in funding from top investors
- Launched ChatGPT, gaining 1 million users in 5 days
- 20-week paid parental leave and unlimited PTO policy
OpenAI is a leading AI research and development platform headquartered in the Mission District of San Francisco, CA. With over 1,001 employees, OpenAI has raised $68.9 billion in funding and is known for its groundbreaking products like ChatGPT, which gained over 1 million users within just five day...
🎁 Benefits
OpenAI offers flexible work hours and encourages unlimited paid time off, promoting at least 4 weeks of vacation per year. Employees enjoy comprehensi...
🌟 Culture
OpenAI's culture is centered around its mission to ensure that AGI benefits all of humanity. The company values transparency and ethical consideration...
Skills & Technologies
Overview
OpenAI is hiring a Software Engineer for their Infrastructure Reliability team to design and operate reliable systems that support cutting-edge AI research. You'll work with technologies like Java, Python, AWS, and Docker. This position requires a strong background in system reliability and performance optimization.
Job Description
Who you are
You have a solid background in software engineering with a focus on infrastructure reliability — you've designed and built systems that prioritize performance and security, ensuring they can scale effectively. Your experience includes working with cloud technologies and container orchestration, allowing you to manage complex infrastructures with ease.
You thrive in collaborative environments where you can take ownership of projects — you enjoy solving deep technical problems and have a knack for identifying performance bottlenecks. Your ability to communicate effectively with cross-functional teams helps you turn complex challenges into reliable solutions.
You are passionate about automation and continuously seek ways to improve internal tooling and developer experience — you understand the importance of reducing manual work and enhancing system resilience. Your proactive approach to incident response and postmortems ensures that you learn from challenges and improve systems over time.
Desirable
Experience with database systems and online storage solutions is a plus — you understand how to optimize data access and storage for high-performance applications. Familiarity with observability tools and practices will help you ensure that systems are not only reliable but also easy to monitor and troubleshoot.
What you'll do
In this role, you will design, build, and operate reliable systems used across engineering teams — your work will directly impact the performance and safety of AI systems like ChatGPT and the OpenAI API. You will identify and fix performance bottlenecks, ensuring that our infrastructure can scale to meet the demands of millions of users.
You will collaborate closely with infrastructure, product, and research teams to shape the technical direction of our systems — your insights will help turn complex infrastructure into reliable platforms that support cutting-edge research. You will also contribute to incident response efforts, ensuring that systems remain operational and resilient.
Your role will involve continuously improving automation processes to reduce manual work — you will enhance internal tooling and developer experience, making it easier for teams to deploy and manage applications. You will dig deep to resolve complex issues, leveraging your technical expertise to ensure high reliability and performance.
What we offer
At OpenAI, you will be part of a mission-driven organization that believes in the potential of AI to solve global challenges — your work will contribute to shaping the future of technology. We offer a collaborative and inclusive work environment where you can grow your skills and make a meaningful impact.
We encourage you to apply even if your experience doesn't match every requirement — we value diverse perspectives and are committed to building a team that reflects a variety of backgrounds and experiences. Join us in our mission to ensure that the benefits of AI are widely shared.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at OpenAI.
Similar Jobs You Might Like
Based on your interests and this role

Software Engineering
OpenAI is hiring a Software Engineer specializing in Reliability to ensure the performance and scalability of their systems. You'll work with Python, JavaScript, and AWS to build resilient infrastructure. This position requires experience in engineering and problem-solving skills.

Backend Engineer
Doppel is hiring a Backend Engineer to build the infrastructure for their AI-native social engineering defense platform. You'll work with technologies like Elasticsearch and Kubernetes to design scalable systems. This position requires experience in backend engineering and infrastructure management.

Infrastructure Engineer
Middesk is hiring an Infrastructure Engineer to join their DevSecOps team. You'll build tooling and platform capabilities to enhance software delivery and developer experience. This position requires experience with infrastructure-as-code tools and high availability systems.

Software Engineering
Baseten is hiring a Software Engineer - Infrastructure to build and maintain components of their ML inference platform. You'll work with Python, Go, and Kubernetes to enable developers to deploy and monitor ML models. This position requires experience in infrastructure development.

Site Reliability Engineer
Okta is seeking a Senior Manager for Site Reliability Engineering to lead the Infrastructure Platform team. You'll oversee initiatives in Edge networking, Kubernetes, and DevOps transformation, leveraging skills in AWS and automation. This role requires significant technical leadership experience.