OpenAI

About OpenAI

Empowering humanity through safe AI innovation

🏢 Tech👥 1001+ employees📅 Founded 2015📍 Mission District, San Francisco, CA💰 $68.9b4.2
B2CB2BArtificial IntelligenceEnterpriseSaaSAPIDevOps

Key Highlights

  • Headquartered in San Francisco, CA with 1,001+ employees
  • $68.9 billion raised in funding from top investors
  • Launched ChatGPT, gaining 1 million users in 5 days
  • 20-week paid parental leave and unlimited PTO policy

OpenAI is a leading AI research and development platform headquartered in the Mission District of San Francisco, CA. With over 1,001 employees, OpenAI has raised $68.9 billion in funding and is known for its groundbreaking products like ChatGPT, which gained over 1 million users within just five day...

🎁 Benefits

OpenAI offers flexible work hours and encourages unlimited paid time off, promoting at least 4 weeks of vacation per year. Employees enjoy comprehensi...

🌟 Culture

OpenAI's culture is centered around its mission to ensure that AGI benefits all of humanity. The company values transparency and ethical consideration...

Skills & Technologies

Overview

OpenAI is hiring a Software Engineer for their Fleet Hardware team to ensure the reliability and uptime of their compute fleet. You'll work with technologies like Python, Linux, and Docker to maintain high-performance systems. This position requires a strong focus on system-level investigations and automation.

Job Description

Who you are

You have a solid background in software engineering, with experience in maintaining large-scale systems — you've tackled challenges related to hardware reliability and uptime, ensuring that complex infrastructures run smoothly. Your expertise in Python and Linux allows you to develop automated solutions that enhance system performance and efficiency.

You thrive in environments where you can investigate deeply and solve intricate problems — your analytical mindset helps you identify issues before they escalate, and you enjoy building automation for detection and remediation at scale. You understand the importance of safety and reliability in AI deployment, and you are committed to responsible technology practices.

You are comfortable working with cutting-edge technologies and are eager to learn and adapt as new challenges arise — your curiosity drives you to explore innovative solutions that can improve the health of supercomputing infrastructures. You value collaboration and are excited to work with a team that empowers engineers with autonomy and ownership.

Desirable

Experience with container orchestration tools like Kubernetes is a plus — you understand how to manage and scale applications in cloud environments. Familiarity with monitoring and alerting systems will help you ensure that the compute fleet operates at peak performance.

What you'll do

As a Software Engineer on the Fleet Hardware team, you will be responsible for the reliability and uptime of OpenAI's compute fleet — your work will directly impact the performance of AI models and services. You will conduct comprehensive investigations into system-level issues, identifying root causes and implementing solutions that minimize hardware failures.

You will develop automated tools and scripts to monitor system health and performance, ensuring that any anomalies are detected and addressed promptly — your contributions will help maintain the efficiency of large-scale systems that support both internal research and external products like ChatGPT.

Collaboration is key in this role, as you will work closely with other engineers to troubleshoot and resolve complex issues — your ability to communicate effectively will facilitate cross-functional teamwork and drive successful outcomes. You will also have the opportunity to mentor junior engineers, sharing your knowledge and expertise to help them grow in their careers.

What we offer

At OpenAI, you will be part of a mission-driven team that is shaping the future of technology — we believe in the potential of AI to solve global challenges and are committed to responsible deployment. You will have access to cutting-edge resources and technologies, enabling you to push the boundaries of what is possible in AI and computing.

We offer a competitive salary and benefits package, along with opportunities for professional development and growth — your contributions will be recognized and valued as we work together to advance the field of artificial intelligence.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at OpenAI.

Similar Jobs You Might Like

Based on your interests and this role

OpenAI

Software Engineering

OpenAI📍 San Francisco - Hybrid

OpenAI is hiring a Software Engineer for their Fleet Management team to design and build systems managing cloud and bare-metal fleets. You'll work with technologies like Python, Linux, and Docker in San Francisco with a hybrid work model.

🏢 HybridMid-Level
3 months ago
Waymo

Staff Engineer

Waymo📍 San Francisco - On-Site

Waymo is seeking a Staff Software Engineer to design and develop frontend systems for fleet monitoring and platform functionalities. You'll work with TypeScript and Angular to build mission-critical tools. This role requires 8+ years of experience in frontend systems.

🏛️ On-SiteSenior
1w ago
OpenAI

Full Stack Engineer

OpenAI📍 San Francisco - Hybrid

OpenAI is hiring a Full Stack Engineer for their Fleet Scheduling team to design and develop web-based systems for managing AI workloads. You'll work with technologies like JavaScript, React, and Node.js in San Francisco. This position requires collaboration with researchers and infrastructure teams.

🏢 HybridMid-Level
1 year ago
OpenAI

Software Engineering

OpenAI📍 San Francisco - Hybrid

OpenAI is hiring a Software Engineer to help build and optimize the low-level stack for AI-native silicon. You'll work with Python and C++ in a hybrid role based in San Francisco.

🏢 Hybrid
3 months ago
Zoox

Support Engineer

Zoox📍 San Francisco - On-Site

Zoox is hiring a Senior Fleet Support Engineer to maximize the operational uptime and reliability of their regional robotaxi fleet. This role requires hands-on technical expertise to address complex vehicle system issues on-site.

🏛️ On-SiteSenior
3w ago