OpenAI

About OpenAI

Empowering humanity through safe AI innovation

🏢 Tech · 👥 1001+ employees · 📅 Founded 2015 · 📍 Mission District, San Francisco, CA · 💰 $68.9b · 4.2
B2C · B2B · Artificial Intelligence · Enterprise · SaaS · API · DevOps

Key Highlights

  • Headquartered in San Francisco, CA with 1,001+ employees
  • $68.9 billion raised in funding from top investors
  • Launched ChatGPT, gaining 1 million users in 5 days
  • 20-week paid parental leave and unlimited PTO policy

OpenAI is a leading AI research and development platform headquartered in the Mission District of San Francisco, CA. With more than 1,000 employees, OpenAI has raised $68.9 billion in funding and is known for its groundbreaking products like ChatGPT, which gained over 1 million users within just five day...

🎁 Benefits

OpenAI offers flexible work hours and encourages unlimited paid time off, promoting at least 4 weeks of vacation per year. Employees enjoy comprehensi...

🌟 Culture

OpenAI's culture is centered around its mission to ensure that AGI benefits all of humanity. The company values transparency and ethical consideration...

Overview

OpenAI is seeking an HPC Systems Engineer to design and operate large-scale HPC environments for consumer product development. You'll work with tools like Abaqus and Ansys to optimize simulation workloads. This role requires experience in managing high-performance computing systems.

Job Description

Who you are

You have a strong background in high-performance computing (HPC) systems engineering, with experience in designing and operating large-scale HPC environments that support complex simulations. Your expertise includes working with tools such as Abaqus, Ansys, and COMSOL, which are critical for product design and validation. You understand the intricacies of workload management and have experience optimizing scheduling and resource allocation in HPC clusters. You are familiar with both on-prem and hybrid-cloud environments, ensuring high availability and reliability in the systems you manage. You thrive in collaborative settings, working closely with product engineers and applied scientists to drive innovation and efficiency in product development.

Desirable

Experience with workload management systems such as IBM/Platform LSF and Slurm is a plus, as is familiarity with simulation-driven product development processes. You are comfortable balancing workloads across diverse product teams and have a keen eye for minimizing iteration latency. Your problem-solving skills enable you to tackle challenges related to license availability and system performance, so that product teams can iterate quickly and effectively.

What you'll do

In this role, you will architect, deploy, and operate large-scale HPC clusters that support simulation workloads critical to consumer product development. You will be responsible for managing environments with over 1,000 compute nodes, ensuring they are optimized for performance and reliability. Your day-to-day tasks will involve collaborating with consumer product and simulation engineers to enhance scheduling processes and reduce queue wait times. You will implement strategies for workload balancing and cluster federation, ensuring that diverse product teams can efficiently utilize the HPC resources available to them. Additionally, you will monitor system performance and make adjustments as necessary to maintain optimal operation.

What we offer

At OpenAI, you will be part of a team that is at the forefront of technology, working on projects that have a significant impact on the future of AI and consumer products. We offer a collaborative work environment where your contributions will directly influence product outcomes. You will have access to cutting-edge tools and technologies, and opportunities for professional growth and development. We believe in fostering a culture of innovation and encourage you to apply even if your experience doesn't match every requirement. Join us in shaping the future of technology and making a difference in the world.

Similar Jobs You Might Like

Based on your interests and this role

OpenAI

Software Engineering

OpenAI 📍 San Francisco - On-Site

OpenAI is hiring a Software Engineer for their GPU Infrastructure team to ensure the reliability and uptime of their compute fleet. You'll work with cutting-edge technologies in a high-performance computing environment. This position requires experience in system-level investigations and automation.

🏛️ On-Site · Mid-Level · 2w ago

Nebius AI

Systems Engineer

Nebius AI 📍 Amsterdam - On-Site

Nebius AI is seeking a Systems Engineer to support benchmarking of GPU platforms for machine learning and AI workloads. You'll work closely with hardware and development teams to evaluate GPU performance using technologies like CUDA. This position requires expertise in AI and deep learning frameworks.

🏛️ On-Site · Mid-Level · 18h ago

SpaceX

Systems Engineer

SpaceX 📍 Hawthorne - On-Site

SpaceX is hiring a High Performance Computing (HPC) Systems Engineer to manage HPC clusters and provide application support across engineering disciplines. You'll work with Linux systems and virtualization technologies in Hawthorne, CA.

🏛️ On-Site · Entry-Level · 1w ago

OpenAI

QA Engineer

OpenAI 📍 San Francisco - Hybrid

OpenAI is hiring a QA Software Engineer to ensure the quality and reliability of device software. You'll design automated test frameworks and integrate them into CI/CD pipelines. This role requires experience in software quality and automation.

🏢 Hybrid · Mid-Level · 1 month ago

The Exploration Company

HPC Engineer

The Exploration Company 📍 Turin - On-Site

The Exploration Company is hiring an HPC Engineer to design, build, and operate high-performance computing environments for space mission simulations. You'll work with AWS and GCP to manage HPC clusters and ensure optimal performance. This position requires experience in cloud environments and infrastructure automation.

🏛️ On-Site · Mid-Level · 2 months ago