About OpenAI

Empowering humanity through safe AI innovation

🏢 Tech👥 1001+ employees📅 Founded 2015📍 Mission District, San Francisco, CA💰 $68.9b⭐ 4.2

B2CB2BArtificial IntelligenceEnterpriseSaaSAPIDevOps

Key Highlights

Headquartered in San Francisco, CA with 1,001+ employees
$68.9 billion raised in funding from top investors
Launched ChatGPT, gaining 1 million users in 5 days
20-week paid parental leave and unlimited PTO policy

OpenAI is a leading AI research and development platform headquartered in the Mission District of San Francisco, CA. With over 1,001 employees, OpenAI has raised $68.9 billion in funding and is known for its groundbreaking products like ChatGPT, which gained over 1 million users within just five day...

🎁 Benefits

OpenAI offers flexible work hours and encourages unlimited paid time off, promoting at least 4 weeks of vacation per year. Employees enjoy comprehensi...

🌟 Culture

OpenAI's culture is centered around its mission to ensure that AGI benefits all of humanity. The company values transparency and ethical consideration...

🌐 Website 💼 LinkedIn 𝕏 Twitter All 499 jobs →

Software Engineering • Mid-Level

OpenAI • San Francisco - Hybrid

Posted 6 months ago🏢 Hybrid Mid-Level Software Engineering 📍 San Francisco

Apply Now →

Skills & Technologies

python machine learning reinforcement learning

Overview

OpenAI is hiring a Software Engineer for the Applied Evals team to design and build evaluation systems for advanced AI models. You'll work with Python and machine learning techniques to improve model reliability and user experience. This position requires a product-minded approach and experience in AI systems.

Job Description

Who you are

You have a strong background in software engineering, particularly in designing and building systems that evaluate AI models. Your experience includes working with real-world quality metrics and integrating them into training stacks, ensuring that the systems you create are reliable and effective. You thrive in collaborative environments, working closely with research and product teams to drive model improvements. Your builder's mindset allows you to prototype quickly and create reusable systems that others can extend. You are comfortable with both backend pipelines and user-facing interfaces, bridging the gap between technical and product-oriented work.

You bring experience in reinforcement learning and related methods, applying these techniques in production settings to enhance AI systems. Your judgment in creating scalable solutions is complemented by your ability to take initiative and create structure in ambiguous situations. You understand the importance of user feedback and are adept at turning complex workflows into clear, actionable signals for model training.

What you'll do

In this role, you will own the development of evaluation systems that capture real-world quality for advanced AI systems. You will collaborate with cross-functional teams to design and implement evals and harnesses that guide model training and product quality. Your work will directly influence how models behave and improve their reliability, raising the standard for user expectations. You will prototype with users to gather feedback and iterate on your designs, ensuring that the systems you build are both effective and user-friendly.

You will be responsible for evaluating multi-turn and tool-using systems, applying your knowledge of reinforcement learning to enhance their performance. Your role will involve building reliable pipelines that integrate evaluation signals into training stacks, contributing to a compounding loop of model improvement. You will work closely with engineers and researchers, fostering a collaborative environment that encourages innovation and rapid iteration. Your contributions will help shape the future of AI technology, making a significant impact on how users interact with advanced systems.

What we offer

At OpenAI, you will be part of a mission-driven team that believes in the potential of artificial intelligence to solve global challenges. We offer a hybrid work model, allowing you to balance in-office collaboration with remote work flexibility. You will have the opportunity to work on cutting-edge AI technologies and contribute to projects that have a meaningful impact on society. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds in our team.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at OpenAI.

Apply Now →Get Job Alerts

✨

Similar Jobs You Might Like

Based on your interests and this role

Software Engineering

Mercor•📍 San Francisco - On-Site

Mercor is hiring a Software Engineer specializing in Applied AI to build and operate systems that bridge AI research and data delivery. You'll work with Python and Machine Learning technologies in San Francisco. This role requires a strong technical background and experience in data engineering.

🏛️ On-SiteMid-Level

2w ago

Ai Research Engineer

Exa•📍 San Francisco - On-Site

Exa is hiring an AI Research Engineer to design and build evaluation systems for their AI-driven search engine. You'll work with Python and Rust to develop comprehensive evaluation strategies. This position requires hands-on ML experience and strong engineering fundamentals.

🏛️ On-SiteMid-Level

4 months ago

Ai Research Engineer

OpenAI•📍 San Francisco - On-Site

OpenAI is hiring an AI Research Engineer to develop ambitious environments for measuring and steering AI models. You'll work with statistical analysis and reinforcement learning techniques in San Francisco.

🏛️ On-SiteMid-Level

10 months ago

Ai Research Engineer

OpenAI•📍 San Francisco - On-Site

OpenAI is hiring a Research Engineer in Applied AI Engineering to design and deploy advanced machine learning models that solve real-world problems. You'll work with technologies like Python and TensorFlow in San Francisco.

🏛️ On-SiteMid-Level

1 year ago

Staff Engineer

Waymo•📍 Mountain View - Hybrid

Waymo is hiring a Staff Software Engineer for their Quantitative Evaluation team to develop signals that measure the performance of the Waymo driver. You'll work with techniques including statistics, algorithms, and machine learning. This role requires experience in data-driven decision making.

🏢 HybridSenior

1w ago

Browse all jobs →