
About Anthropic
Building safe and reliable AI systems for everyone
Key Highlights
- Headquartered in SoMa, San Francisco, CA
- Raised $29.3 billion in funding, including $13 billion Series F
- Over 1,000 employees focused on AI safety and research
- Launched Claude, an AI chat assistant rivaling ChatGPT
Anthropic, headquartered in SoMa, San Francisco, is an AI safety and research company focused on developing reliable, interpretable, and steerable AI systems. With over 1,000 employees and backed by Google, Anthropic has raised $29.3 billion in funding, including a monumental Series F round of $13 b...
🎁 Benefits
Anthropic offers comprehensive health, dental, and vision insurance for employees and their dependents, along with inclusive fertility benefits via Ca...
🌟 Culture
Anthropic's culture is rooted in AI safety and reliability, with a focus on producing less harmful outputs compared to existing AI systems. The compan...
Overview
Anthropic is hiring a Staff Machine Learning Engineer to design and implement reinforcement learning environments for Claude, their AI system. You'll work with Python and focus on virtual collaborator workflows. This position requires extensive experience in machine learning and AI systems.
Job Description
Who you are
You are an experienced Machine Learning Engineer with a strong background in Python programming. You have designed and implemented complex machine learning models and understand the intricacies of reinforcement learning. Your expertise allows you to create authentic training environments that enhance AI capabilities, particularly in virtual collaboration. You thrive in collaborative settings, working closely with product teams to keep AI training aligned with product features.
You have a deep understanding of data generation platforms and can build scalable systems for creating high-quality tasks. Your experience includes integrating real organizational data to develop robust training environments. You are adept at developing evaluation systems that maintain quality and prevent reward hacking, so that the AI's performance is reliable and effective.
What you'll do
In this role, you will design and implement reinforcement learning pipelines specifically targeted at virtual collaborator use cases, such as productivity and organizational navigation. You will build and scale a data creation platform that generates high-quality, open-ended tasks in collaboration with domain experts and crowdworkers. Your responsibilities will also include integrating real organizational data to create authentic training environments for Claude, enhancing its capabilities in document manipulation and co-creation.
You will partner directly with product teams to ensure that the training aligns with shipped features, providing insights and feedback to refine the AI's performance. Your work will be pivotal in transforming Claude into the best virtual collaborator, making a significant impact on how knowledge work is conducted within organizations.
What we offer
At Anthropic, we provide competitive compensation and benefits, including optional equity donation matching, generous vacation and parental leave, and flexible working hours. You will have the opportunity to work in a lovely office space in San Francisco, collaborating with a diverse team of researchers, engineers, and policy experts dedicated to building beneficial AI systems. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds.