Anthropic

About Anthropic

Building safe and reliable AI systems for everyone

🏢 Tech👥 1001+ employees📅 Founded 2021📍 SoMa, San Francisco, CA💰 $29.3b4.5
B2BArtificial IntelligenceDeep TechMachine LearningSaaS

Key Highlights

  • Headquartered in SoMa, San Francisco, CA
  • Raised $29.3 billion in funding, including $13 billion Series F
  • Over 1,000 employees focused on AI safety and research
  • Launched Claude, an AI chat assistant rivaling ChatGPT

Anthropic, headquartered in SoMa, San Francisco, is an AI safety and research company focused on developing reliable, interpretable, and steerable AI systems. With over 1,000 employees and backed by Google, Anthropic has raised $29.3 billion in funding, including a monumental Series F round of $13 b...

🎁 Benefits

Anthropic offers comprehensive health, dental, and vision insurance for employees and their dependents, along with inclusive fertility benefits via Ca...

🌟 Culture

Anthropic's culture is rooted in AI safety and reliability, with a focus on producing less harmful outputs compared to existing AI systems. The compan...

Anthropic

Research Scientist Senior

AnthropicSan Francisco - Remote

Posted 1d ago🏠 RemoteSeniorResearch Scientist📍 San Francisco💰 $340,000 - $425,000 / yearly
Apply Now →

Overview

Anthropic is seeking a Senior Research Scientist to lead research on reward models for AI systems. You'll focus on improving how models learn human preferences and collaborate with various teams to enhance model capabilities. This role requires expertise in machine learning and reinforcement learning.

Job Description

Who you are

You have a strong background in research, particularly in the fields of machine learning and reinforcement learning — your experience includes developing novel architectures and methodologies that push the boundaries of AI capabilities. You are adept at working with large language models and understand the intricacies of aligning AI systems with human values. Your collaborative spirit shines through as you work closely with cross-functional teams, ensuring that your research translates into practical improvements in production systems.

You possess a deep understanding of reward modeling and have experience in evaluating and grading AI systems — your knowledge extends to identifying and mitigating reward hacking, which is crucial for developing safe and beneficial AI. You are driven by a mission to create interpretable and steerable AI systems that are reliable and beneficial for society. You thrive in environments that challenge you to think critically and creatively about complex problems.

Desirable

Experience with large-scale AI systems and a track record of publishing in reputable conferences or journals would be advantageous. Familiarity with tools and frameworks used in AI research, such as TensorFlow or PyTorch, is also a plus. You are comfortable navigating the challenges of AI alignment and are eager to contribute to the advancement of this field.

What you'll do

As a Senior Research Scientist at Anthropic, you will lead research efforts focused on reward models, shaping how our AI systems understand and optimize for human preferences. Your work will involve developing innovative architectures and training methodologies for reinforcement learning from human feedback (RLHF). You will explore new approaches to evaluation and grading of AI systems, including rubric-based methods, to ensure that our models align closely with human values.

Collaboration is key in this role — you will work alongside teams in Finetuning, Alignment Science, and our broader research organization to ensure that your research leads to tangible improvements in model capabilities and safety. You will have access to cutting-edge models and significant computational resources, allowing you to tackle some of the most pressing challenges in AI alignment.

Your research will not only advance the science of AI but also contribute to the practical deployment of these systems in real-world applications. You will drive ambitious research agendas while also ensuring that your findings are implemented effectively in production systems. This role offers the opportunity to work on critical open problems in AI, making a meaningful impact on the future of technology.

What we offer

At Anthropic, we are committed to creating a supportive and collaborative work environment. We offer competitive compensation and benefits, including optional equity donation matching, generous vacation and parental leave, and flexible working hours. Our office in San Francisco provides a lovely space for collaboration, and we encourage a culture of open communication and teamwork. You will be part of a mission-driven organization focused on building beneficial AI systems that prioritize safety and reliability for users and society as a whole.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Anthropic.

Similar Jobs You Might Like

Based on your interests and this role

Anthropic

Ai Research Engineer

Anthropic📍 San Francisco - Remote

Anthropic is seeking an AI Research Engineer for their Reward Models Platform to automate research workflows and build scalable tools for model training. This role requires collaboration with researchers and a focus on optimizing reward methodologies.

🏠 Remote
1d ago
OpenAI

Research Scientist

OpenAI📍 San Francisco - Hybrid

OpenAI is hiring a Research Scientist for their Synthetic RL team to develop novel reinforcement learning techniques using synthetic environments. You'll collaborate with engineers and researchers to design experiments and improve large-scale models.

🏢 HybridMid-Level
1 month ago
Imbue

Ai Research Engineer

Imbue📍 San Francisco - On-Site

Imbue is hiring a Research Engineer to build AI systems that power their flagship product, Sculptor. You'll work on creating open coding agents to enhance software creation. This role requires a blend of research excellence and pragmatic engineering skills.

🏛️ On-SiteMid-Level
4 years ago
OpenAI

Research Scientist

OpenAI📍 San Francisco

OpenAI is hiring a Research Scientist to develop innovative machine learning techniques and advance the research agenda. You'll collaborate with peers across the organization and contribute to impactful research problems.

10 months ago
Waymo

Research Scientist

Waymo📍 Mountain View - Hybrid

Waymo is seeking a Senior Research Scientist to conduct applied foundation model research and development in autonomous driving technology. You'll work with Python and deep learning frameworks like TensorFlow and PyTorch. This role requires a Master's or PhD in deep learning and 3+ years of experience.

🏢 HybridSenior
1w ago