
About Anthropic
Building safe and reliable AI systems for everyone
Key Highlights
- Headquartered in SoMa, San Francisco, CA
- Raised $29.3 billion in funding, including $13 billion Series F
- Over 1,000 employees focused on AI safety and research
- Launched Claude, an AI chat assistant rivaling ChatGPT
Anthropic, headquartered in SoMa, San Francisco, is an AI safety and research company focused on developing reliable, interpretable, and steerable AI systems. With over 1,000 employees and backed by Google, Anthropic has raised $29.3 billion in funding, including a monumental Series F round of $13 b...
🎁 Benefits
Anthropic offers comprehensive health, dental, and vision insurance for employees and their dependents, along with inclusive fertility benefits via Ca...
🌟 Culture
Anthropic's culture is rooted in AI safety and reliability, with a focus on producing less harmful outputs than existing AI systems. The compan...
Overview
Anthropic is hiring an AI Engineer to enhance the reliability of AI systems. You'll work with technologies like Python, AWS, and Kubernetes to ensure robust service delivery. This position requires experience in AI reliability engineering.
Job Description
Who you are
You have a strong background in AI reliability engineering, with experience in developing Service Level Objectives for large language model serving systems. You understand the balance between availability, latency, and development velocity, and you have a knack for designing and implementing monitoring and observability systems across complex infrastructures.
You are skilled in incident response for critical AI services, ensuring rapid recovery and maintaining high availability across multiple regions and cloud providers. Your collaborative spirit allows you to work effectively with cross-functional teams, making you a key player in enhancing system reliability.
What you'll do
In this role, you will partner with various teams at Anthropic to improve the reliability of AI systems. You will develop and implement Service Level Objectives that align with the company's mission to create safe and beneficial AI. Your responsibilities will include designing monitoring systems that provide visibility into the performance of AI services and leading incident response efforts to ensure quick recovery from outages.
You will also collaborate with engineering teams to make the systems that deliver Claude, Anthropic's AI model, more robust. This involves working on high-availability serving infrastructure and ensuring the systems are resilient to failures. The role requires you to step back and examine the entire system architecture, identify areas for improvement, and implement solutions that improve reliability.
What we offer
Anthropic offers competitive compensation and benefits, including optional equity donation matching and generous vacation and parental leave. You will enjoy flexible working hours and a collaborative office environment in San Francisco, where you can work alongside a dedicated team of researchers and engineers committed to building beneficial AI systems.