
About Anthropic
Building safe and reliable AI systems for everyone
Key Highlights
- Headquartered in SoMa, San Francisco, CA
- Raised $29.3 billion in funding, including $13 billion Series F
- Over 1,000 employees focused on AI safety and research
- Launched Claude, an AI chat assistant rivaling ChatGPT
Anthropic, headquartered in SoMa, San Francisco, is an AI safety and research company focused on developing reliable, interpretable, and steerable AI systems. With over 1,000 employees and backed by Google, Anthropic has raised $29.3 billion in funding, including a monumental Series F round of $13 b...
🎁 Benefits
Anthropic offers comprehensive health, dental, and vision insurance for employees and their dependents, along with inclusive fertility benefits via Ca...
🌟 Culture
Anthropic's culture is rooted in AI safety and reliability, with a focus on producing less harmful outputs compared to existing AI systems. The compan...
Overview
Anthropic is hiring a Senior AI Engineer to enhance the reliability of AI systems. You'll work with AWS, Python, and Kubernetes to develop monitoring systems and high-availability infrastructure. This position requires significant experience in AI reliability engineering.
Job Description
Who you are
You have 5+ years of experience in software engineering, particularly in AI reliability or related fields. You've developed and managed systems that ensure the reliability of AI models, and you understand the complexities involved in serving large-scale AI applications. Your expertise in Python and cloud services like AWS allows you to design robust infrastructure that can handle high traffic and deliver a seamless user experience.
You have a strong understanding of Kubernetes and Docker, enabling you to implement containerized applications that are scalable and maintainable. Your experience with monitoring tools like Prometheus and Grafana helps you build observability solutions that track system performance and reliability metrics. You are comfortable leading incident response and take a proactive approach to identifying and mitigating potential issues before they impact users.
What you'll do
In this role, you will develop Service Level Objectives (SLOs) for AI model serving and training systems, balancing availability and latency with development velocity. You will design and implement monitoring systems that track key performance indicators, ensuring that our AI services meet the highest reliability standards. You will help build high-availability infrastructure capable of supporting millions of users, collaborating with cross-functional teams to enhance system robustness.
You will lead the development of automated failover and recovery systems for model serving deployments across multiple regions and cloud providers. Your leadership will be crucial in keeping our AI systems operational and performant, even in the face of challenges. You will also mentor junior engineers, sharing your knowledge and fostering a culture of reliability within the team.
What we offer
At Anthropic, we provide competitive compensation and benefits, including optional equity donation matching and generous vacation policies. You will enjoy flexible working hours and a collaborative office environment that encourages innovation and teamwork. We are committed to creating a workplace where you can thrive and contribute to our mission of building safe and beneficial AI systems.