
About Baseten
Simplifying machine learning for every organization
Key Highlights
- Headquartered in Union Square, San Francisco, CA
- $285 million raised in Series C funding
- Team growth of 3x over the last five years
- Unlimited PTO with a company-wide holiday break
Baseten is a machine learning application builder headquartered in Union Square, San Francisco, CA. With $285 million in funding from investors like Coatue Management and Founders Fund, Baseten simplifies AI integration for businesses, enabling data scientists to deploy ML models without needing spe...
🎁 Benefits
Baseten offers a remote-first work environment with a $1,000 stipend for home office setup, unlimited PTO with a company-wide break during the holiday...
🌟 Culture
Baseten's culture emphasizes simplifying complex AI technologies for businesses, fostering a collaborative environment where team members can connect ...
Skills & Technologies
Overview
Baseten is hiring a Forward Deployed SRE to ensure the smooth deployment and performance of ML workloads for strategic customers. You'll work with technologies like Kubernetes and AWS, requiring expertise in infrastructure and debugging. This position is ideal for someone with a strong background in machine learning and customer-facing roles.
Job Description
Who you are
You have a strong background in site reliability engineering with hands-on experience in managing and deploying machine learning workloads. Your expertise in Kubernetes and AWS allows you to ensure the reliability and performance of critical systems, and you are comfortable debugging complex infrastructure issues. You possess excellent communication skills, enabling you to effectively collaborate with both technical teams and executive-level stakeholders.
You have a solid understanding of machine learning principles and can monitor AI model performance effectively. Your experience includes diagnosing runtime issues related to latency, memory behavior, and GPU utilization, ensuring that systems run smoothly and efficiently. You thrive in a collaborative environment and are eager to partner with product and engineering teams to drive adoption and success for high-value accounts.
Desirable
Experience with cloud infrastructure and a deep understanding of AI model lifecycle management would be beneficial. Familiarity with debugging tools and performance monitoring solutions is a plus, as is a proactive approach to identifying patterns that can lead to product improvements.
What you'll do
In this role, you will serve as the primary technical owner for Baseten's most strategic customers, ensuring the successful deployment and operation of machine learning workloads. You will manage and resolve escalations, maintaining and improving runbooks to enhance operational efficiency. Your responsibilities will include diagnosing and resolving runtime issues, collaborating with engineering teams to remove technical friction, and driving the adoption of Baseten's platform.
You will work closely with product and engineering teams to identify areas for improvement and ensure that customer needs are met. Your role will involve hands-on debugging and infrastructure expertise, allowing you to address issues related to Kubernetes, networking, and other critical components. You will also be responsible for monitoring AI model performance and providing insights that can lead to product enhancements.
What we offer
At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants. Join us in building a platform that empowers engineers to ship AI products effectively. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Baseten.
Similar Jobs You Might Like
Based on your interests and this role

Support Engineer
OpenAI is hiring an AI Support Engineer to enhance customer experience and resolve complex technical issues. You'll work closely with cross-functional teams to improve operational processes and support customer adoption of AI technologies.

Ai Solutions Engineer
Baseten is hiring an AI Solutions Engineer to architect, build, and deploy high-scale production AI applications. You'll work with technologies like Python and AWS, and engage directly with customers to translate business goals into reliable services.

Machine Learning Engineer
Tonal is hiring a Staff Machine Learning Engineer to design and implement intelligent systems that enhance coaching and personalize workouts. You'll work with advanced AI technologies and large datasets in San Francisco.

Ai Engineer
Factory is hiring an AI Engineer to design and develop innovative AI systems that enhance productivity and innovation. You'll work with emerging AI research and integrate it into customer-centric solutions. This role requires 2+ years of experience in AI/ML engineering.

Ai Engineer
Collate is hiring an AI Engineer to build and productionize models for their AI document generation platform in life sciences. You'll work at the intersection of machine learning and software engineering. This role requires expertise in AI and Python.