About Baseten

Simplifying machine learning for every organization

🏢 Tech👥 21-100 employees📍 Union Square, San Francisco, CA💰 $285m

B2BAnalyticsBusiness IntelligenceMachine Learning

Key Highlights

Headquartered in Union Square, San Francisco, CA
$285 million raised in Series C funding
Team growth of 3x over the last five years
Unlimited PTO with a company-wide holiday break

Baseten is a machine learning application builder headquartered in Union Square, San Francisco, CA. With $285 million in funding from investors like Coatue Management and Founders Fund, Baseten simplifies AI integration for businesses, enabling data scientists to deploy ML models without needing spe...

🎁 Benefits

Baseten offers a remote-first work environment with a $1,000 stipend for home office setup, unlimited PTO with a company-wide break during the holiday...

🌟 Culture

Baseten's culture emphasizes simplifying complex AI technologies for businesses, fostering a collaborative environment where team members can connect ...

🌐 Website 💼 LinkedIn 𝕏 Twitter All 38 jobs →

Site Reliability Engineer • Mid-Level

Baseten • San Francisco - On-Site

Posted 3 months ago🏛️ On-Site Mid-Level Site Reliability Engineer 📍 San Francisco

Apply Now →

Skills & Technologies

kubernetes python machine learning aws docker

Overview

Baseten is hiring a Forward Deployed SRE to ensure the smooth deployment and performance of ML workloads for strategic customers. You'll work with technologies like Kubernetes and AWS, requiring expertise in infrastructure and debugging. This position is ideal for someone with a strong background in machine learning and customer-facing roles.

Job Description

Who you are

You have a strong background in site reliability engineering with hands-on experience in managing and deploying machine learning workloads. Your expertise in Kubernetes and AWS allows you to ensure the reliability and performance of critical systems, and you are comfortable debugging complex infrastructure issues. You possess excellent communication skills, enabling you to effectively collaborate with both technical teams and executive-level stakeholders.

You have a solid understanding of machine learning principles and can monitor AI model performance effectively. Your experience includes diagnosing runtime issues related to latency, memory behavior, and GPU utilization, ensuring that systems run smoothly and efficiently. You thrive in a collaborative environment and are eager to partner with product and engineering teams to drive adoption and success for high-value accounts.

Desirable

Experience with cloud infrastructure and a deep understanding of AI model lifecycle management would be beneficial. Familiarity with debugging tools and performance monitoring solutions is a plus, as is a proactive approach to identifying patterns that can lead to product improvements.

What you'll do

In this role, you will serve as the primary technical owner for Baseten's most strategic customers, ensuring the successful deployment and operation of machine learning workloads. You will manage and resolve escalations, maintaining and improving runbooks to enhance operational efficiency. Your responsibilities will include diagnosing and resolving runtime issues, collaborating with engineering teams to remove technical friction, and driving the adoption of Baseten's platform.

You will work closely with product and engineering teams to identify areas for improvement and ensure that customer needs are met. Your role will involve hands-on debugging and infrastructure expertise, allowing you to address issues related to Kubernetes, networking, and other critical components. You will also be responsible for monitoring AI model performance and providing insights that can lead to product enhancements.

What we offer

At Baseten, we are committed to fostering a diverse and inclusive workplace. We provide equal employment opportunities to all employees and applicants. Join us in building a platform that empowers engineers to ship AI products effectively. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Baseten.

Apply Now →Get Job Alerts

✨

Similar Jobs You Might Like

Based on your interests and this role

Support Engineer

OpenAI•📍 San Francisco - On-Site

OpenAI is hiring an AI Support Engineer to enhance customer experience and resolve complex technical issues. You'll work closely with cross-functional teams to improve operational processes and support customer adoption of AI technologies.

🏛️ On-SiteMid-Level

1 month ago

Ai Solutions Engineer

Baseten•📍 San Francisco - On-Site

Baseten is hiring an AI Solutions Engineer to architect, build, and deploy high-scale production AI applications. You'll work with technologies like Python and AWS, and engage directly with customers to translate business goals into reliable services.

🏛️ On-SiteMid-Level

3 months ago

Machine Learning Engineer

Tonal•📍 San Francisco - On-Site

Tonal is hiring a Staff Machine Learning Engineer to design and implement intelligent systems that enhance coaching and personalize workouts. You'll work with advanced AI technologies and large datasets in San Francisco.

🏛️ On-SiteStaff

3 months ago

Ai Engineer

Factory•📍 San Francisco - On-Site

Factory is hiring an AI Engineer to design and develop innovative AI systems that enhance productivity and innovation. You'll work with emerging AI research and integrate it into customer-centric solutions. This role requires 2+ years of experience in AI/ML engineering.

🏛️ On-SiteMid-Level

11 months ago

Ai Engineer

Collate•📍 San Francisco

Collate is hiring an AI Engineer to build and productionize models for their AI document generation platform in life sciences. You'll work at the intersection of machine learning and software engineering. This role requires expertise in AI and Python.

1 year ago

Browse all jobs →