About Baseten

Simplifying machine learning for every organization

🏢 Tech👥 21-100 employees📍 Union Square, San Francisco, CA💰 $285m

B2BAnalyticsBusiness IntelligenceMachine Learning

Key Highlights

Headquartered in Union Square, San Francisco, CA
$285 million raised in Series C funding
Team growth of 3x over the last five years
Unlimited PTO with a company-wide holiday break

Baseten is a machine learning application builder headquartered in Union Square, San Francisco, CA. With $285 million in funding from investors like Coatue Management and Founders Fund, Baseten simplifies AI integration for businesses, enabling data scientists to deploy ML models without needing spe...

🎁 Benefits

Baseten offers a remote-first work environment with a $1,000 stipend for home office setup, unlimited PTO with a company-wide break during the holiday...

🌟 Culture

Baseten's culture emphasizes simplifying complex AI technologies for businesses, fostering a collaborative environment where team members can connect ...

🌐 Website 💼 LinkedIn 𝕏 Twitter All 38 jobs →

Software Engineering

Baseten • San Francisco

Posted 1 year agoSoftware Engineering 📍 San Francisco

Apply Now →

Skills & Technologies

python pytorch tensorflow

Overview

Baseten is hiring a Software Engineer focused on Model Performance to enhance AI model inference. You'll work with technologies like Python, PyTorch, and TensorFlow in San Francisco.

Job Description

Who you are

You have a strong background in software engineering with a focus on machine learning performance — your experience includes optimizing ML models and working with open-source frameworks. You thrive in collaborative environments and are eager to contribute to innovative AI solutions. Your technical expertise allows you to dive deep into codebases and implement cutting-edge techniques for model inference.

You are familiar with various ML optimization techniques such as quantization and speculative decoding — you understand the importance of performance in deploying AI models at scale. Your passion for artificial intelligence drives you to stay updated with the latest advancements in the field, and you are excited about the potential of large language models (LLMs).

What you'll do

As a Software Engineer at Baseten, you will be responsible for implementing and refining techniques that enhance ML model performance — this includes working on projects like the Baseten Embeddings Inference and the Baseten Inference Stack. You will collaborate with a dynamic team to drive model performance optimization and ensure that the infrastructure supports high-speed inference.

You will engage with the underlying codebases of frameworks such as TensorRT and PyTorch, contributing to the development of efficient solutions for AI applications. Your role will involve productionizing advanced techniques that improve the speed and efficiency of model inference, making significant contributions to the company's mission of enabling AI companies to bring their models into production seamlessly.

What we offer

At Baseten, we provide a supportive and inclusive work environment where innovation thrives. You will have the opportunity to work on mission-critical projects that impact leading AI companies. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds. Join us in shaping the future of AI technology.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Baseten.

Apply Now →Get Job Alerts

✨

Similar Jobs You Might Like

Based on your interests and this role

Engineering Manager

Baseten•📍 San Francisco

Baseten is hiring an Engineering Manager focused on Model Performance to lead a team optimizing ML model inference. You'll work with cutting-edge AI technologies in San Francisco.

Lead

1 year ago

Software Engineering

Baseten•📍 San Francisco

Baseten is hiring a Software Engineer focused on Model APIs to design and optimize infrastructure for AI models. You'll work with technologies like CUDA and TensorRT in San Francisco.

4 months ago

Software Engineering

OpenAI•📍 San Francisco - On-Site

OpenAI is hiring a Software Engineer for their Model Inference team to optimize AI models for high-volume production environments. You'll work with Azure and Python to enhance model performance and efficiency. This position requires 5+ years of experience in software engineering.

🏛️ On-SiteMid-Level

1 year ago

Software Engineering

Baseten•📍 San Francisco - On-Site

Baseten is hiring a Senior Software Engineer - Model Training to build infrastructure for large-scale training of foundation models. You'll work with technologies like Python and TensorFlow to optimize GPU utilization and create scalable pipelines. This position requires significant experience in software engineering and machine learning.

🏛️ On-SiteSenior

5 months ago

Software Engineering

Baseten•📍 Vancouver

Baseten is seeking an Entry-Level Software Engineer to build automated performance benchmarking tools for AI infrastructure. You'll work with technologies like Python and focus on high-performance computing and machine learning. This role is ideal for early-career engineers looking to dive deep into AI systems.

Entry-Level

1 month ago

Browse all jobs →