
About Baseten
Simplifying machine learning for every organization
Key Highlights
- Headquartered in Union Square, San Francisco, CA
- $285 million raised in Series C funding
- Team growth of 3x over the last five years
- Unlimited PTO with a company-wide holiday break
Baseten is a machine learning application builder headquartered in Union Square, San Francisco, CA. With $285 million in funding from investors like Coatue Management and Founders Fund, Baseten simplifies AI integration for businesses, enabling data scientists to deploy ML models without needing spe...
🎁 Benefits
Baseten offers a remote-first work environment with a $1,000 stipend for home office setup, unlimited PTO with a company-wide break during the holiday...
🌟 Culture
Baseten's culture emphasizes simplifying complex AI technologies for businesses, fostering a collaborative environment where team members can connect ...
Skills & Technologies
Overview
Baseten is hiring a Senior Software Engineer - Model Training to build infrastructure for large-scale training of foundation models. You'll work with technologies like Python and TensorFlow to optimize GPU utilization and create scalable pipelines. This position requires significant experience in software engineering and machine learning.
Job Description
Who you are
You have 5+ years of experience in software engineering, particularly in building and maintaining complex systems. Your expertise in Python and machine learning frameworks like TensorFlow allows you to design and implement distributed training systems effectively. You understand the intricacies of GPU utilization and have experience optimizing performance in large-scale environments.
You thrive in collaborative settings, working closely with product and infrastructure teams to identify customer needs and translate them into technical solutions. Your strong problem-solving skills enable you to tackle challenges in scalable training infrastructure, ensuring reliability and efficiency in model training processes.
What you'll do
As a Senior Software Engineer – Model Training at Baseten, you will design and build the infrastructure that powers the training and fine-tuning of foundation models. You will implement scalable pipelines that facilitate efficient model adaptation for both Baseten and its customers. Your role will involve optimizing GPU utilization to enhance performance and reliability in model training.
You will take ownership of key components of the training stack, ensuring that the systems you build are robust and scalable. Collaborating with cross-functional teams, you will push the boundaries of what is possible in scalable training infrastructure, contributing to the advancement of AI technologies.
What we offer
At Baseten, you will be part of a forward-thinking team dedicated to innovation in AI. We provide a supportive environment that encourages professional growth and development. You will have the opportunity to work on cutting-edge projects that have a significant impact on the AI landscape. We value diversity and inclusion, fostering a workplace where everyone can thrive.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Baseten.
Similar Jobs You Might Like
Based on your interests and this role

Software Engineering
Baseten is hiring a Software Engineer focused on Model Performance to enhance AI model inference. You'll work with technologies like Python, PyTorch, and TensorFlow in San Francisco.

Software Engineering
Baseten is hiring a Senior Product Engineer for their Training Platform to develop features like multi-node training and serverless reinforcement learning. You'll work with Python and machine learning techniques in San Francisco.

Software Engineering
Databricks is hiring a Senior Software Engineer for their Model Serving team to design and build systems for deploying AI/ML models. You'll work with technologies like Python and focus on scalability and reliability in San Francisco.

Software Engineering
OpenAI is hiring a Software Engineer for their Model Inference team to optimize AI models for high-volume production environments. You'll work with Azure and Python to enhance model performance and efficiency. This position requires 5+ years of experience in software engineering.

Software Engineering
Baseten is hiring a Software Engineer focused on Model APIs to design and optimize infrastructure for AI models. You'll work with technologies like CUDA and TensorRT in San Francisco.