
About Baseten
Simplifying machine learning for every organization
Key Highlights
- Headquartered in Union Square, San Francisco, CA
- $285 million raised in Series C funding
- Team growth of 3x over the last five years
- Unlimited PTO with a company-wide holiday break
Baseten is a machine learning application builder headquartered in Union Square, San Francisco, CA. With $285 million in funding from investors like Coatue Management and Founders Fund, Baseten simplifies AI integration for businesses, enabling data scientists to deploy ML models without needing spe...
🎁 Benefits
Baseten offers a remote-first work environment with a $1,000 stipend for home office setup, unlimited PTO with a company-wide break during the holiday...
🌟 Culture
Baseten's culture emphasizes simplifying complex AI technologies for businesses, fostering a collaborative environment where team members can connect ...
Skills & Technologies
Overview
Baseten is hiring an Applied AI Inference Engineer to architect and deploy high-scale production AI applications. You'll work with Python and AWS to deliver reliable services. This role requires experience in AI and software development.
Job Description
Who you are
You have a strong background in software engineering with hands-on experience in building and deploying AI applications — you thrive in environments where you can translate complex business goals into technical solutions. Your expertise in Python and familiarity with machine learning frameworks enable you to create reliable and observable services that meet customer needs. You enjoy collaborating with cross-functional teams, including product management and customer success, to ensure that AI solutions are effectively integrated into business processes.
You are comfortable working directly with customers, guiding them through the journey from initial exploration to production deployment — your entrepreneurial spirit drives you to take ownership of projects and deliver high-quality outcomes. You have a knack for understanding customer requirements and translating them into actionable engineering tasks, ensuring that the solutions you build align with their expectations.
Your experience includes working with cloud platforms, particularly AWS, which allows you to leverage scalable infrastructure for AI applications — you understand the importance of performance engineering and are adept at optimizing applications for latency and cost. You are also familiar with best practices in software development, including version control and continuous integration, which help you maintain high standards in your work.
You possess excellent communication skills, enabling you to articulate technical concepts to non-technical stakeholders — you believe that collaboration is key to successful project delivery and enjoy mentoring junior engineers as they grow in their roles. You are passionate about the potential of AI and are eager to contribute to innovative projects that push the boundaries of technology.
Desirable
Experience with additional programming languages or frameworks is a plus, as is familiarity with data engineering concepts — you are open to learning and adapting to new technologies as needed. A background in product management or customer-facing roles can enhance your ability to bridge the gap between technical and business teams.
What you'll do
As an Applied AI Inference Engineer at Baseten, you will be responsible for architecting and deploying high-scale production AI applications on our platform — you will work closely with customers to understand their needs and guide them through the implementation process. Your role will involve translating ambiguous business goals into clear technical specifications, ensuring that the solutions you develop are reliable and meet performance expectations.
You will collaborate with product teams to define success metrics and monitor the performance of deployed applications — your insights will help drive continuous improvement and ensure that our AI solutions deliver value to customers. You will also engage in hands-on coding, developing software that integrates seamlessly with our platform and meets the high standards expected by our clients.
In addition to technical responsibilities, you will play a key role in customer success, working directly with clients to address their concerns and provide solutions that enhance their experience with our products — your ability to communicate effectively will be crucial in building strong relationships with customers. You will also contribute to pre-sales activities, helping to demonstrate the capabilities of our platform to potential clients.
What we offer
At Baseten, we are committed to fostering a diverse and inclusive workplace — we provide equal employment opportunities to all employees and applicants. You will have the opportunity to work with a collaborative and forward-thinking team, contributing to projects that are at the forefront of AI technology. We offer competitive compensation and benefits, along with opportunities for professional growth and development in a rapidly evolving industry.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Baseten.
Similar Jobs You Might Like
Based on your interests and this role

Ai Engineer
Delve is hiring an Applied AI Engineer to bring the latest AI research into production. You'll collaborate with product engineers to design and deploy AI-driven features. This role requires expertise in Python and machine learning technologies.

Ai Engineer
SafetyKit is hiring an Applied AI Engineer to develop innovative B2B SaaS solutions using AI agents. You'll work with language models and collaborate with OpenAI's research teams. This role requires a strong understanding of AI applications and model evaluation.

Ai Engineer
Parafin is hiring an Applied AI Engineer to leverage industry-leading models for financial infrastructure and risk modeling. You'll work with technologies like Python and TensorFlow. This position is suitable for early career professionals.

Ai Engineer
Factory is hiring an AI Engineer to design and develop innovative AI systems that enhance productivity and innovation. You'll work with emerging AI research and integrate it into customer-centric solutions. This role requires 2+ years of experience in AI/ML engineering.

Ai Engineer
Collate is hiring an AI Engineer to build and productionize models for their AI document generation platform in life sciences. You'll work at the intersection of machine learning and software engineering. This role requires expertise in AI and Python.