About Together AI

Empowering corporate mentorship for effective learning

👥 21-100 employees📍 CityPlace, Toronto, ON💰 $1.7m

B2BHRLearningSaaSCommunity

Key Highlights

Founded in 2018, headquartered in Toronto, ON
Raised $1.7 million in seed funding
Partnerships with Heineken, Reddit, and 7-Eleven
4 weeks paid vacation and competitive equity packages

Together is a corporate mentorship management platform founded in 2018, headquartered in CityPlace, Toronto, ON. The platform streamlines the mentorship lifecycle, facilitating connections among employees at companies like Heineken, Reddit, and 7-Eleven. With $1.7 million in seed funding, Together a...

🎁 Benefits

Together offers competitive salaries and equity packages, 4 weeks of paid vacation, and a comprehensive health, dental, and vision plan through Honeyb...

🌟 Culture

Together fosters a culture of autonomy and impact, allowing employees to take on significant responsibilities without bureaucratic constraints. The fo...

🌐 Website All 36 jobs →

Ai Research Engineer • Mid-Level

Together AI • San Francisco - On-Site

Posted 2w ago🏛️ On-Site Mid-Level Ai Research Engineer 📍 San Francisco💰 $160,000 - $230,000 / yearly

Apply Now →

Skills & Technologies

Cuda Gpu programming Parallel computing

Overview

Together AI is hiring a Systems Research Engineer specialized in GPU Programming to develop and optimize GPU-accelerated kernels for ML/AI applications. You'll collaborate with cross-functional teams and leverage your expertise in GPU programming and parallel computing. This role requires a strong background in GPU programming techniques.

Job Description

Who you are

You have a strong background in GPU programming and parallel computing, with expertise in technologies such as CUDA and/or Triton. Your knowledge of ML/AI applications and models allows you to contribute effectively to the development of GPU-accelerated solutions. You possess excellent problem-solving and analytical skills, enabling you to optimize and fine-tune GPU code for better performance and scalability. With a Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or equivalent practical experience, you are well-equipped to tackle complex challenges in this field.

Desirable

Staying up-to-date with the latest advancements in GPU programming techniques is important to you, and you are eager to apply this knowledge to enhance the performance and efficiency of AI systems. Your collaborative spirit allows you to work effectively with cross-functional teams, integrating GPU-accelerated solutions into existing software systems.

What you'll do

As a Systems Research Engineer at Together AI, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. You will work closely with the modeling and algorithm team to co-design GPU kernels and model architecture, ensuring that our AI infrastructure remains at the forefront of innovation. Your research skills will be vital in exploring new GPU programming techniques and contributing to the co-design of efficient GPU architectures and programming models.

You will optimize and fine-tune GPU code to achieve better performance and scalability, collaborating with hardware and software teams to integrate GPU-accelerated solutions into existing systems. Your contributions will help enhance the overall efficiency of our AI systems, making a significant impact on our research-driven initiatives.

What we offer

Together AI is committed to fostering an inclusive and innovative work environment. You will have the opportunity to work on cutting-edge technologies and contribute to the advancement of AI systems. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds. Join us in our mission to drive open and transparent AI systems that will shape the future of technology.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Together AI.

Apply Now →Get Job Alerts

✨

Similar Jobs You Might Like

Based on your interests and this role

Technical Lead

OpenAI•📍 San Francisco

OpenAI is hiring a Technical Lead for the Sora team to optimize model serving efficiency and enhance inference performance. You'll work closely with research and product teams, leveraging your expertise in GPU and kernel-level systems.

Lead

10 months ago

Gpu Kernel Engineer

Baseten•📍 San Francisco - On-Site

Baseten is hiring a GPU Kernel Engineer to optimize performance for cutting-edge AI workloads. You'll work with C, C++, and CUDA in San Francisco. This position requires experience in low-level optimization and machine learning.

🏛️ On-SiteMid-Level

7 months ago

Software Engineering

OpenAI•📍 San Francisco - On-Site

OpenAI is hiring a Software Engineer for their GPU Infrastructure team to ensure the reliability and uptime of their compute fleet. You'll work with cutting-edge technologies in a high-performance computing environment. This position requires experience in system-level investigations and automation.

🏛️ On-SiteMid-Level

2w ago

Gpu Performance Engineer

Genmo•📍 San Francisco - On-Site

Genmo is seeking a GPU Performance Engineer to optimize their H100 infrastructure for video generation. You'll leverage advanced profiling tools and write high-performance CUDA kernels to achieve significant speedups. This role requires 5+ years of systems programming experience.

🏛️ On-SiteSenior

7 months ago

Software Engineering

Waymo•📍 Mountain View - Hybrid

Waymo is hiring a Software Engineer specializing in GPU development to enhance their autonomous driving technology. You'll work with C++, Python, and OpenGL to develop high-performance GPU primitives. This position requires experience in GPU programming and system-level architecture.

🏢 HybridMid-Level

1w ago

Browse all jobs →