
About Together AI
Empowering corporate mentorship for effective learning
Key Highlights
- Founded in 2018, headquartered in Toronto, ON
- Raised $1.7 million in seed funding
- Partnerships with Heineken, Reddit, and 7-Eleven
- 4 weeks paid vacation and competitive equity packages
Together is a corporate mentorship management platform founded in 2018, headquartered in CityPlace, Toronto, ON. The platform streamlines the mentorship lifecycle, facilitating connections among employees at companies like Heineken, Reddit, and 7-Eleven. With $1.7 million in seed funding, Together a...
🎁 Benefits
Together offers competitive salaries and equity packages, 4 weeks of paid vacation, and a comprehensive health, dental, and vision plan through Honeyb...
🌟 Culture
Together fosters a culture of autonomy and impact, allowing employees to take on significant responsibilities without bureaucratic constraints. The fo...
Overview
Together AI is hiring a Systems Research Engineer specializing in GPU programming to develop and optimize GPU-accelerated kernels for ML/AI applications. You'll collaborate with cross-functional teams and apply your expertise in GPU programming and parallel computing. The role requires a strong background in GPU programming techniques.
Job Description
Who you are
You have a strong background in GPU programming and parallel computing, with expertise in technologies such as CUDA and/or Triton. Your knowledge of ML/AI applications and models allows you to contribute effectively to the development of GPU-accelerated solutions. You possess excellent problem-solving and analytical skills, enabling you to optimize and fine-tune GPU code for better performance and scalability. With a Bachelor's, Master's, or Ph.D. degree in Computer Science, Electrical Engineering, or equivalent practical experience, you are well-equipped to tackle complex challenges in this field.
Desirable
Staying up-to-date with the latest advancements in GPU programming techniques is important to you, and you are eager to apply this knowledge to enhance the performance and efficiency of AI systems. Your collaborative spirit allows you to work effectively with cross-functional teams, integrating GPU-accelerated solutions into existing software systems.
What you'll do
As a Systems Research Engineer at Together AI, you will play a crucial role in developing and optimizing GPU-accelerated kernels and algorithms for ML/AI applications. You will work closely with the modeling and algorithm team to co-design GPU kernels and model architecture, ensuring that our AI infrastructure remains at the forefront of innovation. Your research skills will be vital in exploring new GPU programming techniques and contributing to the co-design of efficient GPU architectures and programming models.
You will optimize and fine-tune GPU code to achieve better performance and scalability, collaborating with hardware and software teams to integrate GPU-accelerated solutions into existing systems. Your contributions will help enhance the overall efficiency of our AI systems, making a significant impact on our research-driven initiatives.
What we offer
Together AI is committed to fostering an inclusive and innovative work environment. You will have the opportunity to work on cutting-edge technologies and contribute to the advancement of AI systems. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds. Join us in our mission to drive open and transparent AI systems that will shape the future of technology.
Similar Jobs You Might Like
Based on your interests and this role

Technical Lead
OpenAI is hiring a Technical Lead for the Sora team to optimize model serving efficiency and enhance inference performance. You'll work closely with research and product teams, leveraging your expertise in GPU and kernel-level systems.

GPU Kernel Engineer
Baseten is hiring a GPU Kernel Engineer to optimize performance for cutting-edge AI workloads. You'll work with C, C++, and CUDA in San Francisco. This position requires experience in low-level optimization and machine learning.

Software Engineering
OpenAI is hiring a Software Engineer for their GPU Infrastructure team to ensure the reliability and uptime of their compute fleet. You'll work with cutting-edge technologies in a high-performance computing environment. This position requires experience in system-level investigations and automation.

GPU Performance Engineer
Genmo is seeking a GPU Performance Engineer to optimize their H100 infrastructure for video generation. You'll leverage advanced profiling tools and write high-performance CUDA kernels to achieve significant speedups. This role requires 5+ years of systems programming experience.

Software Engineering
Waymo is hiring a Software Engineer specializing in GPU development to enhance their autonomous driving technology. You'll work with C++, Python, and OpenGL to develop high-performance GPU primitives. This position requires experience in GPU programming and system-level architecture.