
About Cohere
AI solutions built for enterprise trust and security
Key Highlights
- Headquartered in Grange Park, Toronto, ON
- $1.5 billion in funding from top investors
- Clients include Royal Bank of Canada, Fujitsu, and Oracle
- Focus on AI solutions for regulated industries
Cohere, headquartered in Grange Park, Toronto, ON, specializes in enterprise-grade AI solutions tailored for regulated industries such as banking and telecom. With $1.5 billion in funding, Cohere has secured contracts with major clients including Royal Bank of Canada, Fujitsu, and Oracle, providing ...
🎁 Benefits
Cohere offers comprehensive benefits including 100% coverage for health, dental, and vision insurance premiums, a $2,000 annual education benefit, six...
🌟 Culture
Cohere's culture emphasizes security and trust in AI adoption, focusing on enterprise needs rather than consumer trends. The company prioritizes a sup...
Skills & Technologies
Overview
Cohere is hiring a Member of Technical Staff, Model Efficiency to enhance the performance of AI models. You'll work with technologies like Python and TensorFlow to optimize model execution. This position requires experience in machine learning and model optimization.
Job Description
Who you are
You have a strong background in machine learning and model optimization, with experience in improving model efficiency in production environments. You are proficient in Python and have hands-on experience with frameworks like TensorFlow and PyTorch, enabling you to dive deep into model execution and identify bottlenecks effectively.
You thrive in collaborative environments, working closely with researchers and engineers to experiment, measure, and implement improvements that enhance inference performance. Your analytical mindset allows you to develop innovative optimizations that drive lower latency and higher throughput across diverse workloads.
What you'll do
As a Member of Technical Staff, you will be responsible for building reliable machine learning systems that push the boundaries of LLM inference efficiency. You will collaborate with modeling and systems teams to experiment with various techniques and measure their impact on core performance metrics. Your role will involve diving deep into model execution, identifying bottlenecks, and developing optimizations that enhance the overall performance of AI models in production.
You will contribute to the team's mission of scaling intelligence to serve humanity by ensuring that the models we deploy are efficient and effective. Your work will directly impact the capabilities of our models and the value they provide to customers, making you an integral part of our mission to drive the widespread adoption of AI.
What we offer
Cohere provides a supportive work environment with a focus on mental health and personal enrichment. We offer a generous parental leave policy, flexible remote work options, and a budget for personal development in areas such as arts, culture, and fitness. You will enjoy six weeks of vacation, allowing you to recharge and maintain a healthy work-life balance. Join us in shaping the future of AI and making a meaningful impact in the industry.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Cohere.
Similar Jobs You Might Like
Based on your interests and this role

Ai Research Engineer
Cohere is hiring a Staff Research Engineer to enhance model efficiency for AI systems. You'll work on optimizing large language models and improving inference efficiency. This position requires expertise in machine learning and performance optimization.

Ai Engineer
Cohere is hiring an Audio Inference Engineer to optimize audio model serving efficiency. You'll work on advancing core audio metrics and collaborate with infrastructure teams. This role requires expertise in machine learning and audio processing.

Ai Research Engineer
Cohere is hiring a Member of Technical Staff, Agents Modeling to drive the development of agentic LLM systems. You'll work with machine learning techniques and data generation strategies to enhance model capabilities. This role requires experience in machine learning research and engineering.

Other Technical Roles
Anterior is hiring a Member of Technical Staff to help transform healthcare administration through an AI-powered platform. You'll work on simplifying administrative workflows and improving patient outcomes. This role requires a strong foundation in system design and architecture.

Ai Engineer
Cohere is seeking a Member of Technical Staff in Modeling to design and implement novel research ideas and ship state-of-the-art models to production. This role emphasizes collaboration between engineering and research, contributing to AI systems that enhance user experiences.