
About Cohere
AI solutions built for enterprise trust and security
Key Highlights
- Headquartered in Grange Park, Toronto, ON
- $1.5 billion in funding from top investors
- Clients include Royal Bank of Canada, Fujitsu, and Oracle
- Focus on AI solutions for regulated industries
Cohere, headquartered in Grange Park, Toronto, ON, specializes in enterprise-grade AI solutions tailored for regulated industries such as banking and telecom. With $1.5 billion in funding, Cohere has secured contracts with major clients including Royal Bank of Canada, Fujitsu, and Oracle, providing ...
🎁 Benefits
Cohere offers comprehensive benefits including 100% coverage for health, dental, and vision insurance premiums, a $2,000 annual education benefit, six...
🌟 Culture
Cohere's culture emphasizes security and trust in AI adoption, focusing on enterprise needs rather than consumer trends. The company prioritizes a sup...
Skills & Technologies
Overview
Cohere is hiring a Member of Technical Staff specializing in Data Engineering to develop data pipelines for advanced language models. You'll work with technologies like Airflow and Apache Spark, focusing on data ingestion and optimization. This role requires experience in data management and engineering.
Job Description
Who you are
You have a strong background in data engineering, with experience in managing end-to-end data pipelines — you've worked with diverse data sources and understand the importance of data quality and reliability. Your expertise in Python and SQL allows you to efficiently manipulate and analyze large datasets, ensuring they are structured for optimal model performance.
You are familiar with tools like Airflow and Apache Spark, which you have used to automate data workflows and process large volumes of data — your ability to bridge the gap between raw data and AI models is a key strength. You thrive in collaborative environments, working closely with researchers and engineers to enhance the capabilities of AI systems.
What you'll do
In this role, you will be responsible for developing and maintaining the data pipeline that supports Cohere's advanced language models — this includes tasks such as data ingestion, cleaning, filtering, and optimization. You will ensure that datasets are structured and formatted correctly, working with various data types including web data, code data, and multilingual corpora.
You will collaborate with cross-functional teams to identify data needs and implement solutions that enhance model performance — your contributions will directly impact the quality and diversity of the training data used in AI systems. You will also monitor data quality and reliability, making adjustments as necessary to improve the overall data pipeline.
What we offer
Cohere provides a supportive work environment that values mental health and well-being — we offer a comprehensive parental leave top-up and personal enrichment benefits. You will enjoy a flexible work schedule with the option to work remotely or from our offices in Toronto, New York, San Francisco, London, and Paris. We also provide generous vacation time, allowing you to recharge and maintain a healthy work-life balance.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Cohere.
Similar Jobs You Might Like
Based on your interests and this role

Engineering Manager
Stripe is hiring an Engineering Manager for their Data Engineering team to architect and maintain core data infrastructure. You'll work with technologies like AWS, Apache Spark, and Airflow. This position requires strong technical expertise in modern data systems.

Data Engineer
Spotify is seeking a Mid-Level Data Engineer to join their Finance Engineering organization. You'll build and operate data systems for financial forecasting and performance analysis, utilizing skills in Python, SQL, and machine learning.

Machine Learning Engineer
Cohere is hiring a Machine Learning Engineer specializing in pre-training data to develop data pipelines for advanced language models. You'll work with Python, TensorFlow, and PyTorch to enhance model performance. This role requires experience in machine learning and data engineering.

Data Engineer
Spotify is hiring a Mid-Level Data Engineer to build data-driven engineering initiatives within the Platform Central Data squad. You'll design reliable datasets and workflow automations, leveraging skills in SQL and Python.

Machine Learning Engineer
Cohere is hiring a Machine Learning Engineer specializing in synthetic data to develop and manage the synthetic data pipeline for advanced language models. You'll work with Python and generative AI technologies to enhance model quality. This role requires experience in data analysis and machine learning.