Cohere

About Cohere

AI solutions built for enterprise trust and security

🏢 Tech👥 501-1000 employees📅 Founded 2019📍 Grange Park, Toronto, ON💰 $1.5b4
B2BArtificial IntelligenceMachine LearningSaaS

Key Highlights

  • Headquartered in Grange Park, Toronto, ON
  • $1.5 billion in funding from top investors
  • Clients include Royal Bank of Canada, Fujitsu, and Oracle
  • Focus on AI solutions for regulated industries

Cohere, headquartered in Grange Park, Toronto, ON, specializes in enterprise-grade AI solutions tailored for regulated industries such as banking and telecom. With $1.5 billion in funding, Cohere has secured contracts with major clients including Royal Bank of Canada, Fujitsu, and Oracle, providing ...

🎁 Benefits

Cohere offers comprehensive benefits including 100% coverage for health, dental, and vision insurance premiums, a $2,000 annual education benefit, six...

🌟 Culture

Cohere's culture emphasizes security and trust in AI adoption, focusing on enterprise needs rather than consumer trends. The company prioritizes a sup...

Cohere

Data Engineer Mid-Level

CohereToronto - Hybrid

Apply Now →

Skills & Technologies

Overview

Cohere is hiring a Member of Technical Staff specializing in Data Engineering to develop data pipelines for advanced language models. You'll work with technologies like Airflow and Apache Spark, focusing on data ingestion and optimization. This role requires experience in data management and engineering.

Job Description

Who you are

You have a strong background in data engineering, with experience in managing end-to-end data pipelines — you've worked with diverse data sources and understand the importance of data quality and reliability. Your expertise in Python and SQL allows you to efficiently manipulate and analyze large datasets, ensuring they are structured for optimal model performance.

You are familiar with tools like Airflow and Apache Spark, which you have used to automate data workflows and process large volumes of data — your ability to bridge the gap between raw data and AI models is a key strength. You thrive in collaborative environments, working closely with researchers and engineers to enhance the capabilities of AI systems.

What you'll do

In this role, you will be responsible for developing and maintaining the data pipeline that supports Cohere's advanced language models — this includes tasks such as data ingestion, cleaning, filtering, and optimization. You will ensure that datasets are structured and formatted correctly, working with various data types including web data, code data, and multilingual corpora.

You will collaborate with cross-functional teams to identify data needs and implement solutions that enhance model performance — your contributions will directly impact the quality and diversity of the training data used in AI systems. You will also monitor data quality and reliability, making adjustments as necessary to improve the overall data pipeline.

What we offer

Cohere provides a supportive work environment that values mental health and well-being — we offer a comprehensive parental leave top-up and personal enrichment benefits. You will enjoy a flexible work schedule with the option to work remotely or from our offices in Toronto, New York, San Francisco, London, and Paris. We also provide generous vacation time, allowing you to recharge and maintain a healthy work-life balance.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Cohere.

Similar Jobs You Might Like

Based on your interests and this role

Stripe

Engineering Manager

Stripe📍 Toronto

Stripe is hiring an Engineering Manager for their Data Engineering team to architect and maintain core data infrastructure. You'll work with technologies like AWS, Apache Spark, and Airflow. This position requires strong technical expertise in modern data systems.

Lead
3d ago
Spotify

Data Engineer

Spotify📍 Toronto

Spotify is seeking a Mid-Level Data Engineer to join their Finance Engineering organization. You'll build and operate data systems for financial forecasting and performance analysis, utilizing skills in Python, SQL, and machine learning.

Mid-Level
1 month ago
Cohere

Machine Learning Engineer

Cohere📍 Toronto - Hybrid

Cohere is hiring a Machine Learning Engineer specializing in pre-training data to develop data pipelines for advanced language models. You'll work with Python, TensorFlow, and PyTorch to enhance model performance. This role requires experience in machine learning and data engineering.

🏢 HybridMid-Level
8 months ago
Spotify

Data Engineer

Spotify📍 Toronto

Spotify is hiring a Mid-Level Data Engineer to build data-driven engineering initiatives within the Platform Central Data squad. You'll design reliable datasets and workflow automations, leveraging skills in SQL and Python.

Mid-Level
3 months ago
Cohere

Machine Learning Engineer

Cohere📍 Toronto - Hybrid

Cohere is hiring a Machine Learning Engineer specializing in synthetic data to develop and manage the synthetic data pipeline for advanced language models. You'll work with Python and generative AI technologies to enhance model quality. This role requires experience in data analysis and machine learning.

🏢 HybridMid-Level
2 months ago