Mithrl

About Mithrl

Automating bioinformatics for faster scientific discovery

🏢 Tech · 👥 11-50 · 📅 Founded 2023 · 📍 San Francisco, California, United States

Key Highlights

  • Headquartered in San Francisco, California
  • Specializes in automating NGS data workflows
  • Utilizes AI and natural language processing
  • Team size of 11-50 employees

Mithrl is an AI-powered platform headquartered in San Francisco, California, that automates Next-Generation Sequencing (NGS) data workflows for scientific labs. With its innovative use of natural language processing, Mithrl generates custom bioinformatics pipelines in minutes, significantly reducing...

🎁 Benefits

Mithrl offers competitive salaries, equity options, flexible PTO, and a remote-friendly work policy to support work-life balance....

🌟 Culture

Mithrl fosters a culture of innovation and efficiency, emphasizing the importance of leveraging AI to streamline scientific research and improve outco...

Overview

Mithrl is hiring a Data Engineer to build and own an AI-powered ingestion and normalization pipeline for scientific data. You'll work with Python, SQL, and Airflow to transform messy biological data into clean datasets. This role requires experience in data engineering and cloud technologies.

Job Description

Who you are

You have 3+ years of experience in data engineering, with a strong background in building data pipelines and working with a variety of data formats. You are proficient in Python and SQL, and you understand how data ingestion and normalization work in practice. Your experience with cloud platforms, particularly AWS, lets you run data workflows at scale. You have used Airflow or similar tools to orchestrate complex data workflows, and you have worked with containerization technologies such as Docker. You are detail-oriented and skilled at cleaning and structuring messy datasets to ensure data quality and consistency.

Desirable

Experience with machine learning frameworks or AI tools is a plus, as is familiarity with data visualization tools. You are comfortable working in a collaborative environment and enjoy engaging with cross-functional teams to understand their data needs. You have a proactive approach to problem-solving and are eager to learn and adapt to new technologies.

What you'll do

In this role, you will build and maintain an AI-powered data ingestion and normalization pipeline. You will work with a range of data sources, including raw Excel/CSV uploads and lab exports, and make sure the data is ingested and transformed accurately. Your tasks will include developing robust schema mapping and conversion logic that standardizes data formats and keeps datasets consistent with one another. You will use Python and SQL to clean and prepare data for downstream analytics, so that the AI Co-Scientist has high-quality data to analyze.

You will collaborate closely with data scientists and other stakeholders to understand their data requirements and provide them with the necessary datasets for their analyses. You will also be involved in optimizing data workflows and ensuring that all transformations are executed efficiently during the ingestion process. Your work will directly contribute to the success of Mithrl's mission to accelerate drug discovery and improve patient outcomes.

What we offer

Mithrl offers a dynamic work environment where innovation and collaboration are at the forefront. You will have the opportunity to work with cutting-edge AI technologies and contribute to meaningful projects that have a real impact on the life sciences industry. We provide competitive compensation and benefits, along with opportunities for professional growth and development. Join us in our mission to transform the way scientific data is utilized in drug discovery.
