Datadog

About Datadog

The cloud monitoring platform engineers love

Key Highlights

  • Public company (NASDAQ: DDOG) - strong equity upside
  • 26,000+ enterprise customers including Netflix & Samsung
  • NYC headquarters with offices in Paris, Dublin, Sydney
  • $1.5B raised from Sequoia, IVP, and Index Ventures

Datadog (NASDAQ: DDOG) is a leading cloud observability platform that provides monitoring and analytics for applications, infrastructure, and logs. Trusted by over 26,000 customers including major companies like Netflix, Samsung, and Airbnb, Datadog is headquartered in New York City. The company went ...

🎁 Benefits

Datadog offers competitive salaries, equity options, generous PTO policies, and a flexible remote work policy. Employees also benefit from a learning ...

🌟 Culture

Datadog fosters an engineering-first culture, with 70% of its workforce comprising engineers. The company emphasizes a strong focus on solving complex...

Overview

Datadog is hiring a Senior Software Engineer (MLOps) to build and scale evaluation systems for AI models. You'll work with technologies like Python, Docker, and AWS to ensure models are reliable and production-ready. This role requires strong experience in machine learning and data engineering.

Job Description

Who you are

You have 5+ years of experience in software engineering with a focus on machine learning operations — you've designed and built systems that support the evaluation and deployment of AI models at scale. Your expertise in Python and familiarity with machine learning frameworks like TensorFlow and PyTorch enable you to create robust solutions for model benchmarking and evaluation. You understand the intricacies of data engineering and have experience working with data pipelines and telemetry systems. Your knowledge of containerization and orchestration tools such as Docker and Kubernetes allows you to deploy applications efficiently in production environments. You are comfortable collaborating with cross-functional teams, including data scientists and engineers, to drive trust and safety observability in AI products. You value a hybrid work culture that promotes collaboration and creativity, and you are eager to contribute to a team that is at the forefront of AI technology.

Desirable

Experience with AWS services and tools for machine learning, such as SageMaker, is a plus. Familiarity with MLflow for managing the machine learning lifecycle will enhance your contributions. You have a proactive approach to problem-solving and are always looking for ways to improve processes and systems.

What you'll do

In this role, you will design and build systems that automate the evaluation of AI models, including large language models (LLMs) and agents. You will lead efforts to develop benchmark suites and evaluation pipelines that incorporate trust and safety metrics. Your work will involve building and maintaining integrations with labeling systems, such as Label Studio, to streamline the dataset labeling process. You will collaborate closely with data scientists to ensure that the models are reliable and safe for production use. Additionally, you will drive the development of performance diagnostics tools that provide insights into model behavior and effectiveness. Your contributions will directly impact the quality and safety of Datadog's AI product offerings, ensuring they meet the highest standards of reliability.

What we offer

Datadog fosters a collaborative and inclusive work environment where creativity thrives. We offer a hybrid workplace model that allows you to balance your professional and personal life effectively. You will have the opportunity to work on cutting-edge AI technologies and contribute to projects that have a significant impact on our products and customers. We provide competitive compensation and benefits, along with opportunities for professional growth and development. Join us at Datadog and be part of a team that is shaping the future of AI in the tech industry.

Similar Jobs You Might Like

Datadog

MLOps Engineer

Datadog 📍 Paris - Hybrid

Datadog is seeking a Senior MLOps Engineer to lead the design and development of high-scale model serving systems. You'll work with Ray-based infrastructure and CI/CD pipelines to ensure reliable deployment of ML models. This role requires expertise in machine learning and Python.

🏢 Hybrid · Senior
1 month ago

Datadog

Software Engineering

Datadog 📍 Paris - Hybrid

Datadog is seeking a Senior Software Engineer for their AI Platform to design and build scalable tools and infrastructure for AI applications. You'll work with technologies like Python, MLOps, and AWS in a hybrid environment based in Paris or Sophia Antipolis.

🏢 Hybrid · Senior
1 month ago

Datadog

MLOps Engineer

Datadog 📍 Paris - Hybrid

Datadog is hiring a Senior MLOps Engineer to design and build robust backend systems for AI infrastructure. You'll work with technologies like Python, Docker, and Kubernetes to enhance ML workflows. This role requires significant experience in MLOps and distributed systems.

🏢 Hybrid · Senior
1 month ago

Datadog

Engineering Manager

Datadog 📍 New York - Hybrid

Datadog is hiring a Lead Engineering Manager for their AI Platform to oversee the Evaluation & Annotation team. You'll manage a team of engineers and define the technical roadmap while working with AI infrastructure. This role requires experience in AI and data science.

🏢 Hybrid · Lead
1 week ago

Sentry

Software Engineering

Sentry 📍 San Francisco - Hybrid

Sentry is hiring a Senior Software Engineer for their AI/ML team to build evaluation infrastructure for AI systems. You'll work with Python and machine learning technologies to ensure the accuracy and reliability of AI features. This role requires strong experience in software engineering and AI.

🏢 Hybrid · Senior
3 weeks ago