Datadog

About Datadog

The cloud monitoring platform engineers love

Key Highlights

  • Public company (NYSE: DDOG) - strong equity upside
  • 26,000+ enterprise customers including Netflix & Samsung
  • NYC headquarters with offices in Paris, Dublin, Sydney
  • $1.5B raised from Sequoia, IVP, and Index Ventures

Datadog (NYSE: DDOG) is a leading cloud observability platform that provides monitoring and analytics for applications, infrastructure, and logs. Trusted by over 26,000 customers including major companies like Netflix, Samsung, and Airbnb, Datadog is headquartered in New York City. The company went ...

🎁 Benefits

Datadog offers competitive salaries, equity options, generous PTO policies, and a flexible remote work policy. Employees also benefit from a learning ...

🌟 Culture

Datadog fosters an engineering-first culture, with 70% of its workforce comprising engineers. The company emphasizes a strong focus on solving complex...

Datadog

Staff Engineer Lead

DatadogBoston - Hybrid

Posted 2d ago🏢 HybridLeadStaff Engineer📍 Boston📍 New York💰 $234,000 - $300,000 / yearly
Apply Now →

Overview

Datadog is hiring a Staff Software Engineer for their ML Observability team to develop tools for monitoring and improving AI systems. You'll work with technologies like Python, Java, and TensorFlow to enhance observability for LLMs. This role requires expertise in machine learning and software engineering.

Job Description

Who you are

You have a strong background in software engineering with a focus on machine learning — your experience includes building and deploying AI systems in production environments. You possess deep knowledge of large language models and generative AI, enabling you to tackle complex challenges in AI observability. Your proficiency in programming languages such as Python and Java allows you to develop robust solutions that enhance AI system performance. You are familiar with containerization and orchestration tools like Docker and Kubernetes, which are essential for deploying scalable applications. You thrive in collaborative environments, working cross-functionally with product, UX, and applied science teams to drive innovation and product-market fit. You are passionate about creating tools that make AI systems understandable and reliable, and you are eager to lead the development of new features that will impact customers positively.

What you'll do

In this role, you will drive the design and implementation of observability features for large language models — your work will involve ideating, prototyping, and scaling new product features that provide insights into generative AI systems. You will collaborate closely with other engineering teams to iterate quickly and ensure that the tools you develop meet customer needs effectively. Your responsibilities will include developing and extending tools for tracing, evaluating, and debugging AI models, ensuring that they perform optimally in production. You will also shape the product direction by applying your deep understanding of AI systems and software engineering to solve open-ended problems in the fast-moving AI landscape. Your contributions will directly impact how customers monitor, troubleshoot, and optimize their LLM-based applications, enabling them to ship AI with confidence.

What we offer

At Datadog, we value our office culture and the relationships built within our teams — we operate as a hybrid workplace to ensure that our employees can create a work-life harmony that best fits them. You will have the opportunity to work on cutting-edge technology that is shaping the future of AI observability. We offer competitive compensation and benefits, along with a supportive environment that encourages professional growth and development. Join us in building foundational tools that make AI systems observable, understandable, and reliable in the real world.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Datadog.

Similar Jobs You Might Like

Based on your interests and this role

Astronomer

Staff Engineer

Astronomer📍 New York - Hybrid

Astronomer is hiring a Staff Software Engineer to lead the design and development of their observability platform. You'll work with Apache Airflow and Python to solve complex technology problems. This position requires significant experience in software engineering.

🏢 HybridSenior
3w ago
Pinterest

Staff Engineer

Pinterest📍 San Francisco - Remote

Pinterest is hiring a Staff Software Engineer for their Observability team to design and build infrastructure for large-scale distributed systems. You'll work with technologies related to observability solutions and data pipelines. This position requires deep technical expertise in distributed systems.

🏠 RemoteSenior
1w ago
Databricks

Staff Engineer

Databricks📍 Mountain View - On-Site

Databricks is hiring a Staff Software Engineer for their Observability team to develop solutions that provide insights into product health and performance. You'll work with technologies like Java and Python, focusing on AWS and observability tools. This position requires significant experience in software engineering.

🏛️ On-SiteSenior
1d ago
Google

Machine Learning Engineer

Google📍 New York

Google is hiring a Staff ML Engineer to develop and productionize machine learning systems. You'll work with technologies such as Keras and TFX while leveraging your expertise in machine learning and experiment design. This position requires 8+ years of experience in software development and 5+ years in machine learning.

Staff
2w ago
Datadog

Product Designer

Datadog📍 New York - Hybrid

Datadog is hiring a Staff Product Designer for AI Observability to own the end-to-end design of tools for AI developers. You'll collaborate with product managers, engineers, and data scientists, utilizing Figma and Adobe XD to create intuitive user experiences.

🏢 HybridStaff
3w ago