Microsoft

About Microsoft

Empowering every person and organization on the planet

🏢 Tech👥 100K+📅 Founded 1975📍 Redmond, Washington, United States

Key Highlights

  • Market cap exceeds $2 trillion
  • 100,000+ employees worldwide
  • Leading cloud services through Azure
  • Major clients include Walmart and BMW

Microsoft Corporation, headquartered in Redmond, Washington, is a leading technology company known for its software products like Windows and Office, as well as cloud services through Azure. With over 100,000 employees, Microsoft serves millions of customers globally, including major enterprises lik...

🎁 Benefits

Microsoft offers competitive salaries, stock options, generous PTO policies, and comprehensive health benefits. Employees also enjoy a flexible remote...

🌟 Culture

Microsoft fosters a culture of innovation and inclusivity, emphasizing collaboration across teams and a commitment to diversity. The company values em...

Microsoft

Ai Operations Engineer Mid-Level

MicrosoftUnited States

Posted 2w agoMid-LevelAi Operations Engineer📍 United States💰 $100,600 - $199,000 / yearly
Apply Now →

Overview

Microsoft is hiring an AI Operations Engineer II to build and maintain operational infrastructure for their Security AI Platform. You'll work with CI/CD pipelines, Kubernetes deployments, and monitoring systems. This role requires expertise in Azure and DevOps practices.

Job Description

Who you are

You have a solid background in operational infrastructure, particularly in managing CI/CD pipelines and Kubernetes deployments. Your experience includes maintaining and extending CI/CD workflows using Azure DevOps and GitHub Actions, ensuring smooth build automation and deployment processes. You are familiar with observability tools like Prometheus and Grafana, enabling you to develop and maintain effective monitoring systems. You thrive in collaborative environments, working closely with engineering teams to enhance production reliability and incident response.

You possess strong troubleshooting skills, allowing you to debug and diagnose issues effectively. Your ability to analyze logs, traces, and metrics helps you identify problems and work with senior engineers on complex issues. You are also experienced in maintaining Infrastructure as Code, updating Bicep templates and Helm values to ensure branch health and security scanning.

What you'll do

In this role, you will be responsible for building and maintaining the operational infrastructure for the Security AI Platform at Microsoft. You will manage CI/CD pipelines, ensuring they are efficient and reliable, while also overseeing Kubernetes deployments, including Helm chart updates and pod troubleshooting. Your contributions will directly impact the reliability of AI-native security capabilities at scale.

You will participate in an on-call rotation, responding to alerts and triaging incidents as they arise. Your role will involve documenting findings and collaborating with senior engineers to improve production operations. You will also develop expertise in observability, creating Grafana dashboards and alerting rules to enhance system monitoring. Your work will be crucial in maintaining the health and performance of the platform, contributing to the overall success of the Security AI team.

What we offer

Microsoft provides a dynamic work environment where you can grow your skills and advance your career. You will have the opportunity to work with cutting-edge technologies and collaborate with talented professionals in the field. The company values diversity and encourages you to apply even if your experience doesn't match every requirement. Join us to make a significant impact in the world of AI and security.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Microsoft.

Similar Jobs You Might Like

Based on your interests and this role

Optiver

Ai Operations Engineer

Optiver📍 Sydney - On-Site

Optiver is hiring an AI Operations Engineer to support their growing AI program in Sydney. You'll be responsible for architecting, implementing, and monitoring deployments of their AI platform, utilizing skills in Docker, Kubernetes, and Python. This role requires experience in operations and cloud technologies.

🏛️ On-SiteMid-Level
2 months ago
TripActions

Ai Engineer

TripActions📍 Palo Alto

TripActions is hiring a Senior AI Operations Engineer to architect a platform for managing a fleet of specialized AI services. You'll work with AWS and SageMaker to optimize inference and ensure reliability. This position requires expertise in MLOps and machine learning.

Senior
1w ago
TripActions

Ai Engineer

TripActions📍 Tel Aviv

TripActions is hiring a Senior AI Operations Engineer to architect a platform for managing a fleet of specialized AI services. You'll work with Python, SageMaker, and Terraform to optimize AI operations. This role requires expertise in MLOps and orchestration of language models.

Senior
1w ago
Klaviyo

Ai Engineer

Klaviyo📍 Boston

Klaviyo is hiring an AI Engineer II to design and build scalable backend systems for AI products. You'll work with Python, machine learning technologies, and AWS to drive impactful solutions. This position requires experience in AI and backend development.

Mid-Level
1w ago
Snappr

Ai Operations Manager

Snappr📍 Metro Manila

Snappr is hiring an AI Operations Manager to lead a team of editors ensuring high-quality visual content. You'll manage AI-driven image creation processes to enhance customer experience. This role requires strong leadership and operational skills.

Mid-Level
11 months ago