
About Databricks
Empowering data teams with unified analytics
Key Highlights
- Headquartered in San Francisco, CA
- Valuation of $43 billion with $3.5 billion raised
- Serves over 7,000 customers including Comcast and Shell
- Utilizes Apache Spark for big data processing
Databricks, headquartered in San Francisco, California, is a unified data analytics platform that simplifies data engineering and collaborative data science. Trusted by over 7,000 organizations, including Fortune 500 companies like Comcast and Shell, Databricks has raised $3.5 billion in funding, ac...
🎁 Benefits
Databricks offers competitive salaries, equity options, generous PTO policies, and a remote-friendly work environment. Employees also benefit from a l...
🌟 Culture
Databricks fosters a culture of innovation with a strong emphasis on data-driven decision-making. The company values collaboration across teams and en...
Overview
Databricks is hiring a Senior Platform Monitoring Engineer to lead platform incident investigations and enhance customer experience. You'll work with AWS, Linux, Docker, and Kubernetes in a hybrid role based in São Paulo.
Job Description
Who you are
You have a strong background in platform reliability and incident response, with at least 5 years of experience in a technical role focused on monitoring and observability. You thrive in high-pressure situations, coordinating cross-functional teams to resolve incidents swiftly and effectively. Your expertise in AWS and Linux allows you to design and implement robust monitoring solutions that enhance platform stability and customer satisfaction. You are familiar with containerization technologies like Docker and orchestration tools such as Kubernetes, enabling you to manage complex environments efficiently. Your analytical mindset helps you identify systemic issues and drive improvements that benefit both the platform and its users. You possess excellent communication skills, allowing you to articulate technical concepts to non-technical stakeholders and foster collaboration across teams.
What you'll do
As a Senior Platform Monitoring Engineer at Databricks, you will lead the investigation of platform incidents, ensuring rapid detection, mitigation, and resolution to minimize customer impact. You will design observability solutions that provide deep insights into platform performance, enabling proactive management of potential issues. Your role will involve collaborating with engineering teams to implement best practices in monitoring and incident response, ensuring that the platform remains reliable and performant. You will also be responsible for conducting post-incident reviews, identifying root causes, and driving systemic improvements to prevent future occurrences. Your contributions will directly enhance the customer experience, making Databricks' platform a leader in data and AI infrastructure. You will work in a hybrid environment, attending the São Paulo office at least three times a week, and your shift will typically run from 1 PM to 10 PM São Paulo time.
What we offer
At Databricks, we offer a dynamic work environment where you can tackle some of the most challenging problems in data and AI. You will be part of a passionate team that values collaboration and innovation. We provide competitive compensation and benefits, along with opportunities for professional growth and development. Join us in our mission to empower data teams and make a significant impact in the industry.