Job Description
### **About the Role**
We are looking for a Senior Sensor Reliability Engineer to own the observability, alerting, and automation that ensure Uber's in-vehicle sensor data collection systems operate reliably at scale.
This role is centered on **maximizing sensor uptime, data yield, and supply hours** across a large, geographically distributed fleet. You will design systems that determine _when to react_ to issues impacting data recording capability, whether caused by failing sensors, degraded onboard computers, software regressions, or systemic environmental factors.
As the technical owner for sensor reliability and observability, you will build the infrastructure that converts low-level signals into actionable intelligence and automated responses. This is a senior role requiring strong software engineering fundamentals, deep systems thinking, and the ability to drive cross-team technical direction without direct authority.
### **What You'll Do**
**Sensor & Data Collection Observability Architecture**
- Design and implement the observability and monitoring infrastructure for in-vehicle sensor packages and data recording pipelines, including signal ingestion, storage, correlation, and consumption.
- Build systems that remain reliable despite hardware diversity, intermittent connectivity, and rapid fleet scaling.
**Signal Quality, Alerting & Reaction Design**
- Define alerting strategies and criticality models that distinguish transient anomalies from conditions that materially impact sensor uptime, data yield, or supply hours.
- Design detection logic for real-world failure modes (e.g., silent sensor degradation, partial data loss, compute saturation, or recording pipeline stalls).
**Automation & Fleet-Scale Efficiency**
- Design and implement automated detection, triage, and mitigation mechanisms to reduce manual intervention as fleet size increases.
- Partner with operations and engineering teams to safely automate responses to common and recurring failure scenarios.
**Telemetry & Data Contracts**
- Work with sensor, compute, and platform teams to define the signals and data contracts required to make the sensor stack observable by design.
- Ensure signal consistency and correctness across hardware variants, software versions, and deployment environments.
**Operational Interfaces & Feedback Loops**
- Build technical interfaces that allow Operations to surface issues quickly and Engineering to diagnose and deploy mitigations efficiently.
- Improve time-to-detection and time-to-mitigation for issues impacting data collection reliability.
**Infrastructure as Code & Reliability Ownership**
- Own the deployment, evolution, and reliability of fleet-wide monitoring and reporting systems using infrastructure-as-code best practices.
- Continuously assess and improve system robustness as new sensors, compute platforms, and data collection workflows are introduced.
**Cross-Team Technical Leadership**
- Lead design reviews and reliability discussions across organizational boundaries.
- Translate reliability gaps and operational pain points into concrete technical requirements and prioritized engineering work.
### **Basic Qualifications**
- Proficiency in one or more of Go, Python, or C++, with experience building and operating production systems.
- Proficiency in Linux and shell scripting for triaging and debugging edge devices.
- Strong software engineering fundamentals with the ability to debug across services, containers, and hardware-adjacent systems.
- Proven experience owning reliability, infrastructure, or platform systems that support production workloads.
- Experience designing and operating observability systems, including metrics, logging, alerting, and dashboards.
- Track record of driving complex technical projects across multiple teams from design through production.
### **Preferred Qualifications**
- Experience leading large-scope reliability or infrastructure initiatives consistent with a Senior role.
- Deep experience with modern observability platforms (e.g., Prometheus, Grafana, ELK), especially in edge, IoT, or hardware-integrated environments.
- Experience designing alerting strategies and criticality models that balance signal quality, noise reduction, and operational impact.
- Strong automation mindset, including experience building automated detection, triage, or mitigation systems to reduce manual operational toil.
- Experience running systems where uptime, yield, or availability are core business KPIs.
- Ability to design reliability systems that remain effective as hardware platforms, software stacks, and data collection workflows evolve.
For San Francisco, CA-based roles: The base salary range for this role is USD$180,000 per year - USD$200,000 per year.
For Seattle, WA-based roles: The base salary range for this role is USD$180,000 per year - USD$200,000 per year.
For Sunnyvale, CA-based roles: The base salary range for this role is USD$180,000 per year - USD$200,000 per year.
For all US locations, you will be eligible to participate in Uber's bonus program, and may be offered an equity award & other types of comp. All full-time employees are eligible to participate in a 401(k) plan. You will also be eligible for various benefits. More details can be found at the following link [https://jobs.uber.com/en/benefits](https://jobs.uber.com/en/benefits).
Uber's mission is to reimagine the way the world moves for the better. Here, bold ideas create real-world impact, challenges drive growth, and speed fuels progress. What moves us, moves the world - let's move it forward, together.
Uber is proud to be an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to sex, gender identity, sexual orientation, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. If you have a disability or special need that requires accommodation, please let us know by completing [this form](https://forms.gle/aDWTk9k6xtMU25Y5A).
Offices continue to be central to collaboration and Uber's cultural identity. Unless formally approved to work fully remotely, Uber expects employees to spend at least half of their work time in their assigned office. For certain roles, such as those based at green-light hubs, employees are expected to be in-office for 100% of their time. Please speak with your recruiter to better understand in-office expectations for this role.