About Braze

The customer engagement platform that drives connections

🏢 Tech👥 1001+ employees📅 Founded 2011📍 Rose Hill, New York, NY💰 $182.7m⭐ 4.1

B2BMarketingAnalyticsCommunicationSaaS

Key Highlights

Headquartered in Rose Hill, New York, NY
Over 1,000 employees and growing
$182.7 million raised in Series D funding
Clients include Heineken and Domino's

Braze is a leading customer engagement platform headquartered in Rose Hill, New York, NY. With over 1,000 employees, Braze has raised $182.7 million in funding through Series D rounds, serving clients like Heineken and Domino's. The platform enables brands to create personalized messaging experience...

🎁 Benefits

Braze offers a flexible vacation policy, equal paid family services including fertility benefits and parental leave, and retirement planning with matc...

🌟 Culture

Braze fosters a culture focused on building strong customer relationships through technology. The company emphasizes innovation in mobile engagement, ...

🌐 Website 💼 LinkedIn 𝕏 Twitter All 113 jobs →

Site Reliability Engineer • Senior

Braze • San Francisco - On-Site

Posted 1w ago🏛️ On-Site Senior Site Reliability Engineer 📍 San Francisco💰 $128,842 - $232,200 / yearly

Apply Now →

Skills & Technologies

linux distributed systems networking automation

Overview

Braze is hiring a Senior Site Reliability Engineer to ensure the uptime of internal-facing services and platforms. You'll work with Linux, distributed systems, and automation to maintain high service availability. This position requires a strong background in system administration and software engineering.

Job Description

Who you are

You have 5+ years of experience in site reliability engineering or a related field, demonstrating a strong understanding of system administration and software engineering principles. Your expertise in Linux and networking allows you to effectively manage and troubleshoot complex systems, ensuring high availability and performance. You are passionate about automation and have experience implementing CI/CD pipelines to streamline operations and improve efficiency. Your background in distributed systems equips you with the skills to handle the challenges of scaling applications and maintaining uptime in a fast-paced environment.

You thrive in collaborative settings and enjoy working with cross-functional teams to solve challenging problems. Your strong communication skills enable you to articulate technical concepts to non-technical stakeholders, fostering a culture of transparency and teamwork. You are driven by a desire to learn and grow, always seeking new perspectives and approaches to enhance your work and the team's success. You understand the importance of accountability and take ownership of your responsibilities, ensuring that you contribute positively to the team's goals.

Desirable

Experience with cloud platforms such as AWS or GCP is a plus, as is familiarity with container orchestration tools like Kubernetes. A background in security practices and incident management will further enhance your ability to contribute to the team's objectives.

What you'll do

As a Senior Site Reliability Engineer at Braze, you will be responsible for maintaining the uptime of our internal-facing services and platforms. You will apply sound engineering principles and operational discipline to ensure that our systems run smoothly and efficiently. Your role will involve monitoring system performance, troubleshooting issues, and implementing solutions to prevent downtime. You will collaborate closely with software engineers to design and build scalable systems that can handle increasing loads while maintaining high availability.

You will also be involved in automating operational tasks and improving our CI/CD processes, allowing for faster and more reliable deployments. Your expertise in distributed systems will be crucial as you work to optimize our infrastructure and ensure that it can scale effectively. You will participate in incident response and post-mortem analysis, helping to identify root causes and implement preventive measures to enhance system reliability.

What we offer

At Braze, we foster a culture of collaboration and support, where your contributions are valued and recognized. We offer competitive compensation and benefits, along with opportunities for professional development and growth. You will be part of a passionate team that is dedicated to making a real impact in the industry. We encourage you to apply even if your experience doesn't match every requirement, as we believe in the potential of diverse backgrounds and perspectives.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Braze.

Apply Now →Get Job Alerts

✨

Similar Jobs You Might Like

Based on your interests and this role

Site Reliability Engineer

Stellar Development Foundation•📍 San Francisco - On-Site

Stellar Development Foundation is hiring a Senior Site Reliability Engineer to enhance the reliability and scalability of their systems. You'll work with AWS, GCP, and Kubernetes to support the Stellar blockchain ecosystem. This role requires strong experience in infrastructure management and automation.

🏛️ On-SiteSenior

3w ago

Site Reliability Engineer

Braze•📍 San Francisco - On-Site

Braze is seeking a Senior Site Reliability Engineer for their Currents team to build and maintain a high-scale data export system. You'll work with technologies like Kafka and AWS to handle billions of messages daily. This role requires strong experience in site reliability engineering and cloud infrastructure.

🏛️ On-SiteSenior

1w ago

Site Reliability Engineer

Together AI•📍 San Francisco

Together AI is hiring a Site Reliability Engineer to ensure the reliability and performance of user-facing services and production systems. You'll work with Ansible, Terraform, and Kubernetes to build and manage infrastructure. This role requires 2+ years of experience in SRE or a related field.

Mid-Level

2w ago

Site Reliability Engineer

Mercor•📍 San Francisco - On-Site

Mercor is seeking a Site Reliability Engineer to own production reliability across critical systems. You'll work with AWS, Kubernetes, and Terraform to build and improve high-availability systems in San Francisco.

🏛️ On-SiteMid-Level

1 month ago

Site Reliability Engineer

WorkOS•📍 San Francisco - Remote

WorkOS is hiring a Site Reliability Engineer to ensure the platform remains fast, reliable, and resilient at scale. You'll work with AWS, Docker, and Kubernetes to build systems that handle hundreds of millions of requests. This role requires a strong understanding of complex systems and incident response.

🏠 Remote

8 months ago

Browse all jobs →