
About Braze
The customer engagement platform that drives connections
Key Highlights
- Headquartered in Rose Hill, New York, NY
- Over 1,000 employees and growing
- $182.7 million raised in Series D funding
- Clients include Heineken and Domino's
Braze is a leading customer engagement platform headquartered in Rose Hill, New York, NY. With over 1,000 employees, Braze has raised $182.7 million in funding through Series D rounds, serving clients like Heineken and Domino's. The platform enables brands to create personalized messaging experience...
🎁 Benefits
Braze offers a flexible vacation policy, equal paid family services including fertility benefits and parental leave, and retirement planning with matc...
🌟 Culture
Braze fosters a culture focused on building strong customer relationships through technology. The company emphasizes innovation in mobile engagement, ...
Skills & Technologies
Overview
Braze is hiring a Senior Site Reliability Engineer to ensure the uptime of internal-facing services and platforms. You'll work with Linux, distributed systems, and automation to maintain high service availability. This position requires a strong background in system administration and software engineering.
Job Description
Who you are
You have 5+ years of experience in site reliability engineering or a related field, demonstrating a strong understanding of system administration and software engineering principles. Your expertise in Linux and networking allows you to effectively manage and troubleshoot complex systems, ensuring high availability and performance. You are passionate about automation and have experience implementing CI/CD pipelines to streamline operations and improve efficiency. Your background in distributed systems equips you with the skills to handle the challenges of scaling applications and maintaining uptime in a fast-paced environment.
You thrive in collaborative settings and enjoy working with cross-functional teams to solve challenging problems. Your strong communication skills enable you to articulate technical concepts to non-technical stakeholders, fostering a culture of transparency and teamwork. You are driven by a desire to learn and grow, always seeking new perspectives and approaches to enhance your work and the team's success. You understand the importance of accountability and take ownership of your responsibilities, ensuring that you contribute positively to the team's goals.
Desirable
Experience with cloud platforms such as AWS or GCP is a plus, as is familiarity with container orchestration tools like Kubernetes. A background in security practices and incident management will further enhance your ability to contribute to the team's objectives.
What you'll do
As a Senior Site Reliability Engineer at Braze, you will be responsible for maintaining the uptime of our internal-facing services and platforms. You will apply sound engineering principles and operational discipline to ensure that our systems run smoothly and efficiently. Your role will involve monitoring system performance, troubleshooting issues, and implementing solutions to prevent downtime. You will collaborate closely with software engineers to design and build scalable systems that can handle increasing loads while maintaining high availability.
You will also be involved in automating operational tasks and improving our CI/CD processes, allowing for faster and more reliable deployments. Your expertise in distributed systems will be crucial as you work to optimize our infrastructure and ensure that it can scale effectively. You will participate in incident response and post-mortem analysis, helping to identify root causes and implement preventive measures to enhance system reliability.
What we offer
At Braze, we foster a culture of collaboration and support, where your contributions are valued and recognized. We offer competitive compensation and benefits, along with opportunities for professional development and growth. You will be part of a passionate team that is dedicated to making a real impact in the industry. We encourage you to apply even if your experience doesn't match every requirement, as we believe in the potential of diverse backgrounds and perspectives.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Braze.
Similar Jobs You Might Like
Based on your interests and this role

Site Reliability Engineer
Stellar Development Foundation is hiring a Senior Site Reliability Engineer to enhance the reliability and scalability of their systems. You'll work with AWS, GCP, and Kubernetes to support the Stellar blockchain ecosystem. This role requires strong experience in infrastructure management and automation.

Site Reliability Engineer
Braze is seeking a Senior Site Reliability Engineer for their Currents team to build and maintain a high-scale data export system. You'll work with technologies like Kafka and AWS to handle billions of messages daily. This role requires strong experience in site reliability engineering and cloud infrastructure.

Site Reliability Engineer
Together AI is hiring a Site Reliability Engineer to ensure the reliability and performance of user-facing services and production systems. You'll work with Ansible, Terraform, and Kubernetes to build and manage infrastructure. This role requires 2+ years of experience in SRE or a related field.

Site Reliability Engineer
Mercor is seeking a Site Reliability Engineer to own production reliability across critical systems. You'll work with AWS, Kubernetes, and Terraform to build and improve high-availability systems in San Francisco.

Site Reliability Engineer
WorkOS is hiring a Site Reliability Engineer to ensure the platform remains fast, reliable, and resilient at scale. You'll work with AWS, Docker, and Kubernetes to build systems that handle hundreds of millions of requests. This role requires a strong understanding of complex systems and incident response.