
About Affirm
Transparent financing for modern consumers
Key Highlights
- 21M+ consumers and 337,000+ merchants using Affirm
- Raised $1.1B in funding, currently in Series F
- Flexible payback options from 3 to 36 months
- Headquartered in Chinatown, San Francisco, CA
Affirm, headquartered in Chinatown, San Francisco, CA, is a leading fintech company specializing in point-of-sale installment loans. With over 21 million consumers and 337,000+ merchants including Shopify, KAYAK, and Walmart, Affirm offers flexible payback options ranging from 3 to 36 months. The co...
🎁 Benefits
Affirm offers a remote-first workforce policy, allowing employees to work from anywhere in their home country. Benefits include 18 weeks of paid paren...
🌟 Culture
Affirm's culture is centered around transparency and consumer empowerment, with a focus on delivering honest financial products. The company actively ...
Overview
Affirm is seeking a Staff Site Reliability Engineer to enhance platform reliability and incident management. You'll work with AWS, Docker, and Kubernetes to ensure application performance and resilience. This role requires extensive experience in SRE practices.
Job Description
Who you are
You have a strong background in Site Reliability Engineering with at least 5 years of experience in building and maintaining reliable systems. Your expertise in AWS and Kubernetes allows you to design and implement robust infrastructure solutions that support high availability and scalability. You are proficient in Docker and understand container orchestration, which enables you to streamline deployment processes and improve system performance. Your experience with Linux systems equips you with the skills to troubleshoot and optimize server environments effectively. You are familiar with monitoring tools such as Prometheus, which help you maintain visibility into application performance and reliability. You have a solid understanding of Terraform for infrastructure as code, allowing you to automate and manage cloud resources efficiently.
Desirable
Experience with incident management frameworks and best practices is a plus, as is familiarity with chaos engineering principles. You are comfortable engaging in architectural discussions and can recommend observability and alerting configurations that enhance system reliability. Your ability to collaborate with engineering teams to define service level objectives (SLOs) is essential for driving performance improvements across the organization.
What you'll do
In this role, you will lead efforts to enhance the reliability and performance of Affirm's applications. You will guide the development of SLOs and drive the incident management process, ensuring that teams are equipped to handle incidents effectively. You will engage in service and architectural conversations, providing insights that help shape the direction of our systems. Your responsibilities will include recommending observability and alerting configurations that improve our ability to respond to incidents and maintain system health. You will also build tooling and provide training to engineering teams, fostering a culture of reliability and resilience throughout the organization. Your work will directly impact the customer experience by ensuring that our services operate smoothly and efficiently.
What we offer
At Affirm, we are committed to creating a supportive and inclusive work environment. We offer competitive compensation and benefits, including flexible work arrangements that allow you to balance your professional and personal life. You will have the opportunity to work with a talented team of engineers who are passionate about building reliable systems and improving the customer experience. We encourage you to apply even if your experience doesn't match every requirement — your unique perspective and skills could be a great fit for our team. Join us in our mission to reinvent credit and make it more honest and friendly for consumers.
Similar Jobs You Might Like
Based on your interests and this role


Site Reliability Engineer
Affirm is seeking a Senior Site Reliability Engineer to enhance the reliability of their cloud infrastructure. You'll work with Kubernetes and automation tools to support Affirm's engineering teams. This role requires strong cloud engineering skills and experience in operational excellence.

Site Reliability Engineer
PandaDoc is seeking a Senior Site Reliability Engineer to ensure reliable service with minimal downtime. You'll manage incident processes, maintain observability tools, and contribute to service codebases using Python and Java. This role requires strong experience in AWS and Kubernetes.

Site Reliability Engineer
Jamf is hiring a Senior Site Reliability Engineer to balance development velocity with system stability using SRE best practices. You'll work with AWS, Docker, and Kubernetes in a remote role based in Poland.

Site Reliability Engineer
Assured is hiring a Staff Site Reliability Engineer to build efficient, reliable, secure, and scalable infrastructure. You'll work with AWS, Docker, and Kubernetes to automate the delivery of modern SaaS platforms. This position requires experience in a start-up environment.