
About Groupon
Find amazing deals on experiences near you
Key Highlights
- Headquartered in Chicago, Illinois
- Over 300,000 deals available across various categories
- Approximately 3,000 employees
- Publicly traded since 2011 (NASDAQ: GRPN)
Groupon, headquartered in Chicago, Illinois, connects consumers with local businesses through its platform, offering over 300,000 deals across various categories including travel, dining, and entertainment. Founded in 2008, Groupon has served millions of customers and employs approximately 3,000 peo...
π Benefits
Groupon offers competitive salaries, stock options, flexible PTO, and a remote work policy to support work-life balance. Employees also benefit from w...
π Culture
Groupon fosters a culture of innovation and customer-centricity, encouraging employees to explore new ideas and solutions to enhance user experiences....
Skills & Technologies
Overview
Groupon is hiring a Principal Site Reliability Engineer to lead the evolution of their platform with a focus on AI-driven resilience. You'll design intelligent, self-healing systems to ensure reliable experiences for millions of customers. This role requires expertise in AI and infrastructure automation.
Job Description
Who you are
You have extensive experience in site reliability engineering, with a strong focus on building and maintaining high-availability systems. Your background includes designing self-healing systems that leverage AI and machine learning to predict and prevent incidents before they occur. You understand the importance of reliability in a marketplace environment and are passionate about creating seamless experiences for users.
You possess a deep understanding of infrastructure as code principles and have hands-on experience with automation tools. Your ability to collaborate with cross-functional teams allows you to effectively communicate technical concepts to non-technical stakeholders. You thrive in environments that encourage innovation and are eager to take risks to achieve results.
What you'll do
In this role, you will architect and maintain self-healing systems that meet availability targets of 99.9% or higher. You will utilize AI and machine learning techniques to automate infrastructure management, ensuring that systems are resilient and capable of handling millions of daily interactions. Your responsibilities will include monitoring system performance, conducting incident response, and implementing best practices for reliability engineering.
You will lead initiatives to modernize Groupon's global platform, driving the transition from reactive maintenance to proactive, predictive reliability. Collaborating closely with engineering teams, you will influence the design and implementation of new features that enhance system reliability and performance. Your expertise will be crucial in shaping the future of Groupon's technology stack and ensuring that local businesses can thrive in a competitive landscape.
What we offer
Groupon provides a hybrid work model that allows you to balance your professional and personal life. You will be part of a culture that values innovation and rewards risk-taking, giving you the autonomy to make a meaningful impact. We encourage you to apply even if your experience doesn't match every requirement, as we believe diverse teams build better products.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Groupon.
Similar Jobs You Might Like
Based on your interests and this role

Site Reliability Engineer
Groupon is seeking a Principal Site Reliability Engineer to lead the evolution of their global platform with a focus on AI-driven resilience. You'll design intelligent, self-healing systems to ensure high availability and reliability. This role requires expertise in AI and infrastructure automation.

Site Reliability Engineer
PandaDoc is hiring a Senior Site Reliability Engineer to ensure reliable service with minimal downtime. You'll manage incident processes and contribute to service codebases using Python and Java. This role requires strong experience with AWS and Kubernetes.

Site Reliability Engineer
PandaDoc is seeking a Senior Site Reliability Engineer to ensure reliable service with minimal downtime. You'll manage incident processes, maintain observability tools, and contribute to service codebases using Python and Java. This role requires strong experience in AWS and Kubernetes.

Site Reliability Engineer
N26 is seeking a Senior Site Reliability Engineer to enhance the reliability and scalability of their AI Platform infrastructure. You'll work with cloud infrastructure, networking, and CI/CD practices. This role requires expertise in SRE principles and a passion for AI technologies.

Site Reliability Engineer
amo is hiring a Lead Site Reliability Engineer (SRE) to ensure their systems handle high traffic and maintain performance and reliability. You'll work with technologies like ScyllaDB and focus on automation and system design. This role requires strong leadership and experience in distributed systems.