
About Wikimedia
Empowering the world with free knowledge access
Key Highlights
- Operates Wikipedia, serving over 1.5 billion unique devices monthly
- Headquartered in San Francisco, California
- Funded by millions of individual donations averaging $15
- 501(c)(3) tax-exempt nonprofit organization
The Wikimedia Foundation is the nonprofit organization behind Wikipedia, the world's largest online encyclopedia, and other free knowledge projects. Headquartered in San Francisco, California, Wikimedia operates with a mission to provide free access to knowledge for everyone. The foundation relies o...
🎁 Benefits
Wikimedia offers a flexible remote work policy, generous PTO, and a commitment to employee well-being. Employees also have access to professional deve...
🌟 Culture
Wikimedia fosters a culture of openness and collaboration, emphasizing the importance of free knowledge. The organization values community contributio...
Skills & Technologies
Overview
Wikimedia is hiring a Senior Site Reliability Engineer to operate and enhance systems for data-oriented teams. You'll work with technologies like Kubernetes, Hadoop, and Kafka to ensure system reliability and scalability. This role requires strong experience in distributed systems and automation.
Job Description
Who you are
You have 5+ years of experience in site reliability engineering or a related field, with a strong background in operating large-scale distributed systems. You are proficient in Kubernetes and have hands-on experience with data processing frameworks like Hadoop and OpenSearch. Your expertise extends to monitoring and optimizing system performance, ensuring that services are reliable and scalable.
You possess strong problem-solving skills and can proactively identify sources of instability in complex systems. Your ability to analyze incidents and implement effective solutions is key to your success in this role. You are comfortable working in a remote environment and can effectively collaborate with a global team.
You have a passion for automation and streamlining operations, and you enjoy mentoring peers in your areas of technical strength. Your communication skills enable you to support users effectively, helping them overcome challenges and improve productivity.
Desirable
Experience with data orchestration tools like Airflow and messaging systems like Kafka is a plus. Familiarity with cloud platforms and infrastructure as code practices will enhance your contributions to the team.
What you'll do
As a Senior Site Reliability Engineer at Wikimedia, you will be responsible for operating and enhancing the systems that support our data-oriented teams. You will work closely with engineering teams to design and implement new systems and solutions that meet the growing demands of our users. Your role will involve simplifying operations by standardizing deployment processes and leveraging containerization technologies.
You will monitor systems and services, optimizing performance and resource utilization to ensure reliability. Investigating incidents and analyzing how complex systems fail will be part of your daily responsibilities, allowing you to proactively address potential issues before they impact users.
Collaboration is key in this role, as you will work with a global team that communicates asynchronously. You will have the opportunity to mentor peers, sharing your knowledge and helping them grow in their technical and operational capabilities. Additionally, you may travel domestically or internationally a few times a year for team gatherings and conferences.
What we offer
Wikimedia offers a supportive and inclusive work environment where you can thrive as a Senior Site Reliability Engineer. You will have the chance to work on impactful projects that contribute to the mission of making knowledge freely available to everyone. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds.
Join us in our commitment to reliability and excellence in data platform engineering, and be part of a team that values collaboration, innovation, and continuous improvement.
Interested in this role?
Apply now or save it for later. Get alerts for similar jobs at Wikimedia.
Similar Jobs You Might Like
Based on your interests and this role

Site Reliability Engineer
Wikimedia is hiring a Senior Site Reliability Engineer to operate and enhance systems for data-oriented teams. You'll work with technologies like Kubernetes and Hadoop to ensure system reliability and scalability. This role requires strong experience in distributed systems and automation.

Site Reliability Engineer
Wikimedia is hiring a Senior Site Reliability Engineer to operate and enhance systems supporting data-oriented teams. You'll work with technologies like Kubernetes, Hadoop, and Kafka to ensure system reliability and scalability.

Site Reliability Engineer
Wikimedia is hiring a Senior Site Reliability Engineer to operate and enhance systems for data-oriented teams. You'll work with technologies like Kubernetes, Hadoop, and Kafka to ensure system reliability and scalability. This role requires significant experience in SRE practices.

Site Reliability Engineer
Wikimedia is hiring a Senior Site Reliability Engineer to operate and enhance systems for data-oriented teams. You'll work with technologies like Kubernetes, Hadoop, and Kafka to ensure system reliability and scalability. This role requires strong experience in managing distributed systems.

Site Reliability Engineer
Circonus is hiring a Site Reliability Engineer to ensure the reliability of their SaaS and on-premise services. You'll work on automation, scalability, and performance improvements while collaborating with various departments. This role is fully remote.