About Wikimedia

Empowering the world with free knowledge access

Key Highlights

Operates Wikipedia, serving over 1.5 billion unique devices monthly
Headquartered in San Francisco, California
Funded by millions of individual donations averaging $15
501(c)(3) tax-exempt nonprofit organization

The Wikimedia Foundation is the nonprofit organization behind Wikipedia, the world's largest online encyclopedia, and other free knowledge projects. Headquartered in San Francisco, California, Wikimedia operates with a mission to provide free access to knowledge for everyone. The foundation relies o...

🎁 Benefits

Wikimedia offers a flexible remote work policy, generous PTO, and a commitment to employee well-being. Employees also have access to professional deve...

🌟 Culture

Wikimedia fosters a culture of openness and collaboration, emphasizing the importance of free knowledge. The organization values community contributio...

🌐 Website All 40 jobs →

Site Reliability Engineer • Senior

Wikimedia • Remote - Remote

Posted 3w ago🏠 Remote Senior Site Reliability Engineer

Apply Now →

Skills & Technologies

kubernetes hadoop opensearch airflow kafka

Overview

Wikimedia is hiring a Senior Site Reliability Engineer to operate and enhance systems for data-oriented teams. You'll work with technologies like Kubernetes, Hadoop, and Kafka to ensure system reliability and scalability. This role requires strong experience in distributed systems and automation.

Job Description

Who you are

You have 5+ years of experience in site reliability engineering or a related field, with a strong background in operating large-scale distributed systems. You are proficient in Kubernetes and have hands-on experience with data processing frameworks like Hadoop and OpenSearch. Your expertise extends to monitoring and optimizing system performance, ensuring that services are reliable and scalable.

You possess strong problem-solving skills and can proactively identify sources of instability in complex systems. Your ability to analyze incidents and implement effective solutions is key to your success in this role. You are comfortable working in a remote environment and can effectively collaborate with a global team.

You have a passion for automation and streamlining operations, and you enjoy mentoring peers in your areas of technical strength. Your communication skills enable you to support users effectively, helping them overcome challenges and improve productivity.

Desirable

Experience with data orchestration tools like Airflow and messaging systems like Kafka is a plus. Familiarity with cloud platforms and infrastructure as code practices will enhance your contributions to the team.

What you'll do

As a Senior Site Reliability Engineer at Wikimedia, you will be responsible for operating and enhancing the systems that support our data-oriented teams. You will work closely with engineering teams to design and implement new systems and solutions that meet the growing demands of our users. Your role will involve simplifying operations by standardizing deployment processes and leveraging containerization technologies.

You will monitor systems and services, optimizing performance and resource utilization to ensure reliability. Investigating incidents and analyzing how complex systems fail will be part of your daily responsibilities, allowing you to proactively address potential issues before they impact users.

Collaboration is key in this role, as you will work with a global team that communicates asynchronously. You will have the opportunity to mentor peers, sharing your knowledge and helping them grow in their technical and operational capabilities. Additionally, you may travel domestically or internationally a few times a year for team gatherings and conferences.

What we offer

Wikimedia offers a supportive and inclusive work environment where you can thrive as a Senior Site Reliability Engineer. You will have the chance to work on impactful projects that contribute to the mission of making knowledge freely available to everyone. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds.

Join us in our commitment to reliability and excellence in data platform engineering, and be part of a team that values collaboration, innovation, and continuous improvement.

Interested in this role?

Apply now or save it for later. Get alerts for similar jobs at Wikimedia.

Apply Now →Get Job Alerts

✨

Similar Jobs You Might Like

Based on your interests and this role

Site Reliability Engineer

Wikimedia•📍 Remote - Remote

Wikimedia is hiring a Senior Site Reliability Engineer to operate and enhance systems for data-oriented teams. You'll work with technologies like Kubernetes and Hadoop to ensure system reliability and scalability. This role requires strong experience in distributed systems and automation.

🏠 RemoteSenior

3w ago

Site Reliability Engineer

Wikimedia•📍 Remote - Remote

Wikimedia is hiring a Senior Site Reliability Engineer to operate and enhance systems supporting data-oriented teams. You'll work with technologies like Kubernetes, Hadoop, and Kafka to ensure system reliability and scalability.

🏠 RemoteSenior

3w ago

Circonus is hiring a Site Reliability Engineer to ensure the reliability of their SaaS and on-premise services. You'll work on automation, scalability, and performance improvements while collaborating with various departments. This role is fully remote.

🏠 Remote

4 years ago

Browse all jobs →