Wikimedia

About Wikimedia

Empowering the world with free access to knowledge

Key Highlights

  • Operates Wikipedia, serving over 1.5 billion unique devices monthly
  • Headquartered in San Francisco, California
  • Funded by millions of individual donations averaging $15
  • 501(c)(3) tax-exempt nonprofit organization

The Wikimedia Foundation is the nonprofit organization behind Wikipedia, the world's largest online encyclopedia, and other free knowledge projects. Headquartered in San Francisco, California, Wikimedia operates with a mission to provide free access to knowledge for everyone. The foundation relies o...

🎁 Benefits

Wikimedia offers a flexible remote work policy, generous PTO, and a commitment to employee well-being. Employees also have access to professional deve...

🌟 Culture

Wikimedia fosters a culture of openness and collaboration, emphasizing the importance of free knowledge. The organization values community contributio...

Senior Site Reliability Engineer

Wikimedia, Remote

Overview

Wikimedia is hiring a Senior Site Reliability Engineer to operate and enhance systems supporting data-oriented teams. You'll work with technologies like Kubernetes, Hadoop, and Kafka to ensure system reliability and scalability.

Job Description

Who you are

You have 5+ years of experience in site reliability engineering, with a strong background in operating and optimizing complex distributed systems. Your expertise includes Kubernetes, Hadoop, and other data technologies, allowing you to effectively manage and scale systems to meet user demands.

You are skilled in monitoring system performance and resource utilization, proactively identifying and resolving sources of instability. Your experience with automation and streamlining operations helps improve efficiency and productivity for your team and users alike.

You thrive in collaborative environments, having worked with global teams and supported users in their technical work. Your ability to mentor peers and share knowledge strengthens the team's overall capabilities.

You are adaptable and open to learning, especially in remote work settings, and you are willing to travel occasionally for team gatherings and conferences. Your communication skills enable you to effectively interact with various stakeholders, ensuring alignment and understanding across teams.

Desirable

Experience with cloud platforms and infrastructure as code practices is a plus. Familiarity with incident management and response processes will help you excel in this role.

What you'll do

As a Senior Site Reliability Engineer at Wikimedia, you will be responsible for operating and enhancing the systems that support our data-oriented teams. You will work closely with engineering teams to design and implement new systems and solutions, ensuring that our infrastructure scales effectively to meet demand.

Your role will involve simplifying operations by standardizing deployment processes and leveraging virtualization and containerization techniques. You will monitor systems and services, optimizing performance and resource utilization to maintain high reliability.

You will proactively identify sources of instability in our distributed systems, analyzing failures and implementing solutions to enhance reliability and resilience. Your focus on automation will help streamline tasks and identify process gaps, contributing to a more efficient workflow.

Collaboration is key in this role, as you will work with a global team that communicates asynchronously. You will support your users by removing roadblocks and making them more productive, ensuring they can focus on their core tasks.

Mentoring peers in your areas of technical and operational strength will be an important aspect of your role, helping to elevate the team's overall expertise and performance.

What we offer

Wikimedia offers a supportive and inclusive work environment where you can grow your skills and make a meaningful impact. You will have the opportunity to work with cutting-edge technologies and contribute to projects that benefit millions of users worldwide. We encourage you to apply even if your experience doesn't match every requirement, as we value diverse perspectives and backgrounds.
