
About Instructure
Empowering education through innovative technology solutions
Key Highlights
- Creator of the Canvas Learning Management Platform
- Market cap of $1 billion and $5.5 million in funding
- Headquartered in Cottonwood Heights, UT
- Acquired Concentric Sky and LearnPlatform for enhanced features
Instructure, headquartered in Cottonwood Heights, UT, is the creator of the Canvas Learning Management Platform, widely adopted by educational institutions from pre-school to higher education. Founded in 2008, Instructure has achieved a market cap of $1 billion and raised $5.5 million in funding. Th...
Benefits
Instructure offers a comprehensive benefits package including equity and 401k options, medical, dental, and life insurance, as well as a flexible work...
Culture
Instructure fosters a unique culture with a strong emphasis on research and development, dedicating much of its workforce to innovation. The company v...

Site Reliability Engineer • Senior
Instructure • Budapest
Overview
Instructure is seeking a Senior Site Reliability Engineer to build and maintain a reliable and scalable infrastructure on AWS. You'll work with technologies like Docker, Kubernetes, and Terraform to automate operations and ensure platform stability.
Job Description
Who you are
You have a strong background in site reliability engineering with at least 5 years of experience in building and maintaining scalable infrastructure. Your expertise in AWS is complemented by a solid understanding of automation practices, allowing you to tackle operational challenges effectively. You are passionate about operational excellence and have a proven track record of applying a software engineering mindset to improve system reliability and performance.
Your experience with modern technologies such as Docker and Kubernetes enables you to manage containerized applications efficiently. You are familiar with infrastructure-as-code and secrets-management tools such as Terraform and Vault, which you use to automate deployments and manage secrets securely. Your programming skills in Ruby and Go allow you to develop scripts and tools that enhance operational workflows.
You thrive in collaborative environments and enjoy working closely with cross-functional teams to drive improvements in system reliability. Your communication skills help you articulate complex technical concepts to non-technical stakeholders, ensuring everyone is aligned on operational goals. You are committed to continuous learning and staying updated with industry best practices in site reliability engineering.
Desirable
Experience with monitoring and alerting tools such as Prometheus or Grafana would be a plus. Familiarity with CI/CD pipelines and incident management processes is also beneficial. You understand the importance of security in operations and are knowledgeable about best practices in securing cloud environments.
What you'll do
As a Senior Site Reliability Engineer at Instructure, you will play a crucial role in building and maintaining a highly reliable and scalable infrastructure on AWS. You will focus on automating operational tasks to ensure that our platform remains stable and responsive as we grow. Your responsibilities will include designing and implementing infrastructure solutions that meet the needs of our applications and users.
You will collaborate with development teams to integrate reliability into the software development lifecycle, ensuring that new features are designed with operational considerations in mind. Your expertise will guide the team in adopting best practices for system architecture and deployment strategies, helping to minimize downtime and improve performance.
You will be responsible for monitoring system performance and implementing alerting mechanisms to proactively identify and resolve issues before they impact users. Your role will involve conducting post-incident reviews to analyze failures and develop strategies to prevent recurrence, fostering a culture of continuous improvement within the team.
In addition to your technical responsibilities, you will mentor junior engineers, sharing your knowledge and experience to help them grow in their roles. You will contribute to the development of documentation and training materials that support operational excellence across the organization.
What we offer
At Instructure, we believe in empowering our employees to grow and succeed. We offer a collaborative work environment where your contributions are valued and recognized. You will have access to professional development opportunities to enhance your skills and advance your career.
We provide competitive compensation and benefits, including health insurance, retirement plans, and flexible work arrangements. Our culture emphasizes work-life balance, and we encourage you to take time for personal development and well-being. Join us in our mission to simplify learning and personal development through innovative technology.