cover image
McCain Foods

Site Reliability Engineer

On site

Toronto, Canada

Senior

Full Time

03-12-2025

Share this job:

Skills

Leadership Go Incident Response Cloud Security SAP CI/CD Architecture Enterprise Architecture Organization Azure CI/CD Pipelines Terraform Infrastructure as Code Microservices

Job Specifications

Position Title: Site Reliability Engineer

Position Type: Regular - Full-Time

Position Location: Toronto HQ

Requisition ID: 36904

Our Global Technology team’s goal is to leverage technology and data to drive profitable growth, focus on enhancing customer experience and to further our purpose of 'Celebrating real connections through delicious, planet-friendly food'. McCain has embarked on an ambitious digital transformation across our business from Agriculture to Manufacturing and commercial capabilities to enhance our customer obsession. As part of this transformation, we are making significant investments in our digital platforms, technology transformations, and in building a data driven culture. We are building digital products for our customers, suppliers/growers, and McCain team members to enable digital processes and data-driven automation. Through our investments, we will transform McCain into a company that empowers our teams with easy-to-use systems which will help them collaborate better, be productive and make data driven decisions. Will you be part of this exciting journey?

About The Role.

The Site Reliability Engineer will ensure the reliability and availability of software systems by designing resilient architectures, automating infrastructure management, and implementing effective incident response processes. By optimizing system performance, capacity, and automation, contribute to the organization's ability to deliver reliable and scalable services, enhancing the user experience and minimizing downtime.

What You'll Be Doing.

Architect, design, and implement reliable and scalable systems in Azure cloud.
Instrument distributed systems using OpenTelemetry for logs, metrics, and traces — embedding observability into the codebase across microservices and critical applications.
Drive SAP observability strategy as we migrate to SAP RISE — integrating New Relic with SAP to provide full-stack visibility, performance insights, and business-critical alerting.
Automate infrastructure and operations at scale using Infrastructure as Code (Terraform or Bicep), CI/CD pipelines, and self-healing systems to reduce manual toil.
Collaborate with developers and platform teams to define and implement SLOs, SLIs, and Error Budgets, and embed SRE practices across product teams.
Lead incident response and root cause analysis, building a blameless culture while hardening systems against future failures.
Contribute to the evolution of McCain’s SRE playbooks, tooling, and engineering standards as a founding member of the global SRE practice.

What You'll Need To Be Successful.

Bachelor's Degree in related field, such as Computer Science or related technical field
7+ years of software engineering experience, including at least 5 year working experience as a Site Reliability Engineer accountable for SLOs.
Experience with deployment and development on Azure
Experience in Continuous Delivery methodologies and tools.
Good knowledge on resiliency patterns and cloud security
Experience troubleshooting issues with users and ability to collaborate effectively with cross-functional teams.
Any certifications on Azure presferred

Measures Of Success

Resilience: Services meet or exceed availability targets across mission-critical systems (including SAP).
Observability: End-to-end visibility with actionable dashboards and alerts.
Automation: Manual tasks are eliminated via tooling and scripts; infrastructure is code-first.
Influence: SRE principles are embedded into our engineering culture, with you as a key driver.
Trust: Stakeholders see you as the go-to expert for system reliability and design best practices.

About The Team.

Reporting to the Senior Engineering Manager, SRE & Observability, this role will require collaboration with several key internal stakeholders, including Infrastructure & Operations, SAP, Application Support, DevSecOps, the Cloud Center of Excellence, InfoSec, Enterprise Architecture, Regional HR Teams, the Global Leadership Team, and Corporate Communications. Externally, the position interacts with various vendors. Travel may be required, and the role is primarily performed in a standard office environment.

About McCain.

Click Here to learn more about McCain and how we provide you with opportunities to make an impact that matters.

Leadership principles.

At McCain, our leadership principles guide how we engage with customers, collaborate as a team, and achieve success. We focus on understanding customer needs, driving innovation, empowering people, and taking ownership to clear obstacles and deliver results.

The McCain Experience.

We are McCain. This statement is a testament to our collective strength and our individual value. Your contributions play a vital role in our success. Our winning culture is rooted in authenticity and trust, empowering us to bring out the best in one another. Here, you’ll find opportunities to learn, grow, and thrive. Join us and experience why we’re better together.

About the Company

At McCain, we believe food plays an important role in people’s lives, with the power to bring individuals, families, and communities together. As a privately owned family company with over 67 years of experience, a presence in over 160 countries, and a global team of 23,000+ people, our values and culture are at the heart of everything we do. Our product quality, people and customer dedication help us achieve global sales in excess of CDN $14 billion. Through our investment and innovation agenda, we continue to be a globa... Know more