- Company Name
- IFS
- Job Title
- Lead Site Reliability Engineer | Copperleaf
- Job Description
-
**Job Title:** Lead Site Reliability Engineer (Azure)
**Role Summary:**
Senior SRE responsible for designing, implementing, and continuously improving Azure‑based infrastructure for high‑availability SaaS services. Leads automation initiatives, ensures reliability, scalability, and security, mentors junior engineers, and collaborates with development teams to embed operational excellence into the software development lifecycle.
**Expectations:**
- Own end‑to‑end reliability of mission‑critical Azure SaaS workloads.
- Drive automation and reduce manual toil across Cloud Operations.
- Establish and enforce SLO/SLI/SLA targets.
- Lead incident response, root‑cause analysis, and post‑mortem processes.
- Mentor and develop junior SRE/CloudOps staff.
- Evaluate and adopt new Azure services for performance, cost, and security gains.
**Key Responsibilities:**
- Architect and implement Azure infrastructure (App Services, AKS, Azure SQL, Storage, Networking, etc.) for high‑availability.
- Build and maintain CI/CD pipelines using Azure DevOps, Terraform, ARM/Bicep, and related tools.
- Develop and manage monitoring, alerting, and incident response frameworks (Azure Monitor, Log Analytics).
- Define and track reliability metrics (SLOs, SLIs, SLAs).
- Enforce security best practices: identity, access control, secret and certificate management.
- Conduct root‑cause analysis of production incidents and drive corrective actions.
- Produce and keep up‑to‑date architecture diagrams, runbooks, and operational documentation.
- Provide technical guidance and coaching to junior team members.
**Required Skills:**
- 5+ years SRE/DevOps/Cloud Operations experience; ≥3 years focused on Microsoft Azure.
- Deep knowledge of Azure services: App Service, AKS, Azure SQL, Storage, Networking, Security Center, Monitor.
- Strong scripting/automation: PowerShell, Python, Bash (or equivalent).
- Infrastructure‑as‑Code expertise: Terraform, ARM templates or Bicep.
- Proven incident management and root‑cause analysis capabilities.
- Experience defining and managing SLO/SLI/SLAs.
- Solid understanding of cloud security, IAM, secret management, and compliance.
- Excellent communication and mentoring skills.
**Required Education & Certifications:**
- Bachelor’s degree in Computer Science, Engineering, Information Technology, or related field (or equivalent practical experience).
- Preferred: Microsoft Certified: Azure Administrator Associate or Azure Solutions Architect Expert.
Staines-upon-thames, United kingdom
Remote
Senior
04-02-2026