- Company Name
- Devoteam
- Job Title
- SRE (Site Reliability Engineer) - H/F
- Job Description
-
**Job Title:** Site Reliability Engineer (SRE) – H/F
**Role Summary:**
Integrate reliability practices into the software development lifecycle, ensuring high performance, availability, and rapid deployment of multi‑cloud platforms. Collaborate with development, security, and operations teams to implement CI/CD, observability, and incident response processes that meet defined SLOs/SLIs.
**Expectations:**
- Minimum 5 years of professional experience in IT operations with at least 2 years as an SRE.
- Proven ability to work in fast‑paced, agile environments and adapt to evolving technologies.
- Strong communication skills in French and English, capable of leading cross‑functional discussions and presenting technical concepts to diverse stakeholders.
**Key Responsibilities:**
- Design, implement, and maintain CI/CD pipelines, automated rollback mechanisms, and testing frameworks that enforce reliability standards.
- Conduct post‑mortem analyses, root‑cause investigations, and recommend architectural or process improvements to eliminate recurring failures.
- Deploy and configure observability tools (e.g., Dynatrace, Datadog, Prometheus, Grafana, ELK, Splunk) to monitor application and infrastructure metrics.
- Define, track, and report on SLOs/SLIs, ensuring alignment with business objectives.
- Perform resilience testing, disaster‑recovery drills, and define RPO/RTO/RTW objectives.
- Stay current on reliability best practices, emerging technologies, and industry trends; advocate for adoption within the organization.
**Required Skills:**
- Monitoring & log management: Dynatrace, Datadog, Prometheus, Grafana, ELK Stack, Splunk.
- Scripting/programming: Python, Bash, Go, or Rust.
- Container orchestration: Docker, Kubernetes.
- Cloud platforms: AWS, GCP, Azure.
- Infrastructure‑as‑Code: Terraform, Ansible (or equivalent).
- Strong analytical mindset, problem‑solving, and incident‑management experience.
- Effective written and verbal communication in French and English.
**Required Education & Certifications:**
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent professional experience).
- Relevant certifications are a plus: AWS Certified SysOps Administrator, GCP Professional Cloud DevOps Engineer, Azure DevOps Engineer, Kubernetes Administrator, or equivalent.