- Company Name
- Galent
- Job Title
- Site Reliability Engineer - In person interview
- Job Description
-
Job Title: Site Reliability Engineer - In person interview
Role Summary:
Architect, deploy, and maintain production‑grade Kubernetes environments across major cloud providers, ensuring high availability, security, and cost efficiency while enabling continuous delivery through GitOps and Infrastructure as Code.
Expactations:
- Deliver reliable, zero‑downtime services in a multi‑cluster, hybrid‑cloud setting.
- Consistently enforce security, compliance, and operational best practices.
- Mentor junior teams and elevate overall SRE competency.
- Resolve incidents swiftly with thorough root‑cause analysis and preventive actions.
Key Responsibilities:
- Design, provision, and scale EKS, AKS, GKE, or OpenShift clusters with Terraform, Pulumi, or Ansible.
- Build and optimize CI/CD pipelines; implement GitOps using ArgoCD, Flux, or similar tools.
- Manage Kubernetes networking, storage, RBAC, Pod Security Standards, and vulnerability scanning.
- Deploy observability stack (Prometheus, Grafana, ELK, OpenTelemetry) and automate monitoring alerts.
- Drive cost‑optimization for Kubernetes workloads and resource usage.
- Lead incident response, root‑cause analysis, capacity planning, and post‑mortem documentation.
- Collaborate with architecture, development, and security teams on cloud‑native strategy.
- Mentor senior and mid‑level engineers on best practices.
Required Skills:
- 6+ years in DevOps/SRE, 4+ years Kubernetes (cloud‑native).
- Deep expertise in Kubernetes networking, storage, RBAC, security, Helm, Operators, service mesh (Istio/Linkerd).
- Advanced proficiency in Terraform, Pulumi, and Ansible.
- Strong scripting/programming in Go, Python, or Bash.
- Extensive experience on AWS, Azure, or GCP with multi‑cluster/hybrid deployments.
- Proficient with GitOps tools (ArgoCD, Flux).
- Proven leadership and mentoring capability.
Required Education & Certifications:
- Certified Kubernetes Administrator (CKA) and/or Certified Kubernetes Security Specialist (CKS).
- Bachelor’s degree in Computer Science, Engineering, or related field (preferred).