- Company Name
- E-Space
- Job Title
- Site Reliability Engineer (SRE) / DevOps Engineer
- Job Description
-
Role Summary:
Design, operate, and scale mission‑critical software systems on AWS with a focus on reliability, performance, and security. Lead infrastructure automation, CI/CD pipelines, monitoring, and incident response to ensure highly available, scalable services that support space‑connectivity operations.
Expectations:
* 5+ years in SRE, DevOps, or platform engineering roles.
* Proven experience designing and operating highly available, mission‑critical AWS services.
* Strong ownership of system reliability, including SLIs/SLOs, error budgets, and incident handling.
Key Responsibilities:
* Architect and maintain scalable, highly available applications on AWS.
* Deploy and manage containerized workloads on Amazon EKS; manage Helm charts.
* Build and maintain Terraform (or OpenTofu) code for AWS infrastructure.
* Develop, maintain, and optimize CI/CD pipelines (Bitbucket preferred).
* Implement monitoring, alerting, and observability with CloudWatch, Prometheus, Grafana.
* Define SLIs/SLOs, error budgets, and incident response procedures.
* Automate security, governance, and compliance controls in the cloud.
* Participate in on‑call rotation and lead production incident response.
* Collaborate with engineering teams to improve deployment, scaling, and operations.
Required Skills:
* Advanced proficiency in Terraform (or OpenTofu) for IaC.
* Deep experience with Kubernetes, EKS, Helm, and container orchestration.
* Strong experience developing and managing CI/CD pipelines (Bitbucket preferred).
* Proficient in Python and Bash scripting for automation.
* Experience with monitoring/observability tools: Prometheus, Grafana, ELK stack.
* Knowledge of capacity planning, performance optimization, and database scaling (RDS, Aurora).
* Familiarity with GitOps tools (ArgoCD, Flux) and service mesh technologies (Istio, Linkerd) is a plus.
Required Education & Certifications:
* Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
* Certifications such as AWS Certified Solutions Architect – Professional or Certified Kubernetes Administrator (CKA), or equivalent, are highly desirable.