- Company Name
- Saransh Inc
- Job Title
- DevOps/SRE
- Job Description
-
**Job Title**
DevOps / SRE
**Role Summary**
Lead and operate enterprise‑grade OpenShift platforms, integrating DevOps practices, performance testing, and infrastructure security. Design, build, and maintain scalable, reliable Kubernetes clusters across on‑prem and cloud environments, providing self‑service capabilities to developers via GitOps. Conduct performance engineering to validate application behavior under load and refine platform resilience.
**Expectations**
- 10+ years in DevOps, SRE, or platform engineering; 6+ years Kubernetes; 3+ years hands‑on OpenShift 4.x at scale.
- Proven track record in multi‑cluster governance, cluster upgrades, and automation.
- Ability to deliver performance tests (JMeter, load, stress, endurance) and monitor with Dynatrace, Grafana, or Kibana.
- Strong security posture including RBAC, SCC/PSA, network policies, supply‑chain controls, and vulnerability mitigation.
- Experience with IaC (Terraform, Ansible) and GitOps tools (Argo CD, Tekton, Helm/Kustomize).
- Red Hat certification preferred (RHCSA, RHCE, EX280/EX288).
**Key Responsibilities**
- Design, deploy, and operate OpenShift clusters (IPI/UPI) across on‑prem and multi‑cloud.
- Manage day‑2 operations: machine configs, upgrades, node pool creation, disaster recovery, backup/restore (Velero, OADP) and storage tuning (ODF/OCS, Ceph, Portworx).
- Implement multi‑cluster governance with ACM/OCM and enforce policies (Gatekeeper/Kyverno, compliance operator/OpenSCAP).
- Enable continuous delivery pipelines: Argo CD (app‑of‑apps), Tekton/OpenShift Pipelines, container registry management (Quay/Harbor), image signing, SBOM, vulnerability scanning (RHACS, Trivy).
- Conduct performance testing: build JMeter scripts, analyze metrics with Dynatrace/Grafana/Kibana, recommend capacity and scaling actions.
- Apply network and ingress management: OVN‑Kubernetes, Multus, Ingress Controllers, L4/L7 load balancing, DNS/TLS.
- Maintain security and compliance: RBAC, SCC/PSA, network policies, secrets (External Secrets/Vault), SSO (Keycloak), and service mesh (Istio).
- Collaborate with developers, architects, and business stakeholders to translate performance and reliability requirements into platform capabilities.
- Lead platform upgrades, scale operations, and mentor junior team members.
**Required Skills**
- Linux (RHEL), networking (TCP/IP, DNS, TLS, routing), storage fundamentals.
- Kubernetes/K8s expertise and OpenShift 4.x operations at scale.
- Infrastructure‑as‑Code: Terraform, Ansible.
- GitOps & CI/CD: Argo CD, Tekton, Helm/Kustomize, Operators.
- Performance testing tools: Apache JMeter; monitoring: Dynatrace, Grafana, Kibana.
- Security: RBAC, SCC/PSA, network policies, supply‑chain controls, vulnerability remediation.
- Storage: ODF/OCS, Ceph, Portworx; backup/restore (Velero, OADP).
- Multi‑cluster governance: ACM/OCM.
- Service mesh: Istio/Red Hat Service Mesh.
- SSO: Keycloak.
- Cloud integration: AWS/Azure/GCP (LB, DNS, IAM, secrets).
- External secrets management: Vault, External Secrets.
**Required Education & Certifications**
- Bachelor’s degree in Computer Science, Information Technology, or related field (or equivalent experience).
- Red Hat certifications: RHCSA, RHCE, EX280/EX288 highly preferred.
- Additional certifications in Kubernetes (CKA/CKAD) or Cloud (AWS/Azure/GCP) are a plus.