- Company Name
- Phenom
- Job Title
- DevOps Engineer (Onsite/Hybrid)
- Job Description
-
Job Title: DevOps Engineer (Onsite/Hybrid)
Role Summary:
Deliver end‑to‑end operational excellence for a cloud‑native platform by building, deploying, and maintaining highly reliable, scalable infrastructure and services. Partner with Engineering, Product, and Platform teams to manage production changes, automate processes, and embed security in CI/CD pipelines.
Expectations:
- Proactive ownership of cloud infrastructure reliability, performance, and cost optimization.
- Seamless coordination across cross‑functional teams for change approval, incident response, and post‑incident improvement.
- Continuous improvement of DevOps practices, tooling, and observability to meet SLIs/SLOs/SLAs.
Key Responsibilities
- Ensure high availability, performance, and capacity planning for cloud components, databases, and services.
- Integrate security controls into CI/CD, perform vulnerability scans, and handle security incidents.
- Design and maintain auto‑scaling, load balancing, and capacity‑planning patterns.
- Implement GitOps workflows for immutable infrastructure and auto‑deployment via ArgoCD/Helm.
- Collaborate with platform engineering to enable developer tools and automate build/run pipelines.
- Drive structured change management using ServiceNow, including risk assessment, approval, and rollback plans.
- Act as a technical responder during major incidents, coordinating communication and restoration activities.
- Participate in root‑cause analysis and contribute corrective/preventive actions (CAPA).
- Support production releases, database schema migrations, and rollbacks, ensuring release readiness reviews and Go/No‑Go sign‑offs.
Required Skills
- 5+ years of Cloud Ops/DevOps/SRE/Software Engineering in production environments.
- Proficient in one or more Scripting/Programming languages (Python, JavaScript/TypeScript, Java).
- Deep experience with Kubernetes, ArgoCD, Helm, and service meshes (Linkerd/Istio/Nginx).
- Expertise in public cloud platforms (AWS, GCP, or Azure) with compute, networking, and storage.
- Hands‑on with Kafka, Redis, MongoDB, PostgreSQL/MySQL/Aurora.
- CI/CD pipeline design, container orchestration, and Infrastructure‑as‑Code (IaC).
- Strong Linux system administration and troubleshooting.
- Familiarity with observability stack (metrics, logs, tracing) for applications and databases.
- Knowledge of ServiceNow change‑management processes.
- Understanding of SLIs, SLOs, error budgets, incident management, and RCA.
- Experience with database reliability or mission‑critical database support is a plus.
Required Education & Certifications
- Bachelor’s degree in Computer Science, Engineering, or equivalent technical field.
- Professional certifications such as AWS Certified DevOps Engineer, GCP Professional DevOps Engineer, Azure DevOps Engineer Expert, or Kubernetes Certified Application Developer (CKAD/CKA).
- Certifications in security (e.g., CISSP, GCISEC) or ITSM (e.g., ITIL Foundation) are advantageous.