- Company Name
- BrightAI
- Job Title
- Senior Engineer, DevOps
- Job Description
-
Job title: Senior DevOps Engineer
Role Summary: Design, implement, and maintain a high‑scale cloud platform that processes millions of event streams daily. Lead infrastructure, release engineering, and automation initiatives across multiple AWS accounts, ensuring reliability, scalability, and operational excellence.
Expectations:
- 4+ years in software engineering or equivalent hands‑on experience.
- Expertise in AWS cloud services, Kubernetes, Terraform, and CI/CD pipelines.
- Strong background in system operations, monitoring, and cost optimization.
- Proactive about learning emerging cloud technologies and applying best practices.
Key Responsibilities:
- Build and nurture self‑service tools, processes, and documentation for engineering teams.
- Define and enforce infrastructure‑as‑code standards and patterns using Terraform, CloudFormation, Helm, Kustomize, or Ansible.
- Manage and support software releases, automate build, test, and deployment pipelines.
- Maintain multi‑region, high‑availability services on AWS (EKS, ECS, Lambda, RDS, MSK, etc.).
- Operate and monitor large Kafka clusters, ensuring performance and fault tolerance.
- Create comprehensive dashboards, alerts, and monitoring solutions (Datadog, Prometheus, Grafana, etc.).
- Optimize cloud spend with reserved instances, cost‑explorer, and budget monitoring.
- Stay current on cloud trends and prototype new solutions to improve platform resilience.
Required Skills:
- AWS services (EKS, EC2, VPC, Route53, S3, RDS/Postgres, MSK, Lambda, ECS, DocumentDB).
- Kubernetes cluster deployment, scaling, and management.
- Infrastructure‑as‑code (Terraform, CloudFormation, Helm, Kustomize, Ansible).
- Scripting (Python, Go, TypeScript, JavaScript, Bash).
- CI/CD tooling (GitHub Actions, AWS CodePipeline, Jenkins, etc.).
- Monitoring & alerting (Datadog, Prometheus, Grafana, OpsGenie, SumoLogic).
- Multi‑region, multi‑AZ high‑availability architecture.
- Linux administration, networking, and shell scripting.
- Agile development practices.
Required Education & Certifications:
- Bachelor’s or Master’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
- AWS Certified Solutions Architect or equivalent professional certifications preferred.