- Company Name
- Techfellow Limited
- Job Title
- Senior Cloud Engineer - Infrastructure & Platform Services | Dynamic Asset Management Leader
- Job Description
-
Job Title: Senior Cloud Engineer – Infrastructure & Platform Services
Role Summary:
Lead the design, implementation, and continuous improvement of large‑scale public cloud environments (primarily AWS, with Azure exposure). Drive automation, platform standardisation, and operational excellence while mentoring junior team members.
Expectations:
- Deliver resilient, secure, and scalable cloud architectures that support trading and research workloads.
- Mentor, coach, and influence engineering best practices within the team.
- Act as a technical advocate for cloud adoption and cost‑optimisation.
Key Responsibilities:
- Design, deploy, and evolve AWS cloud environments; integrate hybrid on‑premise components.
- Build and maintain IaC solutions using Terraform (or CloudFormation) with Python/Bash scripting.
- Support Kubernetes (EKS/AKS/GKE) clusters: lifecycle, security, upgrades, and CI/CD pipelines.
- Create self‑service portals and automation frameworks to accelerate resource provisioning.
- Ensure consistent user experience across on‑premise and cloud systems; enforce standards.
- Develop monitoring/observability stacks (Grafana, Prometheus, OpenTelemetry).
- Maintain operational documentation and architecture artefacts.
- Participate in on‑call rotations for high‑availability production support.
- Evaluate emerging cloud services and recommend enhancements.
Required Skills:
- 6+ years designing, launching, and managing enterprise‑scale cloud environments (AWS).
- Strong IaC expertise: Terraform, CloudFormation, or equivalent.
- Deep knowledge of cloud services (compute, networking, storage, IAM).
- Hands‑on Kubernetes proficiency (EKS/AKS/GKE, security, upgrades, workload orchestration).
- Python or Bash scripting for automation, CI/CD, and configuration management.
- Linux systems administration (RHEL‑based).
- Hybrid networking: VPN, load balancers, routing, security.
- Container technologies (Docker, Podman).
- FinOps awareness: cost optimisation and cloud utilisation.
- Monitoring/observability experience (Grafana, Prometheus, OpenTelemetry).
- Backup, DR, and business continuity in cloud environments.
- Preferred: Ansible/Puppet, AI/ML or HPC workload experience.
Required Education & Certifications:
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
- Preferred certifications: AWS Solutions Architect, AWS SysOps Administrator, Kubernetes CKA/CKAD.