Cleo

Platform Engineer (SRE)

Hybrid

London, United Kingdom

Full Time

03-10-2025

Skills

Leadership Ruby DevOps Kubernetes Monitoring Test Coaching Redis Terraform Prometheus Grafana Infrastructure as Code Postgres

Job Specifications

About The Company

About Cleo

At Cleo, we're not just building another fintech app. We're embarking on a mission to fundamentally change humanity's relationship with money. Imagine a world where everyone, regardless of background or income, has access to a hyper-intelligent financial advisor in their pocket. That's the future we're creating.

Cleo is a rare success story: a profitable, fast-growing unicorn with over $200 million in ARR and growing over 2x year-over-year. This isn't just a job; it's a chance to join a team of brilliant, driven individuals who are passionate about making a real difference. We have an exceptionally high bar for talent, seeking individuals who are not only at the top of their field but also embody our culture of collaboration and positive impact.

If you're driven by complex challenges that push your expertise, the chance to shape something truly transformative, and the potential to share in Cleo's success as we scale, this might be your perfect fit.

Follow us on LinkedIn to keep up to date with new product features and insights from the team.

About The Role

Platform Engineer (SRE)

(Alternative titles: Platform Reliability Engineer, Site Reliability Engineer, Platform Engineer, Software Engineer (Platform))

Role Overview

As a Site Reliability/Platform Engineer, you will be an integral part of our platform team, responsible for the infrastructure and tooling that empowers our product teams to deliver Cleo. Your focus will be on ensuring reliability, scalability, and performance. You will proactively identify areas for improvement, possess a deep understanding of user-centric measurement and optimization, and collaborate closely with other product teams on their releases to offer your support.

What You Will Be Doing

Platform Tooling & Best Practices: Build and iterate on tooling and best practices, including SLIs/SLOs, to gain a deep understanding of Cleo's performance and enable product squads to develop effectively.
Monitoring & Alerting Coaching: Coach our engineering squads on the effective implementation of monitoring and alerting.
Incident Learning & Resilience: Drive a culture of post-incident learning and proactive resilience testing.
Scaling & Optimization:
Identify and test for potential scaling challenges across codebase, compute, and data infrastructure.
Develop and implement best practices and tools for scaling the Cleo Ruby monolith, including new patterns for the wider engineering team to adopt.
Deeply understand and optimize platform costs.
Infrastructure as Code (IaC): Develop our IaC (Terraform) to ensure modularity and ease of use.
Engineering Team Support: Partner with and support the wider engineering team on best practices and troubleshooting their use of platform tools.
Deployment Pipeline Management: Manage and optimize deployment pipelines in CircleCI.
Infrastructure Management: Manage the infrastructure Cleo runs on: RDS (Postgres), Redis, and Heroku/EKS.

What Skills Do You Need

4+ years in SRE, DevOps, or platform-engineering roles, specifically with high-traffic services.
Deep, hands-on expertise with Amazon EKS/Kubernetes, RDS/Postgres, and Redis.
Proven track record in setting SLIs/SLOs and conducting error-budget reviews.
Experience instrumenting with Prometheus, Grafana, Coralogix, Sentry and OpenTelemetry.
Knowledge of Ruby on Rails.
Strong leadership in incident command and post-mortems.
Passion for continuous improvement and challenging the status quo to identify areas for enhancement.

What do you get for all your hard work?

A competitive compensation package (base + equity) with bi-annual reviews, aligned to our quarterly OKR planning cycles. You can view our public progression framework and salary bandings here: https://cleo-ai.progressionapp.com/
Work at one of the fastest-growing tech startups, backed by top VC firms Balderton and EQT Ventures.
A clear progression plan. We want you to keep growing. That means trying new things, leading others,

About the Company

Cleo. Where AI meets money. Cleo was launched in 2016 by people who had been burned by banks. Fed up with the broken system, we decided it was time to make a change. Since then, the cost of living has skyrocketed, the rich have continued getting richer, and our vision has only become clearer. Our mission? To change the world's relationship with money. Cleo is a platform for the 99% – an AI assistant defining a new category, one that goes beyond saving and budgets to actually changing how we feel about our finances. Usi...