cover image
FactSet

Lead Site Reliability Engineer - (DevOps, AWS Focus, PostgreSQL, CI/CD Systems) - Hybrid

On site

Toronto, Canada

Senior

Full Time

01-10-2025

Share this job:

Skills

Communication PostgreSQL Incident Response GitHub GitLab CI/CD DevOps Kubernetes Monitoring Roadmap planning Decision-making Networking Databases AWS Software Development Analytics Snowflake CI/CD Pipelines Gitlab CI Terraform Express Prometheus Grafana Infrastructure as Code GitHub Actions

Job Specifications

FactSet creates flexible, open data and software solutions for over 200,000 investment professionals worldwide, providing instant access to financial data and analytics that investors use to make crucial decisions.

At FactSet, our values are the foundation of everything we do. They express how we act and operate, serve as a compass in our decision-making, and play a big role in how we treat each other, our clients, and our communities. We believe that the best ideas can come from anyone, anywhere, at any time, and that curiosity is the key to anticipating our clients' needs and exceeding their expectations.

About Irwin

Irwin, a FactSet Company, is a leading provider of capital markets-focused financial technology with a mission to seamlessly connect the world's capital seekers and allocators to make them more productive, innovative, and successful. Our flagship product, Irwin, is a software platform used by investor relations and investment banking professionals all over the world. In October 2024, Irwin joined FactSet and added its investor relations solution to its existing offering.

Role Overview

We are seeking a seasoned Senior Site Reliability Engineer with deep expertise in AWS to own, architect, and continuously evolve Irwin's core infrastructure. You will plan, build, and optimize the systems that support our web applications and internal tools, ensuring scalability, reliability, observability, and security. Your technical judgment, roadmap planning skills, and hands-on expertise will enable our engineering teams to ship features with velocity and confidence.

Key Responsibilities

Strategic Road mapping:

Design and execute long-term strategies for scalable, secure infrastructure to host the Irwin web application and associate tooling on AWS/EKS with PostgreSQL.

AWS Infrastructure:

Architect and manage highly available cloud environments on EKS/Kubernetes using best practices for cost, performance, and security.

Database Operations:

Oversee, tune, and ensure the high availability of large-scale PostgreSQL databases; optimize for performance, backup, disaster recovery, and observability. Bonus points for experience using Snowflake or other OLAP systems.

Infrastructure as Code:

Lead the adoption and maintenance of Terraform workflows to manage infrastructure; ensure reproducibility, modularity, and CI/CD integration.

Continuous Integration & GitOps:

Build, maintain and scale CI/CD pipelines using GitOps principles to automate deployments, reduce risk, and speed up delivery cycles.

Kubernetes:

Design, deploy, and manage production-grade Kubernetes clusters; automate scaling and implement robust security practices.

Monitoring & Incident Response:

Implement monitoring, logging, and alerting solutions; establish best practices for incident detection and resolution.

Security & Compliance:

Apply industry best practices for infrastructure and data security; ensure governance and compliance with relevant standards (e.g., SOC2, GDPR).

Collaboration:

Mentor SRE peers and engineering teams on DevOps/SRE methodologies; document, communicate, and evangelize infrastructure best practices.

Required Skills And Experience

Minimum Requirements:

10+ years as a Site Reliability Engineer, DevOps, or similar role in cloud-native environments (AWS focus).

Critical Skills

Deep technical proficiency with AWS services (EC2, EKS, S3, RDS, IAM, etc.).
Expert-level experience managing, tuning, and scaling PostgreSQL databases.
Advanced skill in Terraform (modular design, environment promotion, CI/CD integration).
Proficient in building and operating CI/CD systems (Gitlab CI, GitHub Actions, or equivalent).
Hands-on experience with GitOps workflows (Argo CD, Flux, etc.).
Strong knowledge of Kubernetes (deployment, scaling, networking, security).
Experience with monitoring and logging stacks (DataDog, Prometheus, Grafana, ELK, etc.).
Track record in designing, communicating, and executing complex infrastructure roadmaps.
Experience mentoring and enabling engineering teams.
Strong written and verbal communication skills.

Preferred/Desired Qualifications

Professional certifications (AWS Solutions Architect, Kubernetes, Terraform).
Experience in fin-tech, SaaS, or high-compliance industries.
Exposure to data privacy regulations and secure software development practices.

Education

Bachelors degree in computer science or similar

Why Irwin?

Influence the technology roadmap at a pivotal growth stage.
Build infrastructure for mission-critical applications.
Work with passionate, high-performing teams.
Competitive compensation, benefits, and equity options.

Here Is What To Expect

First interview with me (hiring manager) to assess experience, basic technical skills and cultural fit - 1 hour
Second deeper technical interview with two team member for a deeper technical assessment - 1 hour
Optional third technical interview also with the same two team members if they are not able to cover everything in the firs

About the Company

FactSet creates flexible, open data and software solutions for tens of thousands of investment professionals around the world, providing instant access to financial data and analytics that investors use to make crucial decisions. For 40 years, through market changes and technological progress, our focus has always been to provide exceptional client service. From more than 60 offices in 23 countries, we’re all working together toward the goal of creating value for our clients, and we’re proud that 95% of asset managers who ... Know more