Job Specifications
THE POSITION
Our roster has an opening with your name on it
At FanDuel, our data platforms power mission-critical products used by millions of customers every day. To meet the scale, speed, and complexity of our business, we need to ensure our data services are resilient, performant, and highly reliable—even in the face of failures or large-scale disruption.
This Senior Site Reliability Engineer (SRE) role sits within our Data organization and will play a critical role in building and maturing our reliability engineering and disaster recovery capabilities. You will help define the standards, architectures, and operating models that allow FanDuel’s data infrastructure to withstand outages, recover rapidly, and continuously deliver against demanding reliability targets.
As a Senior Engineer, you will not only be deeply hands-on in implementation but will also act as a technical leader—mentoring engineers, setting direction, and driving adoption of best practices across the organization. You will work closely with engineering and operations teams to establish observability frameworks, define service-level objectives (SLOs), automate recovery procedures, and architect cross-region failover solutions. Your expertise will directly shape how FanDuel safeguards its data platforms and ensures business continuity during critical events.
In addition to the specific responsibilities outlined above, employees may be required to perform other such duties as assigned by the Company. This ensures operational flexibility and allows the Company to meet evolving business needs.
THE GAME PLAN
Everyone on our team has a part to play
Design and maintain highly available, scalable data platform infrastructure on AWS (Databricks, Redshift, Airflow).
Collaborate with engineering teams to optimize performance across clusters, pipelines, and workflows.
Implement infrastructure-as-code (Terraform, CloudFormation, CDK) to automate deployment and recovery.
Establish and monitor SLOs/SLIs, building observability and automation for reliability at scale.
Lead business continuity and disaster recovery planning, including cross-region failover and backup strategies.
Drive incident management end-to-end: response, escalation, post-mortems, and root cause analysis.
Champion continuous improvement, applying learnings from incidents and performance data to strengthen resilience.
THE STATS
What we're looking for in our next teammate
5+ years of experience in Site Reliability Engineering, DevOps, Infrastructure, or Data Platform Engineering.
Strong knowledge of AWS services (EC2, S3, RDS, Lambda, CloudWatch, etc.) and infrastructure-as-code (Terraform, CloudFormation, Ansible).
Hands-on experience with data platform technologies (Databricks, Redshift, Airflow, dbt, Spark, Kafka).
Proficiency in at least one programming language (Python, Go, Java, or similar).
Experience with monitoring, observability, and log analysis tools (Prometheus, Grafana, DataDog, ELK, Splunk).
Solid understanding of CI/CD pipelines, version control (Git), and automation practices.
Proven ability to design and implement business continuity and disaster recovery solutions.
Strong incident management skills, including root cause analysis and post-mortem processes.
Knowledge of network security and compliance frameworks.
Excellent communication and documentation skills, with ability to influence and advocate for best practices across teams.
Demonstrated track record of improving system reliability, performance, and efficiency at scale.
About Fanduel
FanDuel Group is the premier mobile gaming company in the United States and Canada. FanDuel Group consists of a portfolio of leading brands across mobile wagering including: America’s #1 Sportsbook, FanDuel Sportsbook; its leading iGaming platform, FanDuel Casino; the industry’s unquestioned leader in horse racing and advance-deposit wagering, FanDuel Racing; and its daily fantasy sports product.
In addition, FanDuel Group operates FanDuel TV, its broadly distributed linear cable television network and FanDuel TV+, its leading direct-to-consumer OTT platform. FanDuel Group has a presence across all 50 states, Canada, and Puerto Rico.
The company is based in New York with US offices in Los Angeles, Atlanta, and Jersey City, as well as global offices in Canada and Scotland. The company’s affiliates have offices worldwide, including in Ireland, Portugal, Romania, and Australia.
FanDuel Group is a subsidiary of Flutter Entertainment, the world's largest sports betting and gaming operator with a portfolio of globally recognized brands and traded on the New York Stock Exchange (NYSE: FLUT).
Player Benefits
We treat our team right
We offer amazing benefits above and beyond the basics. We have an array of health plans to choose from (some as low as $0 per paycheck) that include programs for fertility and family planning, mental health support, and fitness benefits. We offer generous paid time off (PTO & sick leave), annu
About the Company
FanDuel Group is an innovative sports-tech entertainment company that is changing the way consumers engage with their favorite sports, teams, and leagues. The premier gaming destination in the North America, FanDuel Group consists of a portfolio of leading brands across gaming, sports betting, daily fantasy sports, advance-deposit wagering, and TV/media, including FanDuel, Stardust Casino and TVG.
The company is based in New York with US offices in Los Angeles, Atlanta, and Jersey City, as well as global offices in Canada ...
Know more