Happening

Senior Site Reliability Engineer (Kafka SME)

On site

All, United States

Senior

Full Time

03-11-2025


Skills

Java, MongoDB, PostgreSQL, GitHub, GitLab, CI/CD, DevOps, Monitoring, Version Control, Jenkins, Networking, Architecture, Databases, Azure, AWS, Software Development, Redis, CI/CD Pipelines, Kafka, Microservices

Job Specifications

Do you have a software engineering background, but love working on infrastructure? We are looking for a skilled Site Reliability Engineer to join our ambitious team working on core platform solutions that other product teams use as building blocks. The role includes setting up and managing most of the technologies used to deliver a great betting product to our customers, and helping our product teams adopt and use common infrastructure and libraries. You will also collaborate with our system operations and IT team, which is responsible for maintaining the underlying virtual hardware and networking.

As a Site Reliability Engineer, you will be in a position to directly influence the work of multiple engineering product teams across dozens of microservices that process hundreds of thousands of betslips per day through their whole lifecycle!

We're looking for someone who:

Has a Computer Science or related degree;
Has experience in production-level software development, preferably in Java;
Is familiar with Kafka-based microservices architecture;
Has some knowledge of application-external infrastructure (e.g. databases, message queues);
Can evaluate system health through monitoring;
Is able to reason about the different components of a large-scale distributed system and their dependencies.

Bonus points for:

Experience with cloud technologies such as AWS, Azure, or Google Cloud;
Familiarity with DevOps methodologies;
Knowing your way around CI/CD processes and toolsets (e.g. Jenkins, CircleCI) and version control platforms (e.g. GitLab, GitHub);
Experience with Golang;
Experience with databases such as PostgreSQL, Redis, MongoDB, or CockroachDB.

What you'll be doing:

Optimizing and configuring Kafka infrastructure;
Setting up and improving monitoring and tracing infrastructure;
Scaling and optimizing logging infrastructure;
Improving CI/CD pipelines;
Implementing new libraries and driving their adoption among product teams;
Prototyping new technologies or architectures.

About the Company

Happening is the technology engine powering Superbet Group's global platforms and brands that bring thrill to customers across the world every day. We are a game-changing tech company rewriting the rules of sports betting and gaming. We are shaking up the status quo by building our own end-to-end tech stack, solving deep and complex challenges for millions of customers and shaping our culture to work uniquely for the tech community. A true challenger, our technology handles serious scale on par with the Big Techs, and cust...