cover image
RED Global

HPC Engineer (High-Performance Computing)

Hybrid

London, United kingdom

Senior

Freelance

03-11-2025

Share this job:

Skills

Python Bash DevOps Configuration Management Ansible Agile methodologies Networking Research Linux System Administration cloud platforms Agile Openstack TCP/IP Terraform

Job Specifications

We are seeking an experienced and highly motivated High-Performance Computing (HPC) Engineer to join our team. The successful candidate will have a proven record of delivering robust HPC services and infrastructure, combined with the ability to work closely with the scientific and research community to optimise computational workflows.

The role requires an individual with strong technical expertise, an understanding of the evolving HPC landscape, and a commitment to delivering high-quality, scalable, and automated solutions. You will play a key role in building and maintaining advanced computing platforms, including containerised environments and cloud-based research computing services, while applying DevOps principles and Infrastructure-as-Code methodologies.

Role details

6 months contract + possible extension
Hybrid working - 3days on site 2 days remote
Competitive daily rate - Outside IR35
Start date 24th November

Key Responsibilities

Design, implement, and maintain secure and scalable HPC infrastructure using Infrastructure-as-Code (IaC) tools such as Terraform.
Develop, deliver, and support advanced research computing services and applications.
Apply Site Reliability Engineering (SRE) principles to ensure high availability, performance, and reliability across HPC environments.
Troubleshoot and resolve complex technical challenges affecting both the platform and user workloads.

Essential Skills and Experience

10+ years of hands-on experience designing, operating, or engineering large-scale computing environments (HPC, HTC, or Big Compute).
Proven ability to drive innovation and integrate emerging technologies into HPC solutions.
Administration experience with cluster and workload management software (e.g., Slurm, LSF, Grid Engine).
Strong knowledge of Linux system administration, TCP/IP networking, and storage systems.
Experience managing parallel file systems (e.g., Weka, GPFS, Lustre).
Hands-on experience with private cloud platforms (e.g., OpenStack).
Proficiency with configuration management tools (e.g., Ansible, Salt, Puppet).
Demonstrated experience in DevOps environments using agile methodologies.
Strong scripting skills in Bash and Python for automation and systems management.
Ability to build and maintain productive relationships with third-party suppliers.

About You

You are a problem solver with a deep understanding of research computing and a passion for leveraging technology to enable discovery. You thrive in complex technical environments, value collaboration, and are driven by the challenge of delivering high-quality, reliable computing services.

About the Company

25 years of tech recruitment! RED Global is one of the world's leading tech recruitment companies. Established in 2000, we focus specifically on SAP, Business Applications, Data & Analytics, Cloud & Infrastructure, Software Development, and Cybersecurity. - 400,000+ SAP and tech candidates. - 350+ employees - 8 office locations (Cologne, Frankfurt, London, Munich, New Jersey, Rotterdam, Warsaw, and Zurich) - Over 45,000 placements made - 1,600+ SAP & tech consultants working in current roles Our services range from pure ... Know more