- Company Name
- Riverlane
- Job Title
- Systems Administrator
- Job Description
-
Job Title: Systems Administrator
Role Summary:
Maintain, optimize, and scale a high‑performance computing (HPC) environment running Red Hat Linux. Ensure high availability, resource efficiency, and secure operation of compute, storage, networking, and EDA tooling components.
Expectations:
* Deliver a reliable, high‑performance HPC platform that supports research workloads.
* Drive performance improvements and infrastructure scalability.
* Maintain rigorous security, compliance, and disaster‑recovery standards.
* Provide timely, expert support to a technical user base.
Key Responsibilities:
1. Administer and maintain HPC clusters: compute nodes, parallel file systems, and networking.
2. Monitor system performance and availability; troubleshoot issues proactively.
3. Deploy, configure, and manage HPC tooling such as Slurm, Singularity/Docker, and Synopsys EDA license servers.
4. Automate routine tasks with Bash, Python, and Ansible; develop and maintain related playbooks.
5. Plan and implement infrastructure improvements in collaboration with the IT Manager.
6. Create and maintain documentation for systems, processes, and configurations.
7. Provide technical support and performance tuning guidance to HPC users.
8. Enforce security policies and ensure compliance across the HPC estate.
9. Manage backup solutions, disaster recovery plans, and data retention compliance.
Required Skills:
* Proven Linux system administration with hands‑on HPC cluster management.
* Expertise in Red Hat Enterprise Linux, Slurm job scheduling, and containerisation (Singularity/Docker).
* Solid scripting/automation in Bash, Python, and Ansible.
* Experience with monitoring (Prometheus, Grafana) and performance tuning.
* Strong networking knowledge (TCP/IP, routing, VLANs, subnetting).
* Backup and disaster‑recovery experience (NFS/SMB/GlusterFS, snapshot tools).
* Familiarity with Synopsys EDA tools or analogous electronic design automation environments.
* Excellent problem‑solving, communication, and collaborative skills.
Required Education & Certifications:
* Bachelor’s degree in Computer Science, Information Technology, or related field (or equivalent experience).
* Red Hat Certified System Administrator (RHCSA) preferred.
* Knowledge of CI/CD, DevOps practices, and GitHub integration is an advantage.
Cambridge, United kingdom
Hybrid
25-11-2025