- Company Name
- Hong Kong Exchanges and Clearing Limited (HKEX)
- Job Title
- Senior Application and Platform Engineer
- Job Description
-
**Job title**
Senior Application and Platform Engineer
**Role Summary**
Provide Level 2/3 technical support and reliability engineering for critical LME Middle Office, Back Office, and Market Data systems. Blend SRE principles with platform engineering to maintain 99.999% availability, automate operations, and drive continuous improvement across physical, virtual, containerised (OpenShift/Kubernetes) environments.
**Expectations**
- Deliver mission‑critical support and enhancements for pre‑ and post‑trade applications.
- Apply SRE best practices (SLIs/SLOs, error budgets) to increase system reliability.
- Design, build, migrate, and optimise infrastructure and platform services with focus on automation and resilience.
- Participate in project delivery (Agile, Waterfall, hybrid) and release management.
- Support 24/7/365 availability, duty‑roster, on‑call and weekend shifts with the UK and HK teams.
**Key Responsibilities**
- Embed SRE practices into operational workflows; define and monitor SLIs/SLOs.
- Design, implement, and maintain OpenShift/Kubernetes, physical and virtual environments.
- Build and manage CI/CD pipelines (Bamboo, BitBucket) and IaC (Ansible Tower).
- Create observability stack: Grafana, Prometheus, Splunk; design offensive monitoring rules.
- Automate repetitive tasks with Python, Bash or PowerShell.
- Manage day‑to‑day production incidents, root‑cause analysis, and post‑mortems.
- Validate changes through QA, automated testing, and change management processes.
- Conduct disaster‑recovery drills, chaos‑engineering exercises, and security vulnerability remediation.
- Keep documentation up to date; support regulatory & market‑growth projects.
**Required Skills**
- SRE fundamentals (error budgets, SLO/SLI, proactive monitoring).
- Platform engineering: OpenShift, Kubernetes, Linux, virtualization.
- CI/CD & IaC: Bamboo, BitBucket, Ansible Tower.
- Observability: Prometheus, Grafana, Splunk.
- Scripting: Python, Bash, PowerShell.
- SQL/database experience (MySQL, Oracle, Liquibase).
- Incident, change, problem management.
- Strong communication and stakeholder engagement.
**Required Education & Certifications**
- Bachelor’s degree in Computer Science, Engineering, or related field, **OR** 5+ years of equivalent professional experience.
- Preferred certifications: Certified Kubernetes Administrator (CKA) or similar; AWS/Azure/GCP platform certifications; relevant SRE or DevOps credentials.