Job Specifications
In this role, you will be responsible for designing, implementing, and optimizing infrastructure solutions to support our next-generation systems. The ideal candidate will have a deep understanding of performance tuning and system architecture, as well as hands-on experience managing and scaling distributed systems in cloud environments.
Key Responsibilities:
Low Latency Infrastructure: Architect, build, and optimize low-latency infrastructures to ensure optimal system performance for high-demand applications.
Infrastructure as Code (IaC): Use Terraform to manage and automate cloud infrastructure, ensuring scalability, security, and consistency across environments.
Kubernetes Management: Lead the deployment, scaling, and management of containerized applications using Kubernetes. You’ll work to ensure that our Kubernetes clusters are highly available, efficient, and performant.
Collaborate with development teams to optimize infrastructure for high-performance workloads, tuning the network, hardware, and software layers for low-latency needs.
Experience in Linux-based systems, ensuring they are secure, stable, and performant. Experience with system monitoring, logging, and troubleshooting is a must.
Build and maintain automated pipelines for continuous integration and delivery (CI/CD) to enhance the development cycle, leveraging tools like Jenkins, GitLab CI, or similar.
Work closely with engineering, product, and operations teams to ensure the infrastructure meets the needs of both internal and external users. You will also mentor junior engineers and provide technical leadership in infrastructure design and implementation.
Cloud Infrastructure: Manage cloud resources (AWS, GCP, Azure, etc.), ensuring optimal resource allocation, cost efficiency, and performance for production environments.
Monitoring & Incident Response: Implement robust monitoring solutions to track system health and performance. Be part of the on-call rotation and act swiftly to troubleshoot and resolve infrastructure issues, particularly in latency-sensitive systems.
Required Qualifications:
5+ years of experience in infrastructure engineering, with a focus on low-latency systems and performance tuning.
Proven experience with Terraform for infrastructure automation and management.
Extensive hands-on experience with Kubernetes, managing clusters, and deploying microservices.
Strong expertise in managing and troubleshooting Linux-based systems
Strong understanding of networking fundamentals and optimizations, especially in low-latency environments.
Experience with cloud services such as AWS