- Company Name
- Partnerize
- Job Title
- Senior Site Reliability Engineer
- Job Description
-
Job Title: Senior Site Reliability Engineer
Role Summary:
Architect, build, and maintain scalable, secure, and highly available infrastructure for a distributed automation platform. Lead the SRE team, mentor peers, and collaborate with development and security teams to deliver production-quality services.
Expactations:
- Drive the design, deployment, and evolution of the platform’s cloud, networking, and DevOps tooling.
- Provide expert leadership, coaching, and knowledge sharing within the SRE squad.
- Ensure continuous uptime, performance, and security compliance across all services.
- Own the end‑to‑end lifecycle of reliability initiatives from scoping to delivery.
Key Responsibilities:
- Deploy and maintain Linux‑based production environments, covering compute, storage, networking, and monitoring.
- Design, implement, and manage CI/CD pipelines, Infrastructure as Code (IaC), and configuration management.
- Conduct threat modeling, vulnerability assessments, and security reviews for new and existing services.
- Collaborate with development teams on secure code practices, code reviews, and architecture decisions.
- Lead incident response, post‑mortem analysis, and blameless retrospectives to improve reliability.
- Mentor junior engineers, drive best‑practice adoption, and facilitate knowledge‑sharing workshops.
- Measure system performance, capacity, and cost; recommend and implement optimizations.
- Serve as the primary point of contact for platform security and compliance initiatives.
- Engage stakeholders to translate business needs into reliable, scalable technical solutions.
Required Skills:
- Proficient in Linux system administration, Bash/Shell scripting, and system performance tuning.
- Hands‑on experience with cloud platforms (AWS, GCP, or Azure) and container orchestration (Kubernetes).
- Strong knowledge of networking (TCP/IP, DNS, load balancing), virtualization, and storage concepts.
- Familiarity with IaC tools (Terraform, CloudFormation) and CI/CD platforms (Jenkins, GitHub Actions, GitLab CI).
- Experience conducting threat modeling, vulnerability scans (OWASP ZAP, Nessus), and penetration testing basics.
- Solid understanding of DevOps culture, continuous monitoring (Prometheus, Grafana), and incident management.
- Excellent communication, collaboration, and mentoring abilities.
Required Education & Certifications:
- Bachelor’s degree in Computer Science, Information Technology, or related field, OR equivalent industry experience.
- Professional certifications preferred: AWS Certified Solutions Architect – Associate / Professional, Certified Kubernetes Administrator (CKA), or relevant security certifications such as CISSP or OSCP.