cover image
Synergyassure Inc

Site Reliability Engineering Manager

On site

Santa clara county, United states

Freelance

06-11-2025

Share this job:

Skills

Python Go Bash MySQL Incident Response Kubernetes Monitoring Jenkins Prometheus Grafana

Job Specifications

Role: Site Reliability Engineer with a Semiconductor or Electronic background 8-10+ Needed

Must Have Skills: Preferred Semiconductor or Electronic Software companies experience.

Role: Site Reliability Engineer

Location: Santa Clara CA (Onsite)

Duration: Long-Term Contract

NO H1B OR NO GC

Rate: $50/hour on C2C

End client: Nvidia

Job Details:

Technical Skills

Bare metal data center machine management tools: IPMI, Redfish, KVM
Automation: Jenkins, Python, Go, Bash
Infrastructure tools: Kubernetes, MySQL, Prometheus, Grafana, ELK
Familiarity with hardware (GPU & Tegra) is a plus

Responsibilities

Guard SLAs:
Implement monitoring, alerting, and incident response for critical engineering services
Perform root cause analysis and post-mortems for threshold breaches
Observability:
Set up/manage monitoring & logging tools (Prometheus, Grafana, ELK)
Maintain KPI pipelines using Jenkins, Python, ELK
Add custom alerts based on business needs
Automation & Optimization:
Capacity planning, optimization, and utilization improvements
Day-to-Day Support:
Handle user-reported issues and alerts
Participate in WAR rooms for critical issues
Collaboration & Documentation:
Maintain operational procedures, configurations, and troubleshooting guides

About the Company

Synergyassure is dedicated to provide comprehensive technology solutions and guidance to businesses across various industries. With a wealth of expertise and a team of highly skilled professionals, we empower organizations to navigate the rapidly evolving digital landscape and leverage technology for enhanced productivity, efficiency, and growth. Know more