cover image
OVHcloud

OVHcloud

www.ovhcloud.com

8 Jobs

2,951 Employees

About the Company

A OVHcloud é um ator mundial e o líder europeu em serviços cloud, com mais de 450 000 servidores nos seus 37 datacenters distribuídos por quatro continentes. Desde há 20 anos, o grupo baseia-se num modelo integrado que lhe confere um controlo total sobre a sua cadeia de valor: da conceção dos servidores, passando pela administração dos seus datacenters e a orquestração da sua rede de fibra ótica. Esta abordagem única permite-lhe dar resposta, de forma totalmente independente, às necessidades dos seus 1,6 milhões de clientes em mais de 140 países. A OVHcloud oferece soluções de última geração que combinam rendimento, um preço previsível e uma total soberania dos dados para impulsionar o crescimento dos seus clientes com total liberdade.

Listed Jobs

Company background Company brand
Company Name
OVHcloud
Job Title
Senior Site Reliability Engineer (Overnight)
Job Description
Job title: Senior Site Reliability Engineer (Overnight) Role Summary: Provides 24/7 operational support and incident response for OVHcloud services during overnight shifts (Sunday‑Thursday 10:00 PM‑6:00 AM CST). Maintains high availability, performance, and reliability of distributed Linux/Unix and Windows infrastructure, automates operational tasks, and contributes to new product deployments. Expactations: • Work standard overnight shift with weekend on‑call rotation. • Manage comprehensive SRE responsibilities, including monitoring, alerting, root‑cause analysis, and documentation. • Deliver proactive automation, scripting, and tooling to increase reliability and efficiency. • Collaborate with development teams on microservices, APIs, and UAT for product launches. • Communicate findings and recommendations clearly to technical and business stakeholders. Key Responsibilities: - Monitor alerting systems (Grafana, Nagios) and adjust configurations to ensure 99.99% availability. - Diagnose incidents using data‑driven analysis, perform root‑cause analysis, and document solutions. - Develop and maintain automation scripts in Bash, Python, Go, or Perl. - Contribute to microservice build, deployment, and troubleshooting. - Configure and deploy infrastructure via Terraform, Ansible, or Puppet. - Maintain monitoring, metrics, and logging stacks (OpenSearch, Grafana). - Produce automated reports and dashboards for teams and leadership. - Write and update knowledge‑base articles, SOPs, and incident post‑mortems. - Conduct User Acceptance Testing for new services and features. Required Skills: - 5+ years in SRE, DevOps, or related role. - 3+ years administering Linux/Unix and Windows systems. - Experience with microservices, APIs, and cloud‑native architectures. - Proficiency in scripting (Perl, Python, Bash, Go). - Hands‑on with monitoring/alerting (Nagios, Grafana, OpenSearch). - Familiarity with configuration management (Puppet, Ansible) and IaC (Terraform). - Knowledge of virtualization, containers (Docker, Kubernetes). - Strong analytical, problem‑solving, and documentation skills. - Effective prioritization and ability to handle competing tasks. Required Education & Certifications: - Bachelor’s degree in Computer Science or related field preferred; equivalent experience acceptable. - No mandatory certifications required, but familiarity with relevant cloud or DevOps certifications is a plus.
Dallas, United states
On site
Senior
10-09-2025
Company background Company brand
Company Name
OVHcloud
Job Title
Site Reliability Engineer - Openstack H/F/N
Job Description
**Job Title** Site Reliability Engineer – OpenStack **Role Summary** A SRE focused on ensuring high availability, reliability, and performance of a large-scale OpenStack cloud environment. Responsibilities include monitoring, incident management, capacity planning, and continuous improvement of service resilience through automation and best‑practice application. **Expectations** - Deliver rapid, knowledge‑sharing support for OpenStack services. - Take full ownership of incident response, post‑mortem analysis, and root‑cause investigations. - Define and refine reliability metrics (SLIs, SLOs) and error budgets. - Proactively identify and implement performance and automation opportunities. - Maintain comprehensive architecture and operational documentation. **Key Responsibilities** - Maintain stability and resilience of OpenStack services (Neutron, Nova, Glance, Cinder, Keystone). - Design, deploy, and improve monitoring, logging, and alerting pipelines. - Lead incident handling, root‑cause analysis, and post‑incident reviews. - Analyze system trends, optimize performance, and support scalability. - Automate repetitive processes and strengthen infrastructure protection. - Document architecture, operating procedures, and troubleshooting playbooks. - Collaborate with cross‑functional teams to shape long‑term reliability roadmaps. **Required Skills** - Deep understanding of OpenStack architecture and its key components. - Proven experience with SRE practices: monitoring, alerting, incident response, capacity planning, and automation. - Strong monitoring and analytics proficiency (e.g., Prometheus, Grafana, ELK/EFK). - Excellent scripting/automation skills (Python, Bash, Terraform, Ansible, or equivalent). - Ability to analyze logs, metrics, and performance data for root‑cause diagnostics. - Proficiency in English (both written and spoken). - Collaborative mindset and effective communication with engineering and operations teams. - Prior experience in IT infrastructure management is highly desirable. **Required Education & Certifications** - Bachelor’s degree in Computer Science, Information Technology, or a related field **or** equivalent hands‑on experience. - OpenStack certification (e.g., OCA – OpenStack Certified Administrator) is a plus but not mandatory.
Montreal, Canada
Hybrid
10-09-2025
Company background Company brand
Company Name
OVHcloud
Job Title
Technicien de déploiement informatique H/F/N
Job Description
**Job Title** IT Deployment Technician (M/F/Other) **Role Summary** Support the deployment, installation, and maintenance of datacenter hardware. Works within a 15‑person deployment team, collaborating with network technicians and a manager to ensure efficient, secure, and quality delivery of new equipment to clients. **Expectations** * 6 Months: * Operate racks, unpack, and test servers for quality delivery. * Use communication interfaces (Recycle, Webex, ServiceNow, Jira, Confluence, Picomto, etc.). * Independently perform server rack and de‑rack operations. * 1 Year: * Upgrade and customize servers by replacing obsolete components. * Lead autonomous rack deployments and contribute to continuous improvement. * Participate in cross‑functional projects (mentoring, product launches, process enhancements). **Key Responsibilities** * Join the Datacenter deployment team to install and maintain hardware equipment. * Rack new servers in designated locations and de‑rack existing units to optimize capacity. * Remove obsolescent components, handle incoming materials from headquarters or vendors, and track inventory. * Optimize server racks for new deployments. * Conduct quality tests on servers before hand‑over. * Use service management and collaboration tools (ServiceNow, Jira, Confluence). **Required Skills** * Strong knowledge of server hardware and datacenter infrastructure. * Ability to read and follow technical specifications and safety regulations. * Proficiency with rack‑ing, de‑rack­ing, and component replacement. * Technical English reading and communication skills. * Adaptability to a dynamic work environment and strong teamwork, rigor, and organization. **Required Education & Certifications** * Minimum of BAC‑2 (DUT, BTS, or Bac Pro) in IT systems, digital technology, networking, or telecommunications. * Awareness of industrial safety standards and regulations. * Valid Driver’s License B (preferred). ---
Gravelines, France
On site
16-10-2025
Company background Company brand
Company Name
OVHcloud
Job Title
Technicien de maintenance informatique en datacentre H/F/N
Job Description
**Job Title:** Data Center IT Maintenance Technician (M/F/N) **Role Summary:** Perform installation, preventive and corrective maintenance of hardware across four data centers, ensure 24/7 service continuity, monitor server health, and support internal and client‑facing applications. Work within a rotating 3‑shift schedule as part of a collaborative technical team. **Expectations:** - **First 6 months:** Independently respond to monitoring alerts and client tickets; operate monitoring, ticketing, and collaboration tools; execute routine hardware preventive maintenance. - **By 12 months:** Optimize servers by replacing obsolete components; assess and prioritize incidents; perform first‑level infrastructure upkeep (network, power, cooling, security); contribute to cross‑functional projects and continuous‑improvement initiatives. **Key Responsibilities:** - Install, configure, and maintain servers, storage, and networking equipment in the data center. - Monitor server performance, analyze alerts, and resolve incidents according to defined processes. - Execute scheduled preventive maintenance and hardware upgrades. - Participate in 24 × 7 shift coverage (3 × 8 h) to guarantee uninterrupted service. - Evaluate incident severity, prioritize tickets, and coordinate resolution with senior technicians. - Perform first‑level checks on data‑center infrastructure (power, cooling, physical security). - Document actions in ServiceNow, Jira, Confluence, and related systems. - Support cross‑team projects (mentoring, product launches, process improvements). - Contribute to quality‑management compliance and continuous‑improvement activities. **Required Skills:** - Strong knowledge of server hardware, storage, and networking components. - Proficiency with monitoring platforms, ServiceNow, Jira, Confluence, Webex, and related communication tools. - Ability to diagnose, troubleshoot, and resolve hardware and infrastructure incidents. - Experience working in a 24/7 shift environment and following standard operating procedures. - Good written and verbal communication in English (French optional). - Team‑oriented mindset with the ability to work autonomously. **Required Education & Certifications:** - Technical diploma or degree in Computer Science, Information Technology, Electronics, or related field. - Preferred certifications: CompTIA A+, Server+; Cisco CCNA or equivalent; ITIL Foundation. - Valid work authorization for the location of the data center.
Strasbourg, France
On site
16-10-2025