cover image
Crusoe

Crusoe

crusoe.ai

2 Jobs

727 Employees

About the Company

Crusoe is the industry's first vertically integrated, purpose-built AI cloud platform. The company is redefining AI cloud infrastructure and its platform is recognized as the "gold standard" among builders for its reliability and performance in developing, training, and deploying AI models. Powered by clean, renewable energy, Crusoe aligns the future of computing with the future of the climate. Leading Fortune 500 companies trust Crusoe's advanced, AI-optimized cloud to support their most demanding AI applications.

Listed Jobs

Company background Company brand
Company Name
Crusoe
Job Title
Senior Cloud Support Engineer
Job Description
Job Title: Senior Cloud Support Engineer Role Summary: Deliver first‑line and advanced technical support for a sustainable GPU cloud platform, ensuring high availability, rapid issue resolution, and customer satisfaction. Act as the primary escalation point, collaborate cross‑functionally with SRE, networking, and storage teams, and contribute to knowledge base and automation initiatives. Expectations: - 5 + years of experience in customer or technical support within cloud, storage, or networking environments. - Proven ability to manage high‑volume, critical incidents 24/7 and meet SLA targets. - Excellent command of Linux CLI, Git, and cloud orchestration tools. - Strong analytical, communication, and customer‑service skills. Key Responsibilities: - Provide exceptional technical support to customers via Zendesk; maintain CSAT > 95 % and meet SLA metrics. - Participate in a 24/7 on‑call rotation, ensuring timely incident triage and resolution. - Diagnose and resolve issues with VMs, hardware, scaling tests, containers (Kubernetes), workload managers (Slurm, Terraform), and monitoring tools (Grafana). - Manage alert triage, prepare for maintenance windows, and conduct node delivery tests. - Work closely with SRE, networking, and storage teams from initial triage through root‑cause analysis and issuance of RCAs. - Develop and maintain onboarding/training materials, knowledge‑base articles, and SOPs for support processes. - Collaborate with global teams to adhere to ticketing and handoff procedures. - Contribute to automation scripts and tools that improve support efficiency. Required Skills: - Linux command‑line proficiency (bash, ssh, systemd). - Version control with Git (branching, pull requests). - Container orchestration (Kubernetes) and workload management (Slurm, Terraform). - Monitoring and alerting (Grafana, Prometheus, internal tools). - Public cloud fundamentals (AWS, Azure, GCP). - HPC knowledge: Infiniband, RDMA, RoCE, SDN. - Strong problem‑solving, analytical, and troubleshooting abilities. - Excellent verbal and written communication. Required Education & Certifications: - Bachelor’s degree in IT, Computer Science, Engineering, or related field, **or** 4 + years of equivalent technical experience. - Valid certifications (optional but preferred): CKA, CKAD, CKS, KCNA, AWS ML‑Specialty, AWS Solutions Architect – Professional, NVIDIA AI Infrastructure, Linux Foundation IT Associate, System Administrator. ---
Denver, United states
On site
Senior
27-11-2025
Company background Company brand
Company Name
Crusoe
Job Title
Staff Hardware Systems Engineer
Job Description
**Job Title** Staff Hardware Systems Engineer **Role Summary** Lead the design, development, and validation of system firmware (BIOS/UEFI), kernel-level software, and low‑level tooling for high‑performance, sustainable server platforms. Drive hardware‑firmware integration, GPU compatibility, and bare‑metal infrastructure support to enhance performance, reliability, and scalability of cloud computing systems. **Expectations** - Deliver end‑to‑end firmware bring‑up, validation, and debugging for ARM and x86 server platforms. - Collaborate with silicon vendors to integrate next‑generation GPU hardware. - Ensure rigorous feature testing, hardware‑software parity, and performance verification. - Contribute to kernel‑level configuration and testing where applicable. **Key Responsibilities** 1. Lead design, development, and bring‑up of BIOS/UEFI and kernel‑level firmware for server platforms. 2. Develop and maintain low‑level validation frameworks, unit tests, and diagnostics tools. 3. Partner with GPU and CPU vendors to enable new hardware support and resolve integration issues. 4. Conduct feature testing, system‑level validation, and hardware‑software parity checks on in‑house platforms. 5. Perform root‑cause analysis and debugging across firmware, hardware, and integration challenges. 6. Support end‑to‑end integration and solution testing to meet performance, reliability, and scalability targets. 7. Document findings, update technical specifications, and communicate cross‑team progress. **Required Skills** - Deep knowledge of BIOS/UEFI architecture, firmware development, and kernel bring‑up. - Hands‑on experience with system‑level bring‑up, validation, and firmware unit testing. - Proficiency in scripting/automation (Python, Bash) for validation tooling. - Strong debugging, documentation, and cross‑team communication skills. - Experience with ARM and x86 server architectures, GPU/CPU hardware, and high‑speed digital interfaces. - Ability to collaborate with silicon and hardware vendors for platform development. **Required Education & Certifications** - Bachelor’s or Master’s degree in Electrical Engineering, Computer Engineering, or related field, **or** equivalent professional experience. - Preference for certifications related to firmware, hardware design, or low‑level software (e.g., EFI, ARM, x86 architecture). **Additional Desirable Experience** - 8+ years in hardware systems development or low‑level firmware engineering. - Proven platform enablement and firmware‑hardware integration for server‑class products. - Experience with bare‑metal environments and kernel‑level testing. - Knowledge of sustainable and energy‑efficient hardware design principles. - Familiarity with AMD/x86 software stack and GPU‑to‑GPU communication design. - Exposure to cutting‑edge GPU architectures and ecosystem integration.
San francisco, United states
On site
Senior
28-11-2025