- Company Name
- TechShack
- Job Title
- Solution Architect
- Job Description
-
Job Title: Solution Architect
Role Summary: Design, architect, and deliver high‑performance GPU cloud environments that enable efficient AI, HPC, and ML workloads at scale. Collaborate with customers, engineering, and infrastructure teams to define and implement compute, storage, networking, and deployment architectures, ensuring optimal performance and scalability.
Expectations:
- Own end‑to‑end GPU solution design and deployment lifecycle for new and existing customers.
- Bridge technical requirements with business objectives, delivering clear architectural recommendations.
- Drive performance tuning, capacity planning, and scaling strategies for GPU clusters.
- Facilitate onboarding, proof‑of‑concepts, and production deployments with minimal downtime.
- Participate in technical calls, reviews, and occasional customer site visits.
Key Responsibilities
- Design GPU clusters, including sizing, networking (L2/L3, InfiniBand, RoCE), and storage.
- Provide architecture guidance on compute, networking, storage, and deployment (container/Kubernetes, IaC).
- Identify and resolve performance or scaling bottlenecks, recommending improvements.
- Lead customer onboarding, POCs, and production roll‑outs, ensuring alignment with business needs.
Required Skills
- Deep experience in GPU compute, HPC, or AI/ML infrastructure.
- Strong networking knowledge (L2/L3, InfiniBand, RoCE).
- Proficient with storage systems for high‑throughput data access.
- Hands‑on experience deploying and supporting GPU clusters.
- Expertise in Kubernetes (cluster design, deployment patterns) and Terraform (IaC).
- Ability to translate complex technical requirements to clear architectural decisions.
- Excellent communication and stakeholder management skills.
Required Education & Certifications
- Bachelor’s degree in Computer Science, Computer Engineering, or related field (or equivalent industry experience).
- Certifications in Kubernetes (CKA/CKAD), Terraform (HashiCorp) or relevant cloud GPU services are highly desirable.