- Company Name
- TalentAlly
- Job Title
- AI Engineer
- Job Description
-
Job Title: AI Engineer
Role Summary:
Lead architect and developer of Gen AI Platform Services. Design, build, and maintain end‑to‑end AI systems including foundation model training, LLM inference, similarity search, guardrails, and observability on cloud and container platforms. Drive performance optimization, scalability, cost efficiency, and responsible AI practices across production workloads.
Expactations:
- Deliver high‑quality AI products that transform customer and associate experiences.
- Translate cutting‑edge research into production‑ready solutions.
- Collaborate closely with cross‑functional teams (engineering, research, product, PM) to define technical vision and roadmap.
- Ensure systems are secure, compliant, and ethically responsible.
Key Responsibilities:
- Design, develop, test, deploy, and support AI software components (training, inference, similarity search, guardrails, evaluation, experimentation, governance, observability).
- Utilize AWS Ultraclusters, HuggingFace, VectorDBs, Nemo Guardrails, PyTorch, and related OSS/SaaS tools.
- Invent and implement LLM optimization techniques to improve scalability, latency, throughput, and cost.
- Contribute to the technical vision and long‑term roadmap of foundational AI systems.
- Mentor teammates, share knowledge, and advocate for responsible AI practices.
Required Skills:
- Proficiency in Python (4+ years) and at least one of Go, Scala, Java (4+ years).
- Experience with AI/ML frameworks (PyTorch, TensorFlow, etc.) and large‑scale inference/LLM pipelines.
- Strong understanding of cloud platforms (AWS, GCP, Azure) and container orchestration (Kubernetes).
- Expertise in vector search, similarity search, memory, and guardrails.
- Proven ability to optimize training and inference for hardware utilization, latency, throughput, and cost.
- Familiarity with AI governance, observability, and responsible AI principles.
- Solid foundation in software engineering, mathematics, and algorithm design.
Required Education & Certifications:
- Bachelor’s degree in Computer Science, AI, Electrical/Computer Engineering (≥4 years experience) or Master’s degree with ≥2 years experience in AI/ML development.
- No specific certifications required, but experience with cloud certifications (AWS, GCP, Azure) is a plus.
New york city, United states
Hybrid
Mid level
07-10-2025