- Company Name
- Volgo Technologies
- Job Title
- Gen AI Engineer (LLM)
- Job Description
-
Job title: Gen AI Engineer (LLM)
Role Summary: Design, develop, fine‑tune and deploy large language model (LLM) applications. Collaborate with product managers and software engineers to deliver scalable, secure, high‑performance AI solutions using GPT, Claude, LLaMA or comparable models.
Expactations:
- Successfully deliver production‑ready LLM services that meet business goals and comply with responsible AI standards.
- Maintain low latency, high accuracy, and cost efficiency across cloud deployments.
- Participate in continuous model improvement through evaluation, feedback loops, and staying current with NLP advancements.
Key Responsibilities:
- Design, build, and deploy LLM‑based APIs and microservices in cloud environments (AWS, Azure, GCP).
- Fine‑tune and evaluate domain‑specific LLMs; employ prompt engineering, retrieval‑augmented generation (RAG), and agent workflows.
- Optimize model performance, latency, and operating cost.
- Ensure data privacy, bias mitigation, and security throughout the AI lifecycle.
- Integrate LLMs into production systems via REST APIs, Docker/Kubernetes, and CI/CD pipelines.
- Monitor outputs, evaluate metrics, and iterate to improve results.
- Keep abreast of NLP research, LLM frameworks, and cloud AI offerings.
Required Skills:
- Python (core) with optional JavaScript or Java; strong coding standards.
- Hands‑on experience with LLMs and NLP techniques.
- Proficiency in LangChain, LlamaIndex, Hugging Face, OpenAI APIs, or similar.
- Knowledge of transformers, deep learning, and ML fundamentals.
- Cloud deployment experience (AWS, Azure, or GCP), including Vertex AI.
- Familiarity with REST APIs, microservices, Docker, Kubernetes, and MLOps tooling.
Required Education & Certifications:
- Bachelor’s degree in Computer Science, Engineering, or related field (or equivalent experience).
- Certifications in cloud platforms (AWS/Azure/GCP) or relevant AI/ML credentialing preferred.