A fast-scaling company in the Enterprise AI and Intelligent Automation sector building production-grade generative AI solutions for enterprise search, virtual assistants, and decision automation. We are hiring for the primary role: Generative AI Engineer. Location: India — On-site.
Role & Responsibilities
- Design and implement end-to-end generative AI solutions: data ingestion, model fine-tuning, RAG pipelines, and production inference.
- Fine-tune, evaluate, and benchmark LLMs using SFT/RLHF techniques; maintain reproducible experiment tracking and model versioning.
- Build retrieval-augmented generation architectures integrating vector stores and semantic search to improve accuracy and context grounding.
- Develop scalable inference services with optimization techniques (quantization, batching, model sharding) to meet latency and cost SLAs.
- Collaborate with MLOps and backend teams to create CI/CD, monitoring, alerting, and automated retraining pipelines for model governance.
- Drive technical best practices: code reviews, testable deployments, documentation, and mentor junior engineers on ML engineering standards.
Skills & Qualifications
Must-Have
- Python
- PyTorch
- Hugging Face Transformers
- LangChain
- Docker
- Kubernetes
Preferred
- FAISS
- ONNX
- Triton Inference Server
Additional Qualifications
- Proven experience deploying LLM-based features in production environments and ownership of lifecycle from training to monitoring.
- Familiarity with cloud platforms (AWS/GCP/Azure), GPU inference optimizations, and data privacy/compliance considerations for model deployment.
Benefits & Culture Highlights
- Hands-on ownership of core AI products and opportunity to influence technical roadmap across product and infra domains.
- Collaborative, engineering-first culture with emphasis on learning, experimentation, and applied research to production ship.
- Competitive compensation, on-site collaboration, and career growth through mentoring and cross-functional exposure.
Apply if you are passionate about turning state-of-the-art generative AI research into reliable, efficient, and secure production services that drive measurable business impact.