
GenAI Engineer
- Pune, Maharashtra
- Permanent
- Full-time
- Develop, fine-tune, and integrate LLMs (GPT, LLaMA, Falcon, Claude, etc.) into enterprise applications.
- Implement prompt engineering, embeddings, and RAG (Retrieval Augmented Generation) pipelines.
- Build and maintain APIs, microservices, and front-end/back-end integrations for GenAI applications.
- Work with vector databases (Pinecone, FAISS, Weaviate, Milvus) to enable semantic search.
- Deploy AI solutions on cloud AI platforms (Azure OpenAI, AWS Bedrock, GCP Vertex AI).
- Optimize model performance, latency, and scalability for production use cases.
- Collaborate with data scientists, architects, and business teams to deliver PoCs and production-ready applications.
- Ensure adherence to Responsible AI, data privacy, and security guidelines.
- Strong programming expertise in Python (preferred) with experience in APIs & microservices (Flask/FastAPI/Django).
- Hands-on experience with PyTorch, TensorFlow, Hugging Face Transformers.
- Knowledge of vector embeddings & vector databases.
- Experience with LangChain, LlamaIndex, or similar orchestration frameworks.
- Familiarity with Docker, Kubernetes, and MLOps practices for deploying AI models.
- Proven experience with Azure AI services (Azure OpenAI, Cognitive Search, etc.); AWS/GCP experience is a plus.
- Strong problem-solving, debugging, and optimization skills.