GenAI Engineer

Epergne Solutions

  • Pune, Maharashtra
  • Permanent
  • Full-time
  • 9 days ago
Job Description : GenAI EngineerLocation : Mysore, Pune, BangaloreQualification : BE/B.Tech/ME/M.TechExperience : 6+ years (mandatory)Role OverviewWe are seeking an experienced GenAI Engineer / LLM Specialist to design, build, and deploy enterprise-grade AI solutions leveraging Large Language Models (LLMs). The ideal candidate will have hands-on expertise in LLMs, vector databases, Python, Azure, and microservices, with a strong background in building scalable AI-driven applications.Key Responsibilities
  • Develop, fine-tune, and integrate LLMs (GPT, LLaMA, Falcon, Claude, etc.) into enterprise applications.
  • Implement prompt engineering, embeddings, and RAG (Retrieval Augmented Generation) pipelines.
  • Build and maintain APIs, microservices, and front-end/back-end integrations for GenAI applications.
  • Work with vector databases (Pinecone, FAISS, Weaviate, Milvus) to enable semantic search.
  • Deploy AI solutions on cloud AI platforms (Azure OpenAI, AWS Bedrock, GCP Vertex AI).
  • Optimize model performance, latency, and scalability for production use cases.
  • Collaborate with data scientists, architects, and business teams to deliver PoCs and production-ready applications.
  • Ensure adherence to Responsible AI, data privacy, and security guidelines.
Required Skills
  • Strong programming expertise in Python (preferred) with experience in APIs & microservices (Flask/FastAPI/Django).
  • Hands-on experience with PyTorch, TensorFlow, Hugging Face Transformers.
  • Knowledge of vector embeddings & vector databases.
  • Experience with LangChain, LlamaIndex, or similar orchestration frameworks.
  • Familiarity with Docker, Kubernetes, and MLOps practices for deploying AI models.
  • Proven experience with Azure AI services (Azure OpenAI, Cognitive Search, etc.); AWS/GCP experience is a plus.
  • Strong problem-solving, debugging, and optimization skills.

Epergne Solutions