EY - GDS Consulting - AI And DATA - NVIDIA Enterprise AI - Senior
- Thiruvananthapuram, Kerala
- Permanent
- Full-time
- Architect end to end solutions: data ingestion → retrieval (hybrid + rerank) → RAG/GraphRAG → orchestration → inference → observability; design for latency, cost, accuracy.
- Design multi agent systems (hierarchical/peer to peer/ReAct) with shared memory (short term state, vector DB, knowledge graph), tool use, planning, and guardrails.
- Productionize on NVIDIA AI Enterprise: package models as NIM microservices, optimize with Triton/TensorRT, track with LLMOps (evals, canary, drift).
- Implement Responsible AI: safety (hallucination mitigation, jailbreak detection), privacy/PII, bias testing, lineage/citations; align to NIST AI RMF / ISO 42001 where relevant.
- Integrate with enterprise platforms (M365/Copilot, CRM/ERP/ITSM), lakehouse (Snowflake/Databricks), and vector stores (pgvector, Pinecone, Milvus).
- Deliverables: reference architectures, security blueprints, Bill of Materials (SBOM), VEX, deployment runbooks, cost models/TCO.
- Mentor engineers (Python/PyTorch), enable squads on LangChain/LangGraph, retrieval evaluation, and AgentOps/LLMOps.
- Proven experience taking GenAI from POC → production at scale, with measurable KPI lift (precision, latency, cost/1K tokens). (Market benchmark: Accenture/Deloitte senior roles emphasize E2E platform and governance.)
- Deep stack fluency: NeMo, NIM, NVIDIA Triton, TensorRT; or equivalent cloud AI runtimes.
- RAG architectures (hybrid retrieval + cross encoder reranking), query transformation (multi query/HyDE), GraphRAG; vector DB selection & scaling.
- Security & governance for AI platforms (HITL, audit trails, content filtering, citations), multi region data residency.
- Strong stakeholder communication and solution storytelling; hands on mindset.
- B.Tech/M.Tech/MCA (CS/EE/Math) or equivalent
- 5-8 years total with 3+ in enterprise AI/ML; shipped at least 2 production LLM solutions (RAG/agentic). (Peers: Wipro/HCL mid senior roles highlight similar thresholds.)
- Hands on Python, PyTorch, CUDA familiarity; containers (Docker), Kubernetes; CI/CD, IaC, telemetry.
- Cloud depth in Azure/AWS/GCP; exposure to Bedrock, Vertex, Azure AI.
- Knowledge graph/ontology modeling for DLMs; Snowflake/Databricks governance; RAG evaluation (RAGAS), prompt security.
- Familiarity with NVIDIA Blueprints and Nemotron reasoning models (e.g., Nemotron Super) for agentic workloads.
- A Team of people with technical consulting and architecting experience and enthusiasm to learn new things in this fast-moving environment
- A leader with technical consulting and architecting experience, passionate about innovation in 3D and Metaverse technologies.
- An opportunity to be part of a market-leading, multi-disciplinary team, shaping the future of digital collaboration.
- Enthusiasm for learning new technologies and driving change in a fast-moving environment.
- Support, coaching and feedback from some of the most engaging colleagues around
- Opportunities to develop new skills and progress your career
- The freedom and flexibility to handle your role in a way that's right for you