
Data Scientist Lead
- Hyderabad, Telangana
- Permanent
- Full-time
- Develop and productionize high-quality machine learning models, services, and platforms to create significant technological and business impact.
- Design and implement scalable and reliable data processing pipelines, performing analysis and deriving insights to optimize business outcomes.
- Use LLMs for Generative AI applications, including text generation, classification, and question answering.
- Collaborate with cross-functional teams to identify business requirements and develop data-driven solutions using GenAI technologies.
- Build and maintain data lakes and data processing workflows using Databricks to support machine learning operations.
- Implement data visualization and analytics solutions using ThoughtSpot to provide actionable insights to stakeholders.
- Conduct research on prompt engineering techniques to enhance the performance of LLM-based models.
- Analyze and interpret complex datasets to evaluate model performance and identify areas for improvement.
- Communicate technical concepts and results effectively to both technical and non-technical stakeholders.
- Develop end-to-end ML pipelines for real-time and batch predictions and integrate them with existing applications.
- Champion a DevOps model and support the maturity of the ML development life cycle (MDLC).
- Advanced degree in Computer Science, Data Science, Mathematics, or a related field.
- 8+ years of applied experience in data science, machine learning, or related areas.
- Strong programming skills in Python, with experience in machine learning frameworks such as PyTorch or TensorFlow.
- Experience in building and managing data lakes and data processing workflows using Databricks.
- Proficiency in using GenAI models (OpenAI or similar) to solve business problems and in building AI Agents and MCP Servers
- Solid understanding of data structures, algorithms, and machine learning concepts.
- Experience in NLP and deep learning, with recent exposure to prompt engineering on LLMs and with big data technologies (Hadoop, Spark, etc.)
- Hands-on experience with MLOps tools and practices, ensuring seamless integration of models into production environments.
- Experience with Enterprise Cloud infrastructure and monitoring tools like Data Dog,, Splunk, Elasticsearch, and Grafana.
- Familiarity with data visualization tools like ThoughtSpot.
- Experience in developing APIs and integrating machine learning models into software applications.
- Knowledge of modern development technologies and tools such as Agile, CI/CD, Git, Terraform, and Jenkins.