Senior Prompt Engineer - QA
- Bangalore, Karnataka
- Permanent
- Full-time
- Design, develop, and refine AI prompts and instruction workflows to meet specific product requirements and optimize LLM performance.
- Design, execute, and refine automated and manual tests for AI prompt sets and instruction workflows in applied AI or NLP contexts.
- Build and maintain evaluation frameworks for LLM prompts.
- Develop and uphold test automation scripts using Python and related frameworks.
- Conduct data-driven analyses using Python libraries to assess prompt quality, model performance, and workflow accuracy.
- Perform API and web/mobile application testing when needed.
- Integrate automated tests into CI/CD pipelines, ensuring high quality at every deployment stage.
- Collaborate with cross-functional teams to prioritize tasks, document findings, and drive continuous improvement.
- Clearly communicate technical findings and recommendations to both technical and non-technical stakeholders.
- Ensure deliverables meet security, quality, and performance standards.
- Bachelor’s degree in Data Science, Computer Science, Statistics, Engineering, related field, or equivalent experience.
- Strong analytical skills and hands-on experience in Python.
- Programming proficiency in at least one test automation language (Typescript, JavaScript, Python, Java).
- Hands-on experience with web automation frameworks.
- Proficiency in API testing tools and visual regression testing.
- Excellent written communication skills for clear test case and bug report documentation.
- Strong problem-solving mindset and a drive to optimize and automate workflows.
- Enthusiasm for agile, cross-functional teamwork, and iterative product development.
- Fluent in English, with clear written and verbal communication.
- Demonstrated experience in prompt engineering, including designing and optimizing prompts for various use cases.
- Experience with LLM evaluation, prompt engineering workflows, and prompt evaluation tools.
- Familiarity with machine learning concepts, NLP workflows, and data pipeline tools.
- Familiarity with design tools (e.g., Figma) and basic security testing concepts, especially for cloud-based applications.