
Data Quality Engineer
- Pune, Maharashtra
- Permanent
- Full-time
- Design and implement automated data quality checks and validation frameworks.
- Perform data profiling, anomaly detection, and root cause analysis using Python and SQL.
- Integrate data quality processes into ETL/ELT pipelines using PySpark and Databricks.
- Monitor and resolve data quality issues across Azure/AWS-based data platforms.
- Develop CI/CD pipelines for data quality automation using Git and DevOps tools.
- Work with JDBC connectors to validate data movement and integrity across systems.
- Document data quality rules, processes, and findings for technical and business stakeholders.
- Programming Languages: Python, SQL
- Big Data Tools: PySpark, Databricks
- Cloud Platforms: Azure or AWS (e.g., AWS S3, AWS Lambda)
- Data Connectivity: JDBC
- Version Control: Git
- DevOps/CI/CD and Orchestration Tools: Jenkins, GitHub Actions, Airflow
- Visualization: Power BI
- Agile: Proven experience working in Agile development environments
- Analytical Thinking: Strong analytical and problem-solving abilities
- Critical Thinking: Ability to evaluate data issues logically and propose effective, scalable solutions
- Communication: Excellent communication skills for cross-team collaboration and stakeholder engagement
- Experience with data governance and metadata management tools
- Familiarity with data cataloging solutions
- Understanding of data privacy compliance standards