
Consultant / Sr Consultant - Data Engineer (Databricks)
- Pune, Maharashtra
- Permanent
- Full-time
- Databricks Platform Proficiency: Deep understanding of the Databricks Lakehouse Platform, including its Medallion architecture, workspace, and capabilities.
- Apache Spark: Expertise in using Apache Spark for data processing, including Spark SQL, DataFrames, and RDDs.
- Data Engineering with Delta Lake: Knowledge of Delta Lake for managing data lakes, including features like ACID transactions, schema enforcement, and time travel.
- ETL Processes: Experience with Extract, Transform, Load (ETL) processes using Databricks and other tools to integrate and transform data from various sources.
- Performance tuning and optimization in SPARK
- Knowledge to integrate with JDBC, SFTP, REST API and Cloud storage account based source systems
- Knowledge of workload patterns , designing metadata driven framework for workloads
- Programming and Scripting: Proficiency in programming languages such as Python and SQL for data manipulation and pipeline development.
- Data Governance and Security: Understanding of data governance best practices and security measures to ensure data integrity and compliance. • Usage of UNITY Catalog , external and internal tables
- Data Analysis: Ability to analyze large datasets to extract meaningful insights and support datadriven decision-making.
- Problem-Solving: Strong analytical skills to troubleshoot and resolve data-related issues efficiently.
- Data Quality and Unit Testing: Ensuring data accuracy and integrity through rigorous testing and validation processes.
- Continuous Learning: Staying updated with the latest trends and advancements in data engineering and Databricks technologies.
- 2 – 8 years of experienced professionals.
- Python Scripting