
Lead Databricks Engineer / Architect
- Bangalore, Karnataka
- Permanent
- Full-time
We are seeking an experienced Lead Data Engineer / Architect with over 10 years of industry experience (including 5+ years with Databricks and modern data platforms) to design, implement, and lead enterprise-scale data solutions. The ideal candidate is hands-on with the Databricks ecosystem, PySpark (UDFs, Pandas UDFs), dbt, and Airflow, and is capable of driving architecture decisions, leading engineering teams, and ensuring best practices in data engineering.
Key Responsibilities
- Lead the design and implementation of data lakehouse architectures using Databricks, Delta Lake, and Delta Live Tables.
- Define best practices for workspace setup, cluster management, repos, and job orchestration.
- Partner with stakeholders to translate business needs into scalable data engineering solutions.
Data Engineering & Development
- Develop and optimize ETL/ELT pipelines using PySpark, dbt, and Airflow.
- Implement advanced PySpark transformations leveraging UDFs and Pandas UDFs for complex use cases.
- Ensure data quality, lineage, and observability across all pipelines.
Requirements
- 10+ years of total IT experience, with at least 5 years of relevant data engineering experience on Databricks.
- Strong expertise in:
  - Databricks (Workspace, Clusters, Jobs, Repos, Delta Live Tables)
  - PySpark (UDFs & Pandas UDFs)
  - dbt for data transformations
  - Apache Airflow for orchestration
- Solid understanding of cloud data platforms (Azure/AWS/GCP).
- Hands-on experience with CI/CD, version control (Git), and DevOps practices for data engineering.
- Databricks Certified Data Engineer Associate (mandatory).
- Proven ability to lead and mentor teams, with excellent communication and stakeholder management skills.