
Data Engineer (Data bricks, azure, pyspark, Scala)
- Chennai, Tamil Nadu
- Permanent
- Full-time
- Responsible to assemble large, complex sets of data that meet non-functional and functional business requirements.
- Responsible to identify, design and implement internal process improvements including re-designing infrastructure for greater scalability, optimizing data delivery, and automating manual processes.
- Building required infrastructure for optimal extraction, transformation and loading of data from various data sources using Azure, Databricks and SQL technologies
- Responsible for the transformation of conceptual algorithms from R&D into efficient, production ready code. The data developer must have a strong mathematical background in order to be able to document and maintain the code
- Responsible for integrating finished models into larger data processes using UNIX scripting languages such as ksh, Python, Spark, Scala, etc.
- Produce and maintain documentation for released data sets, new programs, shared utilities, or static data. This must be done within department standards
- Ensure quality deliverables to clients by following existing quality processes, manually calculating comparison data, developing statistical pass/fail testing, and visually inspecting data for reasonableness: the requirement is on-time with zero defects
- Course work or experience in Numerical Analysis, Mathematics or Statistics is a plus
- Highly proficient in using the spark framework (python)
- Extensive knowledge of Data Warehousing concepts, strategies, methodologies.
- Programming experience in Python, SQL,
- Direct experience of building data pipelines using Apache Spark (preferably in Databricks), Airflow.
- Hands on experience designing and delivering solutions using Azure including Azure Storage, Azure SQL Data Warehouse, Azure Data Lake
- Minimum 3 - 6 year of experience as Data engineer
- Experience modeling or manipulating large amounts of data is a plus
- Experience with Demographic, Retail business is a plus
- Flexible working environment
- Volunteer time off
- LinkedIn Learning
- Employee-Assistance-Program (EAP)