Intermediate Data Engineer
CDM Smith View all jobs
- India
- Permanent
- Full-time
CDM Smith is seeking a Data Engineer to join our Digital Engineering Solutions team. This individual will be part of the Data & AI Technology group within the Digital Engineering Solutions team, helping to support strategic Architecture, Engineering and Construction (AEC) initiatives using modern data technologies and analytics to deliver actionable business insights and robust solutions for AEC professionals and client outcomes.The Data & AI Technology group will help lead the firm in AEC-focused Business Intelligence and data services by providing scalable data platforms, strong engineering practices, and reliable data pipelines to enable analytics and AI capabilities across the organization. The Data & AI Technology group will specifically utilize advanced analytics, data science, and AI/ML to give our business and our products a competitive advantage. It includes understanding and managing the data, how it interconnects, and engineering data pipelines and data assets to enable self-serve BI and BA opportunities.This position is for a person who has demonstrated strong foundational data engineering capabilities, is experienced with data technologies and processes, and enjoys framing a problem, contributing to solutions, and supporting implementation efforts. As a member of the Digital Engineering Solutions team, the Data & AI Technology group will also engage in research and development and provide support to AEC practices at CDM Smith, participating in product research, testing, and the incubation of data technology-related ideas that arise from around the company.Key Responsibilities:
- Design and develop data pipelines and workflows to extract, transform, and load (ETL/ELT) data from various sources into target systems.
- Automate workflows to improve efficiency, scalability, and reliability in data integration processes.
- Support the optimization and management of data storage solutions, including data lakes and warehouses, for performance and scalability.
- Ensure data quality by implementing processes to validate completeness, accuracy, and consistency of data.
- Ensure compliance with data governance policies and maintain data privacy and security standards.
- Collaborate with Data Scientists, AI engineers, Analytics Engineers, and stakeholders to understand business requirements and deliver reliable data solutions.
- Support Data Scientists and AI engineers by preparing and delivering trusted datasets and data pipelines that enable advanced analytics and AI/ML solutions.
- Assist in implementing monitoring and observability tools for data pipelines to ensure reliability and issue detection.
- Stay abreast of the latest developments and advancements, including new and emerging technologies & best practices and new tools & software applications and how they could impact CDM Smith.
- Assist with the development of documentation, standards, best practices, and workflows for data technology hardware/software in use across the business.
- Perform other duties as required.
- Experience with the Software Development Life Cycle (SDLC) and Agile Development methodologies.
- Experience with Microsoft Azure cloud services, including Azure Data Factory, Azure Databricks, and Azure Synapse Analytics.
- Proficiency in building and maintaining data systems using modern frameworks like Apache Spark and Databricks.
- Proficiency in programming languages such as Python, SQL.
- Knowledge of data modeling and designing scalable ETL/ELT processes.
- Familiarity with real-time streaming solutions, such as Kafka or Azure IoT Hub.
- Hands-on experience with distributed computing tools, including Apache Spark and Hadoop ecosystems.
- Familiarity with CI/CD pipelines and DevOps practices for data solutions.
- Knowledge of monitoring tools and techniques for ensuring pipeline observability and reliability.
- Strong problem-solving and critical thinking skills to identify and address technical challenges effectively.
- Excellent interpersonal and presentation skills to collaborate with colleagues, stakeholders, and partners.
- Ability to effectively communicate technical concepts to both technical and non-technical audiences.
- Detail oriented with the ability to execute complex or specialized technical tasks.
- Experience working with Databricks or modern Lakehouse architectures.
- Experience supporting data pipelines used by advanced analytics or AI/ML solutions.
- Experience working with large-scale enterprise data platforms and cloud-based data environments.