
Data Engineer
- Bangalore, Karnataka
- Permanent
- Full-time
- Design, develop, and maintain scalable and robust data pipelines for both batch and real-time processing.
- Extract, transform, and load (ETL) data from a wide variety of structured and unstructured data sources including:
- RESTful and SOAP APIs
- Databases (SQL, NoSQL)
- Cloud storage (e.g., S3, Google Cloud Storage)
- File formats (e.g., JSON, CSV, XML, Parquet)
- Web scraping tools where appropriate
- Build reusable data connectors and integration solutions to automate data ingestion.
- Collaborate with internal stakeholders to understand data requirements and ensure accessibility and usability.
- Monitor and optimize pipeline performance and troubleshoot data flow issues.
- Ensure data governance, security, and quality standards are applied across all pipelines.
- Experience with data manipulation and analysis libraries such as Pandas, Polars, or Dask for handling large datasets efficiently.
- Design and create data flow and architecture diagrams to visually represent data pipelines, system integrations, and data models, ensuring clarity and alignment among technical and non-technical stakeholders.
- Proficiency in SQL and at least one programming language (Python, Java, Scala).
- Experience with data pipeline and workflow tools (e.g., Apache Airflow, AWS Data Pipeline).
- Knowledge of relational and non-relational databases. (e.g., Oracle, SqlServer, MongoDB).
- Strong data modeling and data warehousing skills.
- Bachelor’s degree in Computer Science, Engineering, Information Systems, or related field (Master’s a plus).
- 5+ years of experience in a data engineering or similar role.
- Strong analytical and problem-solving abilities.
- Excellent communication and collaboration skills.
- Detail-oriented and proactive mindset.