Big Data Engineer - Scala, - Py spark, -Spark

Citigroup

  • Chennai, Tamil Nadu
  • Permanent
  • Full-time
  • 10 days ago
The Data Engineer is accountable for developing high quality data products to support theBank’s regulatory requirements and data driven decision making. A Data Engineer will serveas an example to other team members, work closely with customers, and remove orescalate roadblocks. By applying their knowledge of data architecture standards, datawarehousing, data structures, and business intelligence they will contribute to businessoutcomes on an agile team.ResponsibilitiesBig Data Engineer - Scala, - Py spark, -SparkDeveloping and supporting scalable, extensible, and highly available data solutionsDeliver on critical business priorities while ensuring alignment with the widerarchitectural visionIdentify and help address potential risks in the data supply chainFollow and contribute to technical standardsDesign and develop analytical data modelsRequired Qualifications & Work ExperienceFirst Class Degree in Engineering/Technology (4-year graduate course)5 to 8 years’ experience implementing data-intensive solutions using agilemethodologiesExperience of relational databases and using SQL for data querying, transformationand manipulationExperience of modelling data for analytical consumersAbility to automate and streamline the build, test and deployment of data pipelinesExperience in cloud native technologies and patternsA passion for learning new technologies, and a desire for personal growth, throughself-study, formal classes, or on-the-job trainingExcellent communication and problem-solving skillsTechnical Skills (Must Have)ETLHands on experience of building data pipelines. Proficiency in at least one of thedata integration platforms such as Ab Initio, Apache Spark, Talend and InformaticaBig DataExposure to ‘big data’ platforms such as Hadoop, Hive or Snowflake fordata storage and processingData Warehousing & Database ManagementUnderstanding of DataWarehousing concepts, Relational (Oracle, MSSQL, MySQL) and NoSQL (MongoDB,DynamoDB) database designData Modeling & DesignGood exposure to data modeling techniques; design,optimization and maintenance of data models and data structuresLanguagesProficient in one or more programming languages commonly used indata engineering such as Python, Java or ScalaDevOpsExposure to concepts and enablers - CI/CD platforms, version control,automated quality control managementTechnical Skills (Valuable)CloudGood exposure to public cloud data platforms such as S3, Snowflake,Redshift, Databricks, BigQuery, etc. Demonstratable understanding of underlyingarchitectures and trade-offsData Quality & ControlsExposure to data validation, cleansing, enrichment anddata controlsFile FormatsExposure in working on Event/File/Table Formats such as Avro,Parquet, Protobuf, Iceberg, DeltaOthersBasics of Job scheduler like Autosys. Basics of Entitlement managementCertification on any of the above topics would be an advantage.Job Family Group: TechnologyJob Family: Digital Software EngineeringTime Type: Full timeMost Relevant Skills Please see the requirements listed above.Other Relevant Skills For complementary skills, please see above and/or contact the recruiter.Citi is an equal opportunity employer, and qualified candidates will receive consideration without regard to their race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other characteristic protected by law.If you are a person with a disability and need a reasonable accommodation to use our search tools and/or apply for a career opportunity review .View Citi’s and the poster.

Citigroup