
Data Engineer
- Hyderabad, Telangana
- Permanent
- Full-time
- Design, develop, and deploy data pipeline for clinical domain dataset
- As an infrastructure programmer, continuously develop and support Data Scientist R-Platform and integration with various technology (Kubernetes Container, HashiCorp Vault, SAS Storage, and Data Science Work Bench)
- Design and build various reusable program components using innovative technology (NLP, AI, Python, R, etc) to transform and harmonize clinical dataset for insight generation
- Collaborate with Data Architects, Business SME’s, and Data Scientists to capture the business requirement and translate into Agile product backlog
- Serve as primary data engineer to manage and support AWS, Databricks, RStudio platform, and cloud AI based system production DevOps
- Align to best practices for coding, testing, and designing reusable code/component
- Explore new tools and technologies that will help streamline data pipeline and add new durable capability for clinical development
- Participate in sprint planning meetings and provide estimations on technical implementation
- Collaborate and communicate effectively with the product teams
- Advanced skills in SQL, Python, and R languages programing; AWS cloud technology and databricks data lake technology stacks
- Proficient skills and knowledge on common AI/Machine Learning technologies.
- Data modeling skills, and software development lifecycle knowledge and standard processes
- Learning ability of new technology in the information field
- Skill of using DevOps CI/CD tools, such Git, Jenkins and front UI Visualization technology
- Work experience in the biotechnology or pharmaceutical industry.
- Experience using and adopting Scaled Agile Framework (SAFe)
- Ability to work effectively in a fast-paced, dynamic environment.
- Experience with data modeling for both relationship databases, hands-on experience with SQL (PostgreSQL and Hive SQL is preferred)
- Experience with UI Visualization technology (Dash, RSConnect, Tableau preferred)
- Knowledge and experience in clinical trial design and development process and technology landscape, familiar with clinical trial data standards and structures
- SAFe for Teams certification (preferred)
- Strong communications skills in writing, speaking, presenting and time management skills
- Strong transformation and change management experience.
- Exceptional collaboration and communication skills.
- High degree of initiative and self-motivation.
- Ability to manage multiple priorities successfully.
- Team-oriented with a focus on achieving team goals.
- Strong presentation and public speaking skills.