
HPC (High Performance Compute) Cloud Engineer_Vice President_Cloud & Infrastructure Engineering
- India
- Permanent
- Full-time
- Working in a globally distributed team to provide innovative and robust Cloud centric solutions.
- Closely working with Product Management and Vendors to develop and deploy Cloud services to meet customer expectations. Integrate, configure, document and deploy compliant infrastructure and supporting services in the Cloud platform.
- Design, Optimization and Documentation of the Operational aspects of the Cloud platform.
- Troubleshooting problems, resolving root cause, and where possible, fixing the bug(s) Collaborate with Risk Management to ensure necessary controls to Cloud services are deployed and tested.
- Working closely with customers to develop robust and re-useable configuration management components
- 9-14 years of experience working with AWS and/or Azure and a proven track record of building complex infrastructure.
- At least 6 years' relevant experience would generally be expected to find the skills required for this role
- Strong development skills in Python and IaC (Terraform )
- Expert level knowledge with CSP design, architecture, and services (EC2, IAM, S3 etc.) Sound experience with Infrastructure as Code (Terraform, Ansible, CloudFormation etc.)
- Sound knowledge of server infrastructure, virtualization, and cloud computing
- Experience with architecting and maintaining high availability production systems
- Experience in Software Installation, configuration and Patching Knowledge of system monitoring in a cloud environment including cloud specific products and tools
- Experience with Agile and DevOps concepts Developing monitoring architecture and implementing monitoring agents, dashboards, escalations, and alerts
- Good knowledge of security (SAML, OAuth, OpenID, Kerberos, Policies, entitlements etc.)
- Good knowledge of Kubernetes and container management
- Experience in the financial industry
- Bachelor’s degree in a related field
- Knowledge about HPC
- Experience with Monitoring, Alerting and Logging