
AGM - Cloud Operations
- Pune, Maharashtra
- Permanent
- Full-time
- Responsible for operations and maintenance of cloud infrastructure elements across OEM clouds (CEE, CBIS, CVIM, Red Hat) and cloud IP infrastructure nodes (SDN, SDI, leaf and spine)
- Ensure 100% availability of the deployed cloud infrastructure.
- Streamline cloud operational processes to ensure a healthy and robust network.
- Achieve set benchmarks for cloud KPIs
- Facilitate technology discussions, prioritization and governance to ensure timely execution
- Coordinate closely with SNOC, Cluster and CODE teams to get all cloud-related issues resolved within TAT.
- Track release cycles of all software updates, firmware updates and security patches, and ensure the cloud infrastructure is updated with the latest software releases to maintain vendor support
- Align with SNOC/CODE/OEM on the implementation of new features and parameter changes to maintain uptime and improve the CXX score.
- Work towards automation of routine activities to reduce manual intervention and increase efficiency.
- Decide on maintenance upgrades and FNI
- Approve change requests for service-affecting and service-threatening operational changes
- Review operational processes; adopt and implement process changes
- Drive OEM Care support and support SNOC/circle teams in closing cases such as repetitive faults
- Analyze incidents and take part in design-related re-engineering of solutions to improve system availability
- Provide support across multiple stakeholders (NOC, OEM or Planning team) for critical issues and activities
- Run OEM governance to support circles on ongoing/recurring issues
- Analyze spare requirements based on fault trends and share them with the CODE team for procurement
- Arrange solutions from the vendor for operational concerns
- Initiate competency-building programs for operational teams based on requirements
- Onboard SNOC and circle teams on changes to the operational model based on organizational needs
- Drive circle and SNOC teams to roll out new changes/improvements/upgrades on live nodes based on CODE and vendor recommendations
- Engage the CODE team to resolve operational challenges related to product or solution
- Govern and drive traffic migration requirements to support consolidation and capacity optimization
- Life cycle management of the deployed cloud infrastructure
- Track KPIs of the cloud infrastructure and conduct improvement activities to ensure 100% availability
- Conduct optimisation activities related to cloud and ensure optimum usage of deployed infrastructure
- Initiate automation of routine activities and reduce manual intervention.
- Develop triggers for various events to reduce TAT for resolving reported issues.
- Track issues reported in the cloud infrastructure and get them resolved
- Regularly audit and review the cloud infrastructure and recommend changes
- Strong understanding of and hands-on experience with cloud infrastructure and related technologies; experience with Nokia CBIS, Red Hat Cloud and Mavenir is a must.
- Understand the intricacies of each vendor product deployed in the network
- Experience handling live operations, with the ability to judge a given situation and take appropriate decisions
- Hands-on experience with Nokia CBIS, Zabbix, Zenoss, OMC and NADCM operations.
- Hands-on experience in deploying OpenStack (NFVI) cloud infrastructure
- Good understanding of Ceph-based storage
- Good understanding of OpenStack services and of undercloud and overcloud concepts.
- Good understanding of SDN technologies
- Hands-on experience in updating/modifying the networking of cloud infrastructure
- Good understanding of Docker and containers.
- BE/B.Tech in Electronics & Communication
- Training/certifications in relevant cloud technology (private/public/hybrid cloud)
- Advanced Linux training/certifications
- OpenStack training/certifications
An Aditya Birla Group & Vodafone partnership