
Infrastructure Monitoring Specialist/Consultant Specialist
- Hyderabad, Telangana Pune, Maharashtra
- Permanent
- Full-time
- Collaborating closely with software and operations teams to improve end-to-end monitoring and alerting production services.
- They deliver lasting, preventative improvements that cross the development/operation team divides.
- They coordinate our response to service impacting incidents
- Routinely modifying configurations or systems in a way that produces lasting improvements from a one-time effort
- Applying their expertise and experience to assist with architecting the next generation of services
- Assisting with support escalation in high impacting incidents, coordinating SMEs and vendors as required
- Representing ITID “outwards” to manage quality of service delivered.
- Understand & analyze changes in technology & process across the Group / regions that would impact development & support of builds & tools.
- Collaborate with regional teams and global function as required. Ensure understanding of practices within regions and drive standardization amongst regions.
- Communicate project updates / progress, action plans / issues on timely basis.
- Organize & lead meetings with regional teams for development or support of deliverables.
- Escalation Management
- Proactively identify problem situations and resolve to give maximum customer satisfaction.
- Good communication skills to collaborate with Global and regional stakeholders
- Strong fundamentals in distributed systems and networking
- Experience programming in at least one of the following languages: Bash scripting, Python, Java Script, Java etc.
- Experience programing in APIs.
- Experience on DevOps tools like – Puppet, Ansible, Tanium, Git etc.
- Experience in monitoring solutions (Patrol, Truesight, BHOM, AppDynamics, Opensource tools) to create best-of-breed production monitoring, incident detection and response solutions.
- Develop and maintain tools used in problem investigation and remediation.
- DevOps – We build it / We support it. Participation in regular follow-the-sun on call rotas to ensure adequate out of hours cover for the services.
- Participate in the design and engineering of auto-healing solutions.