
IT Manager – IT Operations Engineer – Supporting Platforms (API / ODM)
- Chennai, Tamil Nadu
- Permanent
- Full-time
- Continuous observation of our systems regarding availability, performance, system usage and costs
- Definition, design and implementation of observability / monitoring regarding Service Levels (SLIs / SLOs / SLAs)
- Integration in central observability solutions e.g.: Datadog, Elastic, …
- Reporting of availability, performance, system usage and costs on a regular basis
- Planning, coordination and implementation of system updates in collaboration with our vendors and suppliers.
- Take care of keeping our system secure by fixing vulnerabilities in collaboration with our CISO department
- Take care of housekeeping tasks
- Drive automation regarding paradigms like CaC/IaC (Configuration as Code / Infrastructure as Code) to ensure the lowest possible degree of error prone manual work.
- Optimize our CI/CD pipeline
- Take over responsibility of coordination & solving incidents to keep “Mean-Time-To-Repair” and user impact as low as possible
- Drive and support problem management to ensure system reliability and prevent reoccurring incidents
- Take over responsibility of service request handling
- Driving continuous improvement of our platform regarding to scalability, reliability & cost-efficiency
- Strong analytical and problem-solving skills
- Team-oriented with excellent communication and collaboration skills
- Ability to build pro-active, co-operative working relationships with customers, peers and key stakeholders based on respect and teamwork
- Ability to act under pressure and to manage efficiently crisis situations
- Able to evaluate information, identify key issues and formulate conclusions based on sound, practical judgment, experience, and common sense
- Extensive experience in operations of business critical and cloud-based platforms (monitoring, maintenance, improvement, troubleshooting, …) on an enterprise scale
- Extensive experience with AWS cloud and container runtimes like ROSA (Red Hat Open Shift on AWS)
- Good Knowledge in end-to-end monitoring of applications and systems with enterprise observability tools (e.g. Datadog, Elastic, Prometheus, Grafana)
- Experience with automation tools such as Terraform or Ansible is an advantage
- Experience in software development and the tools used, such as version management, CI/CD, planning and collaboration tools (e.g. Git, Jenkins, Jira, Confluence, ...)
- Excellent communication, problem-solving, and stakeholder management skills.
- Bachelor's or Master's degree in computer science, Engineering, or related discipline
- English language - expert proficiency (additional languages are beneficial)
- Competitive salary
- Self & Family Health Insurance
- Term & Life Insurance
- OPD Benefits
- Employees' Deposit Linked Insurance Scheme (EDLI)
- Learning & Development through HL Academy
- Flexible Work from Home
- Leave Travel Allowance
- Variable performance bonus
- Recreation facilities
- Privilege, Casual and Sick leaves