
Lead Enterprise Software Engineer
- India
- Permanent
- Full-time
- Implementation/support/maintenance of AIOps/ system which is used by multiple development and operations teams.
- Implementation of plugins / integration components to existing monitoring solutions (commercial and custom).
- Work with product team in containerizing applications, debugging issues and assisting with app transformation.
- Implement full stack monitoring to ensure infra, cloud platform, OS, apps telemetry
- Work to continually improve time to market products and releases by proposing innovative solutions to automate
- Provide input into architecture and engineering standards
- Prepare and maintain the architectural documentation
- Explore new technologies, development patterns, and partake in pilots/POC/technology evaluations
- Coordinate and assist in complex troubleshooting in production support/operations
- IaC Development & Automation
- Design, develop, and maintain Infrastructure as Code (IaC) modules using tools such as Terraform, ARM, Bicep, CloudFormation, and Ansible for multi-cloud environments.
- Build Just-In-Time reusable ADO pipelines integrated with YAML-based deployment patterns and application-driven inputs for automated recovery and provisioning.
- Disaster Recovery & AIops Engineering
- Lead technical design and implementation of automated DRA pipelines including failover/failback strategies across AWS and Azure.
- Implement infrastructure workflows as part of AIOps for automated trend analysis, load balancer control, firewall posture tracking, and vulnerability remediation.
- Production Reliability & Incident Support
- Act as Tier-4 engineering escalation point for complex production incidents, root cause analysis, and recovery automation.
- Develop self-healing mechanisms and monitoring enhancements using cloud-native tools (e.g., AWS CloudWatch, Azure Monitor, Tanium).
- Ability to debug and handle complex incidents in production environments with composure.
- Security & Compliance Integration
- Support infrastructure compliance with frameworks such as NIST, SOC2, PCI-DSS, by embedding automated posture checks and remediation in CI/CD.
- Participate in weekly vulnerability reviews, risk register updates, and remediation rollout across cloud environments.
- Collaboration & Documentation
- Collaborate with product owners, QA, business analysts, and application teams to translate functional use cases into automated infrastructure solutions.
- Maintain documentation for automation workflows, pipeline standards, and reusable modules.
- Certifications: Terraform Associate, AWS/Azure Architect, CKA/CKAD preferred.
- Experience with AI/ML-driven operations tools or auto-remediation frameworks.
- Exposure to SRE practices, Observability stack, and incident automation.
- Proactive ownership of initiatives with a delivery-first mindset.
- Strong cross-functional communication and ability to work in agile pods.
- Clear documentation and reporting skills (contributing to audit and security reviews).
- Actively explore new trends and identify new ways of solving old problems
- Serves as a peer leader to other developers and develops technical skills and practices for high-quality software development
- Liaise with other teams and stakeholders, adhering to standards and coordinating approaches. participate in Agile activities and assist the development and testing process changes as and when required.
- Performs other duties as assigned by management
- Preferred: Bachelor's degree in Computer Science, Information Systems, or a related field
- 10+ years’ experience in Software Engineering, Cloud DevOps Role
- 6+ years’ experience in writing IaC templates
- 6+ years’ writing programmatic deployment artifacts for enterprise applications
- 6+ years’ experience working with CI/CD tools (Azure DevOps, GitLab, GitHub, Jenkins or other well-known tools)
- 6+ years’ experience with Kubernetes and Prometheus
- 6+ years’ experience in Monitoring systems: Zabbix and/or Prometheus; (experience in Data Dog / New Relic / Dynatrace / AppDynamics will be plus), cloud native monitoring solutions (AWS CloudWatch/X-Ray, Azure Monitoring);
- 5+ years’ experience in Logging solutions: preferable Elasticsearch
- 4+ years’ experience in Visualization tools: Grafana/Kibana
- 4+ years’ experience in Scripting: Bash and/or PowerShell, Python
- 4+ years’ strong experience in Architectural
- 4+ years’ strong experience in Networking skills
- 4+ years’ strong experience in Linux
- A strong DevOps experience on AWS & Azure
- Expertise in a SCM/CI/CD tool like GitHub, Gitlab, Jenkins
- 4+ experience with SQL
- 2+ Certifications in AWS and Azure
- Able to work independently, Lead the initiatives, Mentor/help the Org to build next gen team
- Individual Contributor
- Lead