
Team Member - SRE - IT - Mumbai
- Goregaon, Maharashtra Mumbai, Maharashtra
- Permanent
- Full-time
- Monitor, manage, and ensure the reliability and performance of cloud-based systems on Azure.
- Implement and maintain tools for monitoring, logging, and alerting, using Azure Monitor, Application Insights, and related tools.
- Automate routine operational tasks, including deployments, monitoring, and incident response.
- Work closely with development and DevOps teams to implement best practices for reliability and availability.
- Troubleshoot and resolve incidents, performing root cause analysis to prevent recurrence.
- Optimize cloud infrastructure for cost-efficiency, scalability, and performance.
- Design and maintain disaster recovery and backup strategies on Azure.
- Define and enforce service-level objectives (SLOs) and indicators (SLIs) to measure system performance.
- Support CI/CD pipelines and deployment processes, ensuring smooth operations in production environments.
- Stay current with new Azure features, tools, and industry best practices.