Senior Site Reliability Engineer
Wolters Kluwer View all jobs
- Pune, Maharashtra
- Permanent
- Full-time
- Engineer reliability into a large-scale Azure SaaS platform
- Design, implement, and continuously improve monitoring, alerting, and observability solutions
- Define and improve SLI’s, SLO’s and error budgets together with engineering teams
- Build automation to reduce operational risk and eliminate manual toil
- Analyse incidents end-to-end and translate learnings into structural improvements
- Perform deep debugging and optimization of production issues across application code, services and infrastructure
- Improve how teams use metrics, logs and traces to understand system behaviour
- Collaborate closely with software engineers, platform engineers and support teams
- Contribute to incident response when needed, with a strong focus on learning and prevention
- Support deployment strategies and execution
- Provide advanced technical support to help user issues
- 5+ years of experience as a Site Reliability Engineer
- Strong experience with monitoring, alerting and observability in production environments
- Experience with Datadog, Grafana, Log Analytics and/or Prometheus
- Proven ability to design and work with SLI’s, SLO’s and reliability metrics
- Hands-on coding experience (preferable C#/.NET, but not required) in production environments
- Experience building automation to improve system reliability and reduce toil
- Experience working with preferable Microsoft Azure or in another major public cloud providers like AWS, GCP
- Comfortable working with live production systems and customer data
- Understanding of performance optimization techniques
- Excellent communication skills, including direct interaction with users
- Strong cross‑functional collaboration skills
- Experience with distributed systems or large SaaS platforms
- SQL knowledge and understanding of relational databases
- Experience working in regulated or compliance-sensitive environments
- You work on a mission-critical SaaS platform with real customer impact
- You influence how reliability and observability are engineered into software
- You help shape SRE and reliability practices in a growing, evolving team
- You get ownership, trust and space to improve how we build and operate software
- You work in an open, pragmatic, international and engineering-driven culture