
Site Reliability Engineer - Sr. Consultant level, Middleware Reliability Engineering
- Bangalore, Karnataka
- Permanent
- Full-time
- 8+ years of relevant work experience with a Bachelor's Degree or at least 5 years of experience with an Advanced Degree (e.g. Masters, MBA, JD, MD) or 2 years of work experience with a PhD, OR 11+ years of relevant work experience.
- 5+ years of experience working with observability solutions (Prometheus, Grafana, Splunk, ELK)
- Strong Python coding experience for developing automation tools and integrations
- Working knowledge of middleware products (e.g., Tomcat, Apache, JBoss, Hazelcast, IBM DataPower)
- Experience with monitoring and logging infrastructure design and implementation
- Familiarity with public cloud platforms (AWS, Azure, GCP) and their monitoring capabilities
- Understanding of infrastructure-as-code principles using tools like Terraform or Ansible
- Experience with CI CD pipelines and practice
- Experience with OpenTelemetry (OTEL) implementation
- Knowledge of AI ML integration for operational tooling
- Experience with containerization technologies (Docker, Kubernetes)
- Background in Site Reliability Engineering or similar operational roles
- Experience with middleware performance tuning and optimization
- Understanding of security best practices for middleware components
- Problem-Solving Excellence: You possess exceptional analytical abilities to quickly identify and resolve complex technical issues, with a focus on root cause analysis and permanent solutions
- Automation Mindset: You're passionate about automating repetitive tasks and creating scalable solutions
- Continuous Learning: You thrive on learning new technologies and transferring knowledge to others
- Collaborative Approach: You work effectively with cross-functional teams and communicate clearly with both technical and non-technical stakeholders
- Operational Focus: You understand the importance of reliability and performance in production environments
- Adaptability: You're comfortable working in a dynamic environment and can prioritize multiple tasks effectively