
DevOps Site Reliability Engineer
- Pune, Maharashtra
- Permanent
- Full-time
- Monitoring backend services (cloud-based infrastructure)
- Supporting, troubleshooting, and investigating issues and incidents (support developers and infra team with system metrics analysis, logs, traffic, configuration, deployment changes, etc)
- Supporting and improving monitoring/alerting systems (Searching, testing, deploying new functionality for existing tools)
- Creating new features for automating troubleshooting and investigation process
- Creating new tools to improve the support process
- Drafting reports and summarizing information after investigations and incidents
- At least 1 year of work experience with similar responsibilities
- Strong knowledge and practical experience in working with the Linux(Ubuntu) command-line/administration
- Understanding of network protocols and troubleshooting (TCP/IP, UDP)
- Strong scripting skills (Bash, Python)
- Critical thinking and problem solving
- Understanding of containerization (Docker, container)
- Experience with troubleshooting API driven services
- Experience with Kubernetes
- Experience with Git
- Background in release management processes
- English — Professional written and verbal skills
- Prometheus, Grafana, Kibana (Query language)
- Experience with Nginx/OpenResty
- Experience with telco protocols (Camel, Map, Diameter) from advantage
- Software development/scripting skills
- Basic knowledge Casandra, PostgreSQL
- Experience with using AWS cloud services (EC2, Redshift, S3, RDS, ELB/ALB, ElastiCache, Direct Connect, Route 53, Elastic IPs, etc.)
- CI/CD: Jenkins
- Terraform
- Purposeful Work: Every team member sees how their efforts make a tangible, positive difference for our customers and partners.
- Growth Opportunities: We provide the chance to develop professionally while mastering cutting-edge practices in cloud-native enterprise software.
- Website:
- LinkedIn:
- Instagram:
- Twitter: