
Senior Site Reliability Engineer
- Hyderabad, Telangana
- Permanent
- Full-time
- Architect, build, document, and maintain Cloud standards and processes
- Lead projects and new application implementations
- Create new Terraform architecture and modules to provision AWS resources
- Create, manage, and administer Kubernetes clusters running on EKS
- Create and modify Jenkins pipelines to support CI and automation
- Work with Software Development teams to write and tune their application Helm charts for EKS
- Performance Engineering, load testing, hotspot isolation, and remediation
- Guide teams on best practices in the cloud
- Build proofs of concept (POCs) for new solutions and productionize them in the cloud
- Configure APM, SLOs, SLAs, and alerting via Dynatrace
- Configure log metrics and analysis via Splunk
- Build and manage CI deployment process for all environments
- Support and enable teams to migrate from on-prem environments into AWS
- You will be reporting to a Senior Manager
- 8+ years of experience with Terraform
- Expert-level experience with AWS services: EC2, ASG, SG, ALB/NLB/WAF, ACL, Routing, Route53, Express Connect/Transit Gateway, EC2 Image Builder, EKS, ECS, ECR, Lambda
- Experienced in writing Jenkinsfiles and Jenkins Shared Libraries
- Expert-level skills in EKS creation and administration
- Expert-level skills in Kubernetes application deployment and management
- Experienced in writing and maintaining custom application Helm charts and Helm template libraries
- AWS
- Kubernetes
- Jenkins
- Terraform
- EKS
- Helm