
Site Reliability Engineer (SRE)
- Bangalore, Karnataka
- Permanent
- Full-time
- - Operate and optimize Kubernetes-based infrastructure using HELM for deployment and configuration management. - Build and maintain CI/CD pipelines for infrastructure and application deployments. - Manage and monitor cloud infrastructure on AWS (EKS, EC2, S3, IAM, VPC, etc.). - Ensure observability through logging, monitoring, and alerting systems (e.g., Prometheus, Grafana, ELK). - Implement and enforce security best practices across infrastructure components. - Participate in on-call rotations, incident response, and root cause analysis. - Support scaling of systems to meet demand while maintaining reliability. - Collaborate with engineering and security teams on architecture and deployment strategies.
- - 6 -10+ years of experience in SRE, DevOps, or Infrastructure roles. - Expertise in Kubernetes (EKS or self-managed) and HELM. - Strong knowledge of networking concepts: TCP/IP, DNS, VPNs, firewalls, load balancing, etc. - Hands-on experience with AWS services: EKS, EC2, IAM, S3, CloudWatch, VPC. - Proficiency with Infrastructure-as-Code tools (Terraform, Pulumi, or CloudFormation). - Familiarity with containerization (Docker) and CI/CD pipelines (e.g., GitHub Actions, ArgoCD, GitLab). - Strong Linux administration and troubleshooting skills. - Solid experience in production environments and real-time operations