Site Reliability Engineer (SRE)

Bangalore, Karnataka
Permanent
Full-time

10 days ago

Project description

Luxoft partners with a next-generation digital bank, built from the ground up to deliver seamless, secure, and scalable financial services. Our platform is cloud-native, API-first, and focused on reliability, speed, and security. We are growing fast and looking for top-tier Site Reliability / Ops Engineers to join our core team and help run and scale our infrastructure.

As a Site Reliability Engineer, you will be responsible for maintaining and scaling our core infrastructure, ensuring our banking services remain available, secure, and performant. You will work closely with development, product, and security teams to automate operations, manage cloud infrastructure, and uphold high availability standards.

Responsibilities

- Operate and optimize Kubernetes-based infrastructure using Helm for deployment and configuration management.
- Build and maintain CI/CD pipelines for infrastructure and application deployments.
- Manage and monitor cloud infrastructure on AWS (EKS, EC2, S3, IAM, VPC, etc.).
- Ensure observability through logging, monitoring, and alerting systems (e.g., Prometheus, Grafana, ELK).
- Implement and enforce security best practices across infrastructure components.
- Participate in on-call rotations, incident response, and root cause analysis.
- Support scaling of systems to meet demand while maintaining reliability.
- Collaborate with engineering and security teams on architecture and deployment strategies.

SKILLS

Must have

- 6-10+ years of experience in SRE, DevOps, or Infrastructure roles.
- Expertise in Kubernetes (EKS or self-managed) and Helm.
- Strong knowledge of networking concepts: TCP/IP, DNS, VPNs, firewalls, load balancing, etc.
- Hands-on experience with AWS services: EKS, EC2, IAM, S3, CloudWatch, VPC.
- Proficiency with Infrastructure-as-Code tools (Terraform, Pulumi, or CloudFormation).
- Familiarity with containerization (Docker) and CI/CD pipelines (e.g., GitHub Actions, ArgoCD, GitLab).
- Strong Linux administration and troubleshooting skills.
- Solid experience in production environments and real-time operations.

Nice to have

- Experience in regulated industries (banking, fintech, healthcare).
- Experience with incident management and disaster recovery.

Luxoft
