Azure Kubernetes Services Operations

Diverse Lynx

  • India
  • Permanent
  • Full-time
  • 2 months ago
Kubernetes Cluster Administrator (L3 & L2)
JC - 75008 , 75016, 75013.
Location: Pune, Coimbatore, Bengaluru Shift: 24/7 rotational shift (includes weekends, with two days off during the week)
Job Overview
The Kubernetes Cluster Administrator will manage, optimize, and secure Kubernetes environments deployed across Azure and AWS cloud platforms. This role requires expertise in cluster operations, high availability strategies, and security enforcement to support critical banking infrastructure.
Key Responsibilities
  • Deploy, configure, and maintain Kubernetes clusters in Azure and AWS environments.
  • Ensure high availability and fault tolerance of Kubernetes workloads.
  • Implement Infrastructure Availability Monitoring with tools like Prometheus, Grafana, and ELK stack.
  • Execute Infra Patch Management for Kubernetes control plane, worker nodes, and containerized workloads.
  • Develop Disaster Recovery (DR) strategies ensuring backup and failover solutions for clusters.
  • Strengthen Infra Vulnerability Management by applying security best practices in containerized environments.
  • Manage cluster networking policies, ingress configurations, and service mesh implementations.
  • Oversee container storage provisioning, ensuring efficient use of persistent storage solutions.
  • Optimize Cloud Cost Management, reducing overhead by right-sizing cluster resources.
  • Conduct Capacity Planning, ensuring clusters scale efficiently to meet workload demands.
  • Improve Infrastructure Performance Management by tuning node configurations and resource allocation.
  • Automate cluster operations using Kubectl, Helm, Terraform, and GitOps workflows.
  • Troubleshoot complex Kubernetes cluster issues, ensuring seamless operations.
  • Implement RBAC and access control policies to secure Kubernetes environments.
  • Monitor pods, deployments, and services for performance optimization.
  • Maintain compliance with banking industry security standards and regulatory requirements.
  • Develop Standard Operating Procedures (SOPs) for Kubernetes administration and troubleshooting.
  • Collaborate with application teams on deployment best practices and scalability strategies.
  • Conduct regular knowledge-sharing sessions to enhance Kubernetes expertise across teams.
  • Work in a collaborative environment ensuring operational excellence in Kubernetes management.
Required Skills & Expertise
  • Expertise in Kubernetes cluster administration in cloud environments (Azure, AWS).
  • Understanding of Helm, Istio, service meshes, and container orchestration.
  • Experience with container networking, security policies, and role-based access control (RBAC).
  • Proficiency in infrastructure automation tools like Terraform, Ansible, and GitOps workflows.
  • Troubleshooting skills for diagnosing Kubernetes and container-related issues.
  • Strong communication skills with the ability to work collaboratively in a team, and willingness to work night shifts on a rotational basis.

Diverse Lynx