Principal Site Reliability Engineer - (Linux | Networking | Python | SRE | Terraform | Ansible)
Zscaler
- Bangalore, Karnataka
- Permanent
- Full-time
- Design and deploy various customer facing Linux and BSD based systems.
- Management of container-based architecture (AWS ECS and Kubernetes).
- Create and deploy scalable monitoring systems.
- Architect and implement various cloud management automations.
- Contribute to OS and software packaging and distribution.
- Write and maintain Ops documentation.
- Resolve escalations and help prevent reiteration of incidents with process, monitoring and reliability improvements.
- Contribute and implement DevOps best practices within the group.
- Work as a member of a cross-functional project team contributing to the technology-based solutions and consult on concept feasibility.
- Minimum of 12 years of relevant experience in designing, analyzing, and troubleshooting large-scale distributed systems
- Hands-on experience with infrastructure as code
- Good exp in Terraform, ansible, familiar with automation in Python, well understand network basics, Kubernetes, and AWS cloud.
- Should have worked in a production environment
- Knowledge of Virtualization, Cloud Architecture, and Services, Automated Deployments
- Understanding of web security and protocols HTTP, SSL/TLS, DNS, SQL, and networking fundamentals
- Rich DevOps skills across CI/CD, SCM, Builds and Releases, Continuous Integration Tools and frameworks