
Senior Site Reliability Engineer - Python, Azure and Linux
- Bangalore, Karnataka
- Permanent
- Full-time
- Deploying, managing, and securing Ivanti's production Software-as-a-Service (SaaS) environments in AWS and Azure
- Working with geographically dispersed, cross-departmental teams to solve difficult problems
- Automating common and repetitive tasks
- Writing documentation and training material
- Training other colleagues
- Participate in on-call rotations for 24x7 coverage (follow-the-sun model) for incident response, issue triage, and problem resolution
- A BSc in Computer Science, a related field, or equivalent practical experience
- 5+ years of relevant industry experience
- Proficiency with Python and experience with one of the following languages:
  - Java
  - Golang
  - C#
- Proficiency working with Bash or PowerShell programmatically
- Familiarity with public cloud platforms (AWS or Azure preferred)
- Experience troubleshooting Java and .NET applications
- Experience troubleshooting network and storage infrastructure issues
- Experience working with core Linux distributions (Debian, RHEL, SUSE, Slackware)
- Experience working with Windows
- Experience working with one or more: SQL Server, PostgreSQL, Redis, Kafka, MongoDB, Elasticsearch, or similar
- Ability to configure and fine-tune at least one: HAProxy, Apache, Nginx, IIS, or similar
- Ability to configure New Relic, Datadog, Splunk, or similar monitoring tools
- Familiarity with container orchestration technologies (AWS EKS or Azure AKS preferred)
- Experience with deployment pipeline tools such as Ansible, Jenkins, and/or GitHub Actions
- Proficiency working with and developing Infrastructure as Code (IaC)
- A desire to adopt and implement emergent technologies and best practices
- Strong verbal and written communication skills in English for the purposes of global collaboration
- Prior experience as a Site Reliability Engineer or DevOps Engineer
- Certificates in one or more of the following categories, or demonstrated certificate-equivalent knowledge:
  - Cloud development and architecture
  - Kubernetes administration
  - Linux administration
  - Software engineering disciplines
- Experience with compliance frameworks such as SOC 2 Type 2, ISO 27001, FedRAMP, or IRAP, and privacy regulations such as GDPR and PIPEDA
- Onboarding and role-training is complete
- You're building foundational knowledge of the SRE-run product portfolio
- You hold general knowledge of how SRE manages our SaaS environments
- You've gotten to know the team and are building relationships with SRE peer teams
- Self-sufficiency in core job functions and existing processes
- Participating in SRE on-call rotations
- Helping to handle SRE tickets through to fulfillment and taking responsibility for individual SRE tasks
- Actively participating in SRE stability discussions and interacting directly with SRE peers
- Contribute independently to improve reliability and compliance in our SaaS environments
- Demonstrate ownership of SRE ticket management including triage and resolution
- Lead one or more well-defined projects
- Identify areas where performance, scalability, security, and reliability can be improved in production systems and environments
- Mentor junior team members and contribute to internal knowledge-sharing sessions