Senior Site Reliability Engineer

Bangalore, Karnataka
Permanent
Full-time

1 month ago

About Ping Identity:At Ping Identity, we believe in making digital experiences both secure and seamless for all users, without compromise. We call this digital freedom. And it's not just something we provide our customers. It's something that inspires our company. People don't come here to join a culture that's built on digital freedom. They come to cultivate it.Our intelligent, cloud identity platform lets people shop, work, bank, and interact wherever and however they want. Without friction. Without fear.While protecting digital identities is at the core of our technology, protecting individual identities is at the core of our culture. One of our core values, Respect Individuality, reminds us to celebrate differences so you are empowered to bring your authentic self to work.We're headquartered in Denver, Colorado and we have offices and employees around the globe. We serve the largest, most demanding enterprises worldwide, including more than half of the Fortune 100. At Ping Identity, we're changing the way people and businesses think about cybersecurity, digital experiences, and identity and access management.As a Ping Identity SRE, you will be involved in every facet of our On-Demand SaaS services and will build, deploy, and maintain the infrastructure of one of the largest identity platforms in the world. We follow a DevOps model: our teams are integrated with development teams, and running continuous deployments daily, and SREs are expected to provide input in the product's design, development, deployment, and operations.Working within the Cloud Operations team, you'll build automated infrastructure and deployment processes. You'll be the expert on operational excellence and how systems can be built to be; redundant, scalable, and observable.You Will:

Maintain our production infrastructure hosted on AWS via code.
Create pipelines to deploy and manage global infrastructure.
Analyze complex system behavior, performance and application issues.
Develop observability, alerts and runbooks
Develop, maintain and administer modern infrastructure deployment tools.
Linux systems administration, configuration, troubleshooting and automation.
Capacity analysis and planning, traffic routing, and security policies for Ping's market leading Single Sign-On SaaS applications.
This position is part of an on-call rotation of 8 hours by 7 days a week.

You Have:

5-13 years of experience in Software Engineering, focusing on Site Reliability Engineering (SRE) or DevOps principles
At least 3-8 years of hands-on experience designing, deploying, and managing complex systems on Amazon Web Services (AWS).
Expert-level proficiency in provisioning and managing public cloud infrastructure using Infrastructure as Code (IaC) frameworks such as AWS CloudFormation and Terraform.
Proven ability to develop, test, and maintain robust automation scripts and tools to improve operational efficiency and reliability. That includes good experience with Python scripting.
Extensive hands-on experience with Containerization (Docker) and Container Orchestration (Kubernetes), including deployment, scaling, and troubleshooting of containerized applications.
Proficiency in designing and implementing server configuration management using tools like Puppet, Chef, or SaltStack, with a focus on idempotent and declarative configurations.
Strong experience with CI/CD pipeline design and implementation using tools such as GitLab CI/CD, Argo CD, Jenkins, or similar, promoting automated testing, deployment, and release strategies.
In-depth knowledge of Relational Databases (e.g., PostgreSQL, MySQL)
Solid understanding and practical application of Site Reliability Engineering (SRE) principles including SLOs, SLIs, error budgets, post-mortems, and incident response.
Demonstrated experience in a high-volume, mission-critical production service environment, with a strong focus on system resilience, fault tolerance, and disaster recovery.

Bonus Points If You Have:

Knowledge with observability tooling such as NewRelic, Grafana, and Cloudwatch.
Knowledge of Cassandra
Experience with distributed data systems and their unique challenges in a cloud environment.
Experience with security design principles and best practices for building secure, scalable, and resilient cloud-native applications.

Life at Ping:We believe in and facilitate a flexible, collaborative work environment. We're growing quickly, but remain true to the innovative, can-do startup values that got us here. Most importantly, we keep hiring talented, smart, fun, and genuinely nice people because that's who we want to succeed with every day.Here are just a few of the things that make Ping special:

A company culture that empowers you to do your best work.
Employee Resource Groups that create a sense of belonging for everyone.
Regular company and team bonding events.
Competitive benefits and perks.
Global volunteering and community initiatives

Our Benefits:

Generous PTO & Holiday Schedule
Parental Leave
Progressive Healthcare Options
Retirement Programs
Opportunity for Education Reimbursement
Commuter Offset (Specific locations)

Ping is the collective sum of all our individual experiences, backgrounds and influences and we pride ourselves in growing and learning together. We are committed to building an inclusive and diverse environment where everyone's individuality is respected and everyone has an Identity. In recruiting for new colleagues, we welcome the unique contributions you can bring and encourage you to be your best self.We are an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex including sexual orientation and gender identity, national origin, disability, protected Veteran Status, or any other characteristic protected by applicable federal, state, or local law.

Ping Identity

Apply Now