
Sr. Site Reliability Engineer (all genders)
- Mohali, Punjab
- Permanent
- Full-time
- Service Reliability: Maintain service availability, system performance, and manage capacity-related matters. Involvement in designing and implementing SLOs and SLIs
- System Improvement: Develop and implement solutions to improve system reliability and scalability.
- Incident Response: Participate in on-call rotations and assist in incident management and resolution. Contribution to post-incident reviews (blameless post-mortems)
- Collaboration: Work closely with development teams to troubleshoot issues and enhance system performance.
- Automation: Contribute to the automation of processes to improve efficiency and scalability.
- Monitoring & Observability: Implement and maintain monitoring solutions using tools like New Relic, Kibana, Prometheus, Grafana, and ElasticSearch.
- Experience: 3-5 years in site reliability engineering or related areas.
- Education: Bachelor's degree in Computer Science, Engineering, or related field.
- Technical Skills:
- Proficiency in Java, Python, and familiarity with other coding languages.
- Experience with AWS cloud services and cloud engineering practices.
- Knowledge of monitoring tools (New Relic, Kibana, Prometheus, Grafana, ElasticSearch).
- Strong understanding of software development methodologies.
- Experience with infrastructure as code tools (e.g., Terraform, CloudFormation)
- Familiarity with containerization and orchestration (e.g., Docker, Kubernetes)
- Knowledge of networking and distributed systems
- Problem-Solving: Strong analytical skills and the ability to perform root cause analysis.
- Automation: Experience with scripting and automation to enhance operational efficiency.
- Teamwork: Ability to work effectively within a team and collaborate with cross-functional teams.
- Attention to Detail: High level of accuracy and thoroughness.
- Communication Skills: Clear and concise communication abilities.
- Learning Mindset: Eagerness to learn and apply new technologies.
- Proactive Approach: Initiative to identify issues before they become problems.