
Site Reliability Engineer III
- Bangalore, Karnataka
- Permanent
- Full-time
- Execute small to medium projects independently with initial guidance, eventually progressing to designing and delivering projects autonomously.
- Utilize technology to address business challenges by writing high-quality, maintainable, and robust code following software engineering best practices.
- Participate in triaging, examining, diagnosing, and resolving incidents, collaborating with others to address problems at their root.
- Identify toil within your role and proactively work towards eliminating it through systems engineering or updating application code.
- Understand observability patterns and strive to implement and improve service level indicators, objectives monitoring, and alerting solutions for optimal transparency and analysis.
- Formal training or certification on Site Reliability engineering concepts and 3+ years applied experience.
- Proficiency in coding in at least one programming language.
- Experience in maintaining a cloud-based infrastructure.
- Familiarity with site reliability concepts, principles, and practices.
- Knowledge of observability, including white and black box monitoring, service level objective alerting, and telemetry collection using tools such as Grafana, Dynatrace, Prometheus, Datadog, Splunk, and others.
- Familiarity with containers or server OS such as Linux and Windows.
- Emerging knowledge of software, applications, and technical processes within a technical discipline (e.g., Cloud, AI, Android).
- Emerging knowledge of CI/CD tools like Jenkins, GitLab, or Terraform.
- Emerging knowledge of common networking technologies.
- Ability to work collaboratively in a large team and vocalize ideas with peers and managers.
- Understanding of prioritizing work plans, eagerness to learn, and ability to apply system processes and methodologies.