Site Reliability Engineer
Anaplan View all jobs
- Gurgaon, Haryana
- Permanent
- Full-time
- Devise and implement complex changes to our production platforms
- Work together on team projects to improve our automation
- Contribute to standards for our software operating environment
- Provide support and operations experience to development teams
- Troubleshoot and resolve incidents, and root-cause analysis
- Work on scripting and tools to improve automation and reliability
- Work with the other infrastructure teams to enhance our platform
- Experience of Linux administration will be a day-one skill
- Experience with Kubernetes administration
- Being comfortable in a scripting language suitable for automation tasks
- A degree in computing or science is helpful but not essential
- Experience with configuration management tools is helpful
- Experience with operating a production platform with live services
- Experience with common public clouds will also help especially with Azure.
- Experience with Production on-call duties as part of Incident Management.
- Hands-on experience with one or more public clouds (AWS, GCP, Azure)
- Experience with Event Streaming, Exception Management, and Integration
- Experience with Stream-processing and batch-processing frameworks such as
- Experience with configuration management, and infrastructure as code
- Knowledge of observability and monitoring best practices
- Prior experience mentoring or coaching other engineers
- Extend offers to candidates without an extensive interview process with a member of our recruitment team and a hiring manager via video or in person.
- Send job offers via email. All offers are first extended verbally by a member of our internal recruitment team whenever possible and then followed up via written communication.