Site Reliability Engineer
Alter Domus
- Hyderabad, Telangana
- Permanent
- Full-time
Here at Alter Domus, you will be part of a dynamic team of software engineers and IT experts with different backgrounds, within the IT Client Support Ops branch. It is an exciting moment to accept this challenge: our fast-growing company is on an innovation and cloud journey, which will give you the chance to work across different scenarios and technologies, from legacy on-premise applications to Kubernetes apps hosted on AKS, all of which will contribute to your professional growth.
In this role, you should be able to troubleshoot with a sharp eye for spotting defects. You should be a team player and an excellent communicator. If you are also passionate about coding and getting things done, we would like to meet you.
Responsibilities
- Design, develop, and maintain software that automates the management of our support operations; in other words, reduce toil.
- Troubleshoot and diagnose software issues, and actively work on hotfixes and other code amendments whenever necessary.
- Work closely with our tech support and development team to ensure that our systems are reliable and performant.
- Monitor our systems for issues and take corrective action when necessary.
- Continuously improve our systems to ensure that they are scalable and reliable.
- Instrument our applications with telemetry to enhance observability.
- Operate APM tools and implement alerts, dashboards and reports based on telemetry data.
- Document technical knowledge in the form of notes, wikis and manuals.
- Maintain a customer-oriented attitude: our team members understand that putting customers first is part of the team’s DNA.
- Mentor junior engineers and help them grow in their careers.
Requirements
- Bachelor’s degree in Computer Science or related field.
- 5+ years of experience in software development, DevOps or site reliability.
- Excellent troubleshooting and communication skills.
- Experience with cloud computing platforms such as AWS or Azure.
- Professional experience with at least one OOP language and web development.
- Professional experience with telemetry instrumentation.
- Professional experience with monitoring tools (APM, DPA).
Benefits
- Support for professional accreditations such as ACCA and study leave
- Flexible arrangements, generous holidays, birthday leave and graduation leave
- Continuous mentoring along your career progression
- Active sports, events and social committees across our offices
- Mental, physical, emotional and financial support 24/7 from our Employee Assistance Program
- The opportunity to invest in our growth and success through our Employee Share Plan
- Plus additional local benefits depending on your location