Developer III - SRE DevOps Engineering

UST

  • Pune, Maharashtra Thiruvananthapuram, Kerala
  • Permanent
  • Full-time
  • 20 days ago
Job Description:Sr. Site Reliability Engineer Offshore – Senior Site Reliability Engineer is responsible for meaningfully contributing and providing continuous feedback on site health, reliability, availability and user experience client products. This is a matrixed role where the SRE will work closely on a day-to-day basis with the product team while reporting to the practice lead. This role is expected to understand the product in depth, collect and analyze meaningful measurements and provide feedback to the business, Software Engineering and Product teams. The SRE will work very closely with the key stakeholders to help drive changes to increase customer satisfaction, product availability, reliability, and the completion of strategic technical initiatives. In addition to monitoring and integration with the observability platform, a heavy focus will be placed on automation opportunities and automating operational processes to maintain high availability of the product. Technical 1. General knowledge of most technical expertise areas, with deep knowledge in at least two areas1. Advanced Terraform syntax , Ansible (syntax, tasks, playbooks) and CI/CD configuration, pipelines, jobs.1. Advanced knowledge of cloud services (preferably Azure)2. Monitoring Dynatrace, Azure App Insight, Prometheus, and Grafana: service catalog metrics and recording rules for s3. Log shipping pipelines and incident debugging visualizations2. Ability to understand and Contribute improvements to the codebase to resolve issues. Execution1. Performs application specific SRE support, RCAs, and service restoration as needed to quickly respond to and resolve production issues.2. Plan and achieve high availability, performance, and availability of the product service.3. Ensure pro-active monitoring of all core services and processes to prevent un-planned service disruption.4. Implement self-healing and scalability of technical services to avoid un-planned disruptions.5. Identifies significant projects that result in substantial improvements in reliability, cost savings and/or revenue.6. Identifies changes for the product architecture from the reliability, performance and availability perspectives with a data driven approach.7. Influences the product roadmap and works with engineering and product counterparts to influence improved resiliency and reliability of the product.8. Proactively work on the efficiency and capacity planning to set clear requirements and optimize the system resources usage.9. Identify Service Level Indicators (SLIs) that will align the team to meet the availability and latency objectives.10. Provide detailed analysis and troubleshooting for systems outages providing feedback to product/software engineering Collaboration and Communication:1. Leads initiatives and problem definition and scoping, design, and planning through epics and blueprints.2. Deep domain knowledge and radiation that knowledge through recorded demos, technical presentations, discussions, and3. Perform and run blameless RCAs on incidents and outages aggressively looking for answers that will prevent the incident from ever happening again.4. For stable counterpart assignments, maintain awareness and actively influence stage group plans and priorities through participation in stage group meetings and async discussions. Act as a champion for reliability.5. Set an example for team of SREs with positive and inclusive leadership and discussion on work.Experience & Education1. 5+ years of software Engineering or Site Reliability Engineering experience2. Bachelor’s degree in Computer Science, Information Technology or equivalent experience plus certifications3. Understanding of web hosting infrastructure and architecture in highly available environments4. Working knowledge and experience C#, Javascript, and HTML5. Experience with one of the Public Cloud architectures (Azure experience highly desired)6. Familiarity with RESTful API and .Net Applications.7. Experience working with Dynatrace, Azure monitor, AppInsight, log analytics (highly Desirable)Skills:Application Support Services,C#,API Rest,Azure CloudAbout Company:UST is a global digital transformation solutions provider. For more than 20 years, UST has worked side by side with the world’s best companies to make a real impact through transformation. Powered by technology, inspired by people and led by purpose, UST partners with their clients from design to operation. With deep domain expertise and a future-proof philosophy, UST embeds innovation and agility into their clients’ organizations. With over 30,000 employees in 30 countries, UST builds for boundless impact—touching billions of lives in the process.

UST

Similar Jobs

  • Developer III - DevOps Engineering

    UST

    • Pune, Maharashtra
    • Thiruvananthapuram, Kerala
    Job Description: Role Proficiency: Acts under minimum guidance of DevOps Architect to set up and manage DevOps tools and pipelines. Outcomes: * Interpret the DevOps Tool/feat…
    • 17 days ago
  • Lead AI Developer and DevOps

    dentsu

    • Pune, Maharashtra
    The purpose of this role is to lead the collaboration with ML Engineers and DevOps Engineers to formulate AI designs that can be built, tested and deployed through the Route to Liv…
    • 1 day ago