Lead Site Reliability Engineer

Delta Air Lines

  • Bangalore, Karnataka
  • Permanent
  • Full-time
  • 5 days ago
About Delta Air Lines

Delta Air Lines (NYSE: DAL) is the U.S. global airline leader in safety, innovation, reliability and customer experience. Powered by our employees around the world, Delta has for a decade led the airline industry in operational excellence while maintaining our reputation for award-winning customer service. With our mission of connecting the people and cultures of the globe, Delta strives to foster understanding across a diverse world and serve as a force for social good.

In 2024, Delta was recognized by Fortune with a No. 11 placement on Fortune’s Top 50 Most Admired Companies list. The company’s strong management and commitment to providing elevated experiences and premium products also secured a No. 1 ranking out of the eight airlines on the list. Delta landed on TIME’s inaugural list of the “World’s Best Companies,” coming in at No. 12 – the only U.S. airline in the top 155. TIME’s award is based on three criteria: employee satisfaction, revenue growth and sustainability.

Delta’s people-first culture continues to be recognized, earning the airline a spot on Fortune’s 100 Best Companies to Work For® list for the fifth year. Delta is the only airline included on the 2024 list.

Additionally, Delta earned a coveted spot on Fast Company’s list of the Most Innovative Companies, climbing from its No. 8 spot in 2023 to No. 2 in the travel category. The airline was recognized for its Wi-Fi revolution that is working to ensure the future of travel is connected.

About the Delta Technology Hub (DTH), Bangalore

Delta has fast emerged as a customer-focused, innovation-led, technology-enabled business. The Delta Technology Hub will contribute directly to these objectives. It will sustain our long-term aspirations of delivering niche, IP-intensive, high-value, and innovative solutions. It supports various teams and functions across Delta and is an integral part of our transformation agenda, working seamlessly with a global team to create memorable experiences for customers.

Job Description

Key Responsibilities:
  • Execute the Incident, Change, and Problem Management processes
  • Build and support reliable applications that meet development and maintenance requirements
  • Provide consultation and direct technical support on life cycle planning, problem management, integration, and systems programming
  • Ensure platform performance and availability meet enterprise objectives through monitoring, timely service restoration, and tuning
  • Continuously improve and implement automation of application tasks
  • Provide technical support for systems and platforms according to application SLAs
  • Design and develop resiliency in the application code, troubleshoot incidents, engage with squads to address failure patterns, and participate in incident management
  • Bring strong troubleshooting ability
  • Lead triage, or contribute to it, in a logical fashion
  • Focus on resolving issues before they become incidents
  • Identify and articulate severity of impacts using provided monitoring tools and escalate as needed
  • Understand the architecture and design of applications, and identify or narrow the focus for an incident based on symptoms
  • Perform root cause analysis to quickly recover from service interruptions, and to prevent recurring problems
  • Monitor, manage, and tune platforms to ensure expected availability and performance levels are achieved
  • Identify gaps in monitoring or documentation and reach out to the appropriate teams to fill those gaps
  • Implement changes to platforms with minimal impact on the business by following enterprise standards and procedures
  • Design and document enterprise standards and procedures
Minimum Qualifications:
  • Bachelor's degree or industry certification in an applicable IT field, in addition to seven years of applicable experience in the design/administration/support of one or more platforms; or bachelor's degree in an IT field, in addition to five years of applicable experience in the design/administration/support of one or more platforms; or seven years of equivalent in-depth experience in the above-related areas
  • 4 or more years of experience as a Systems Engineer, Developer, or Site Reliability Engineer
  • 3 or more years of experience with ops automation using a scripting language such as Python or Ansible
  • 2 or more years of experience working with production environments in AWS
  • 2 or more years of experience designing Grafana dashboards for application monitoring and observability
  • 2 or more years of experience creating synthetic transaction monitoring checks
  • 2 or more years of experience working with an observability system such as CloudWatch or Dynatrace
  • Site Reliability Engineering: Knowledge of the theories and methodologies of reliability engineering; ability to design, develop and support various tools, services and applications to maintain a reliable site environment.
  • Performance Measurement and Tuning: Knowledge of system performance, testing and programming; ability to monitor, measure, and optimize system performance and network communication.
  • CI/CD Pipeline: Knowledge of the concepts, values and tools applied in building Continuous Integration (CI) and Continuous Delivery/Continuous Deployment (CD) pipelines; ability to design, build, implement and maintain CI/CD pipelines to automate the software delivery process.
  • Software Release Management: Knowledge of strategies, practices and tools for managing versions and distribution of software products and enhancements; ability to evaluate and improve release management practices and tools
  • Application Maintenance: Knowledge of production applications; ability to monitor application functions and resolve issues to maintain optimal conditions for system applications.
  • Software Engineering: Knowledge of software engineering; ability to deliver new or enhanced software products.
  • Agile Development: Knowledge of agile methodologies and the agile development lifecycle; ability to utilize formal agile methodologies, disciplines, practices and techniques for the delivery of new and enhanced applications.
  • Embraces diverse people, thinking and styles
Preferred Qualifications:
  • Experience with airline applications and infrastructure technology
  • Experience in leading Incident Management
  • Master's degree in Computer Science, Information Technology, or a related field
  • Experience in cloud cost optimization is a plus
  • 1 or more years of experience leading root cause analysis (RCA) and postmortem investigations
