Manager Site Reliability Engineering

Bangalore, Karnataka
Permanent
Full-time

1 month ago

Flexera saves customers billions of dollars in wasted technology spend. A pioneer in Hybrid ITAM and FinOps, Flexera provides award-winning, data-oriented SaaS solutions for technology value optimization (TVO), enabling IT, finance, procurement and cloud teams to gain deep insights into cost optimization, compliance and risks for each business service. Flexera One solutions are built on a set of definitive customer, supplier and industry data, powered by our Technology Intelligence Platform, that enables organizations to visualize their Enterprise Technology Blueprint™ in hybrid environments—from on-premises to SaaS to containers to cloud.We’re transforming the software industry. We’re Flexera. With more than 50,000 customers across the world, we’re achieving that goal. But we know we can’t do any of that without our team. Ready to help us re-imagine the industry during a time of substantial growth and ambitious plans? Come and see why we’re consistently recognized by Gartner, Forrester and IDC as a category leader in the marketplace. Learn more at flexera.comAbout UsWe're a fast-growing, category-leading organization with ambitious objectives and a positive, inclusive culture. We're looking for passionate professionals who want to grow their talents and achieve great things. If that sounds like you, we want to talk to you about joining our team.The Cloud Enablement team is responsible for accelerating the delivery and improving the operation of our cloud-based software by providing and supporting tools and patterns which reduce the cognitive load on our development teams. We free up our developers to focus on solving problems for our customers rather than spending time on extraneous tasks. Drawing on the shared experience and expertise from our organization and industry; we create, support and evolve the paved path for teams to build, deploy and run secure and reliable software. The team owns, operates and manages 100s of cloud accounts across AWS, Azure and GCP.Position Overview:We are seeking a strategic and technically experienced Manager to build, grow and lead a cloud enablement team. This team is responsible for cloud accounts, platform reliability, infrastructure architecture including Databricks & PowerBI, FinOps, RBAC, and governance compliance across Flexera’s multi-cloud footprint.You will drive the team’s mission to deliver secure, scalable, and cost-effective cloud platforms while ensuring compliance and operational excellence. This includes leading cloud governance and FinOps practices, and collaborating across engineering, finance, and security functions to align infrastructure with business goals. It is expected that this role will be 50/50 Hands-on technical and management, therefore the ideal candidate will be highly experienced in leading technology as well as capable to lead a team.What will you do?

Lead and mentor a team of Site Reliability Engineers focused on: Cloud account lifecycle management (AWS, Azure & GCP) and access controls, Platform reliability and operational excellence, Infrastructure architecture and governance, RBAC and compliance enforcement etc and runs the core systems that each of our engineering teams leverage.
Own the architecture and operational integrity of cloud-native platforms, including Databricks and Power BI.
Define and enforce governance policies including tagging, RBAC, compliance, and security standards.
Drive FinOps maturity through cost visibility, forecasting, anomaly detection, and optimisation.
Collaborate with SRE, Security, Finance, and Engineering teams to align infrastructure with business and financial goals.
Champion automation and Infrastructure-as-Code (IaC) to improve deployment velocity and reduce manual overhead.
Partners with security and other “shared services” teams to align, automate, integrate and orchestrate specialist tooling into a common set of SRE best practices that supports the wider Software Delivery Lifecycle and Product Lifecycle.
Plan and execute projects in support of the SRE objectives, and ensure projects are delivered with high quality, on time, and within budget
Hire, develop and retain a highly skilled SRE team
Evaluate hardware and software technologies to improve efficiency and performance
Contribute to platform security

You have

Developer/DevOps/SRE/Platform experience and a strong interest in software delivery and ongoing operation.
Owned and led the architecting and rolling out of automation, tools, technologies, patterns and guardrails across an organisation.
Experience working in a globally distributed team.
Deep & extensive public cloud knowledge & experience on either AWS, Azure or GCP.
Deep knowledge of containers (Docker) orchestration (Kubernetes).
Knowledge of tools and patterns around CI/CD (familiar with GitHub, Travis CI, Circle CI, Buildkite or similar).
Cloud cost optimisation: Using automation to keep Cloud cost under control and within budget. Enabling individual Engineering teams with cloud cost optimisation.
Knowledge of operations, including incident management, immutable infrastructure as code (esp. Terraform or CloudFormation), and problem-solving.
Produced robust well-tested code preferably in Golang; however, we will also consider Python, JavaScript, Ruby, Java or C# if you are happy to learn Go.
Excellent communication skills, including experience in writing good documentation and running workshops.
Vendor selection and management experience.

Required skills and knowledge:

Bachelor's or higher degree in Computer Science, Information Technology, or a related field.
Background in centralized Site Reliability Engineering or Platform Engineering supporting globally distributed engineering teams
At least 1+ years’ experience leading a team of Site Reliability Engineers
At least 2 years of experience working as a senior member of a centralized Cloud enablement / Platform or a similar team
At least 8+ years’ experience in SRE/DevOps/Platform Engineering in cloud environments
Experience with IaC and Containers to achieve scalable, reliable, performant and secure SaaS platform infrastructure

Bonus Skills:The following list of items are not pre-requisites for the role but might give you a bit more of an idea about what you may expect to come across in your SRE role at Flexera:

Python / Golang / Java / C# / C / C++ / Bash experience
Big Data, Machine Learning, AI (DataBricks, Snowflake etc.) Platforms
Experience with Monitoring systems such as New Relic, ELK, Prometheus, Datadog, X-ray etc.
Security background
SQL, NOSQL and Graph databases
Relevant Certification e.g. AWS, GCP, Azure (Professional or higher)

Flexera is proud to be an equal opportunity employer. Qualified applicants will be considered for open roles regardless of age, ancestry, color, family or medical care leave, gender identity or expression, genetic information, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran status, race, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by local/national laws, policies and/or regulations.Flexera understands the value that results from employing a diverse, equitable, and inclusive workforce. We recognize that equity necessitates acknowledging past exclusion and that inclusion requires intentional effort. Our DEI (Diversity, Equity, and Inclusion) council is the driving force behind our commitment to championing policies and practices that foster a welcoming environment for all.We encourage candidates requiring accommodations to please let us know by emailing .

Flexera

Apply Now