Required Skills & Experience:5-8 years of hands-on experience in DevOps and Cloud technologiesDeep expertise in at least one major cloud platform: AWS, Azure, or GCP, and strong knowledge of infrastructure automation tools such as Terraform, Pulumi, CloudFormation, and HelmProven experience in designing and managing high-scale production infrastructure systems and extensive hands-on experience with Kubernetes in production environments, including deployment, scaling, monitoring, and troubleshootingSolid debugging and troubleshooting skills across infrastructure and applications, deep working knowledge of Linux systems and networking conceptsProficiency in at least one scripting or programming language: Python, Shell, Go, or Java,Experience with monitoring and observability tools like DataDog, New Relic, ELK, Prometheus/GrafanaFamiliarity with modern cloud-native architecture patterns, including microservices and RESTful APIs, strong problem-solving mindset and ability to thrive in a fast-paced, dynamic environmentLeadership skills to lead infrastructure automation, including provisioning, capacity planning, demand forecasting, and cost optimization, ability to design and implement secure, resilient, and highly scalable infrastructureExperience in setting high standards for engineering through code reviews, documentation, and building self-service automation tools, ability to participate in and drive blameless postmortem sessions and sustainable incident response practiceKey ResponsibilitiesDesign and implement secure, resilient, and highly scalable infrastructure, ensuring robust security measures are in place to protect against cyber attacks, lead infrastructure automation, including provisioning, capacity planning, demand forecasting, and cost optimization, ensuring efficiencies and cost-savings for the organizationDevelop automation tools and frameworks to improve system observability, reliability, availability, and performance, enabling faster resolution of issues and reduced downtime, ensure application scalability and performance by adhering to cloud-native architecture best practicesSet high standards for engineering through code reviews, documentation, and building self-service automation tools, promoting a culture of continuous improvement and innovation, participate in and drive blameless postmortem sessions and sustainable incident response practiceCollaborate with cross-functional teams to identify opportunities for automation and implement solutions that improve efficiency and reduce errors, develop and maintain documentation for automation tools and frameworks, ensuring that they are easily accessible and understandable by all stakeholdersStay up-to-date with the latest trends and technologies in DevOps and cloud computing, and make recommendations for process and tool improvements, participate in training and mentoring programs to help develop the skills of other team membersDevelop and maintain relationships with key stakeholders, including engineers, product managers, and executives, to ensure that automation efforts are aligned with business goals and objectives, lead or participate in the development of business cases for automation initiativesIdentify and prioritize automation opportunities, and develop and implement plans to address them, collaborate with other teams to ensure that automation efforts are coordinated and aligned with overall business objectivesDevelop and maintain metrics and reporting to measure the effectiveness of automation efforts, and make recommendations for process and tool improvements based on data and analytics, participate in the development of the automation team's strategy and visionLead or participate in the development of automation roadmaps, and make recommendations for process and tool improvements to ensure that the team is aligned with overall business objectives, collaborate with other teams to ensure that automation efforts are coordinated and aligned with overall business objectives