Sr DevOps Engineer

HMH

Pune, Maharashtra
Permanent
Full-time

9 days ago

HMH is a learning technology company committed to delivering connected solutions that engage learners, empower educators and improve student outcomes. As a leading provider of K–12 core curriculum, supplemental and intervention solutions, and professional learning services, HMH partners with educators and school districts to uncover solutions that unlock students’ potential and extend teachers’ capabilities.HMH serves more than 50 million students and 4 million educators in 150 countries. HMH Technology India Pvt. Ltd. is our technology and innovation arm in India focused on developing novel products and solutions using cutting-edge technology to better serve our clients globally. HMH aims to help employees grow as people, and not just as professionals. For more information, visitTechnical infrastructure

Cloud & Infrastructure: AWS EC2, Terraform Enterprise, Docker, Aurora, Mesos, Kubernetes, ELK (Elastic Search, Logstash & Kibana).
Observability & Automation: Grafana, Prometheus, Datadog, Telegraf, Runscope, Apollo, GraphQL.
Development Stack: Microservices architecture, Spring, Java & NodeJS, React, Express.js.
Data & Storage: Amazon RDS, Dynamo DB, Postgres, Oracle, MySQL, Influx DB, Linux, Jenkins, GitHub.
AI & Agentic Automation: AWS Bedrock LLMs and AWS Bedrock Engineer for building and integrating scalable, low-latency AI-driven automation capabilities.
You can read more on our Engineering Blog -

About the role:You will constantly be asking, what are the most important infrastructure problems we need to solve for today, that will increase the reliability and performance of our applications and infrastructure.

Identify and solve the most critical infrastructure challenges to improve system reliability, scalability, and performance.
Design, test, and implement AI-enhanced DevOps workflows, including autonomous agents for monitoring, remediation, and optimization.
Partner with SRE and development teams to build robust, self-service deployment pipelines and infrastructure tooling.
Evaluate new technologies to continuously improve system automation, cost efficiency, and security.
Work with AI-enhanced monitoring and self-healing infrastructure components powered by agentic patterns.

Key Responsibilities:

Build, maintain, and evolve cloud infrastructure with Infrastructure as Code (Terraform, CloudFormation).
Manage containerized workloads (Docker, Kubernetes) at scale, with a focus on extending capabilities through AI-driven orchestration.
Implement and maintain advanced monitoring, observability, and alerting systems enhanced with agent-based analytics.
Automate workflows to reduce manual intervention and accelerate delivery cycles.
Collaborate with cross-functional teams to ensure infrastructure meets the needs of high-availability, low-latency applications.
Regularly review and optimize existing architecture for cost, security, and performance improvements.

Skills & Experience:

6 to 10 years of hands-on SRE/DevOps experience in an Agile environment.
Proven ability to collaborate across engineering and operations, with pragmatic problem-solving.
Deep experience with AWS and infrastructure design patterns, and in recommending appropriate AWS services, including newer AI-focused tools like Bedrock.
Strong knowledge and skills of AI-enhanced DevOps workflows and agentic infrastructure models.
Able to quickly resolve outages, lead incident response, and restore service reliability.
Proficiency in diagnosing outages and restoring service with urgency.
Infrastructure as Code expertise (Terraform, CloudFormation).
Experience with containerization (Docker, Kubernetes).
Familiarity with CI/CD tools, scripting languages, and observability platforms.
Strong collaboration skills, with the ability to influence and guide best practices

Preferred Skills and Interests

Solid RDBMS experience (Postgres, MySQL, etc.), with tuning and performance expertise.
Strong Linux fundamentals.
Event-driven systems and message queue management
Security, including firewalls, load balancing, secret management.

HMH Technology Private Limited is an Equal Opportunity Employer and considers applicants for all positions without regard to race, colour, religion or belief, sex, age, national origin, citizenship status, marital status, military/veteran status, genetic information, sexual orientation, gender identity, physical or mental disability or any other characteristic protected by applicable laws. We are committed to creating a dynamic work environment that values diversity and inclusion, respect and integrity, customer focus, and innovation. For more information, visit . Follow us on Twitter, Facebook, LinkedIn, and YouTube.

HMH

Apply Now