System Administrator, NOC

Evolent Health

  • Pune, Maharashtra
  • Permanent
  • Full-time
  • 1 day ago
Your Future Evolves HereEvolent Health has a bold mission to change the health of the nation by changing the way health care is delivered. Our pursuit of this mission is the driving force that brings us to work each day. We believe in embracing new ideas, challenging ourselves and failing forward. We respect and celebrate individual talents and team wins. We have fun while working hard and Evolenteers often make a difference working in everything from scrubs to jeans.Are we growing? Absolutely and Globally. In 2021 we grew our teams by almost 50% and continue to grow even more in 2022. Are we recognized as a company you are supported by for your career and growth, and a great place to work? Definitely. Evolent Health International (Pune, India) has been certified as “Great Places to Work” in 2021. In 2020 and 2021 Evolent in the U.S. was both named Best Company for Women to Advance list by Parity.org and earned a perfect score on the Human Rights Campaign (HRC) Foundation’s Corporate Equality Index (CEI). This index is the nation's foremost benchmarking survey and report measuring corporate policies and practices related to LGBTQ+ workplace equality.We recognize employees that live our values, give back to our communities each year, and are champions for bringing our whole selves to work each day. If you’re looking for a place where your work can be personally and professionally rewarding, don’t just join a company with a mission. Join a mission with a company behind it.What You’ll Be Doing:System Administrator, NOCWe are looking for a skilled System Administrator to join our NOC team. The successful candidate will be responsible for managing and maintaining our infrastructure, ensuring high availability and performance.Responsibilities
  • Monitor cloud infrastructure and perform level 1 and level 2 troubleshooting
  • Configure and manage various monitoring tools.
  • Perform Windows patching and vulnerability remediation.
  • Troubleshoot infrastructure issues and Active Directory access.
  • Develop and implement automation solutions.
  • Troubleshoot infrastructure issues and Active Directory access.
  • Provide analysis and insights to various teams.
  • Create, operationalize, and manage NOC Standard Operating Procedures (SOPs), including quarterly reviews and adjustments.
  • Perform system health checks and maintain related reports and communications.
  • Participate in stand-up meetings, conference calls, bridge/incident calls, documentation, and Root Cause Analysis (RCA) preparation.
  • Implement and maintain security measures.
  • Collaborate with other IT teams to ensure seamless operations.
  • Lead troubleshooting efforts for complex issues.
  • Perform advanced system maintenance and upgrades.
  • Mentor and train junior administrators.
  • Investigate issues outside of defined SOPs.
  • Work closely with technical teams (e.g., developers, sys admins, DBAs, infra teams).
  • Execute corrective actions identified during post-incident reviews and RCA.
  • Identify and resolve performance bottlenecks as part of NOC.
  • Develop and implement automation solutions to streamline NOC operations using Terraform, GitHub, and Ansible.
  • Collaborate with cross-functional teams to identify automation opportunities and improve operational efficiency.
  • Maintain and update automation scripts and configurations to ensure optimal performance and reliability.
  • Monitor and troubleshoot automation processes to quickly resolve any issues.
  • Document automation workflows and provide training to team members as needed.
  • Apply SRE principles to enhance system reliability and performance.
  • Utilize PowerShell scripting to automate routine tasks and improve system management.
Qualifications:
  • Bachelor’s degree in computer science or a related field.
  • 5+ years of experience in monitoring tool configuration and system administration.
  • 2+ years of experience in Windows/Linux troubleshooting and patching.
  • Proven experience with Terraform, GitHub, and Ansible.
  • Knowledge of Site Reliability Engineering (SRE) principles.
  • Proficiency in PowerShell scripting.
  • Strong problem-solving skills and attention to detail.
  • Excellent communication and teamwork abilities.
  • Ability to work in a fast-paced environment and manage multiple tasks simultaneously.
  • Excellent communication skills, both verbal and written.
  • Experience with diagnostic and monitoring tools such as SolarWinds, Dotcom Monitor, Datadog, Nagios, Prometheus etc.
  • Flexibility to provide on-call support during weekends if required and adapt to a constantly changing environment.
  • Experience with Windows server patching tools such as Ivanti, Endpoint Central, etc.
  • Experience leveraging agile methodologies (e.g., Scrum Ban) to manage project work.
  • Familiarity with the healthcare/health insurance industry.
  • Familiarity with service desk platforms (e.g., JIRA, Remedy, ServiceNow).
  • Experience working closely with technical teams (e.g., developers, sys admins, DBAs, infra teams).
  • Ability to review vulnerability reports and prioritize vulnerabilities for remediation based on severity/impact.
  • Highly collaborative attitude.
  • Low ego and a team player.
  • Relevant certifications (e.g., AZ-900, ITIL, CCNA, Datadog, SolarWinds).
#LI-remoteMandatory Requirements:Employees must have a high-speed broadband internet connection with a minimum speed of 50 Mbps and the ability to set up a wired connection to their home network to ensure effective remote work. These requirements may be updated as needed by the business.Evolent Health is an equal opportunity employer and considers all qualified applicants equally without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, veteran status, or disability status.

Evolent Health