Senior Engineer Software
Albertsons Companies India
- Bangalore, Karnataka
- Permanent
- Full-time
- Lead technical support for applications and programs currently in production.
- Analyze complex problems to determine solutions to problems to be implemented permanently into production.
- Prepare for Production releases by ensuring appropriate alerts, dashboards, KB articles, Confluence pages and knowledge sharing are properly executed.
- Ensures dashboards are being monitored daily to detect anomalies and corrections are shared with appropriate teams and team members.
- Check that alerts are being responded to appropriately.
- Ensures approvement agendas for services are being maintained and acted on with Development Engineering and DevOps Engineering partners. Experience in Observability and Monitoring initiatives as platform Engineer.
- Troubleshoot platform issues and restore service by resolving customer-facing incidents
- Development and implementation of build release pipelines with accountability for managing deployment schedules, issues, risks, and impediments.
- Agile development experience with team member accountability for commitment and delivery each sprint.
- Troubleshoot and implement corrections to problems associated with connectivity between the supported applications and the clients they serve
- Provide technical guidance, in the diagnosis of issues as they arise in support of critical applications
- Drive collaboration sessions among IT and business groups to facilitate optimal support and operation of the relevant applications
- Provide Site Reliability Engineering techniques such as observability, alerting and performance tuning
- Contribute to the design, implementation, and enhancement of critical applications
- Perform proactive analysis and troubleshooting to predict and prevent production incidents
- Define and contribute to monitoring capabilities for critical applications
- Collaborate with key vendors on functional, performance and capacity improvements
- Design and build tools to automate support and monitoring functions
- Ensure that all implementations of observability meet the requirements prescribed by IT Services through the effective implementation or use of approved processes, methodologies, and deliverables.
- Provide expertise and build solutions for observability applications as well as system integration with internal systems and external vendors.
- Able to provide coding and technical direction to less experienced staff or develops highly complex original code.
- Track infrastructure delivery and dependencies to implementation.
- Experience with gathering and organizing large volume of data to use for instrumentation into an Enterprise Observability solution.
- Experience with recommending baseline monitoring thresholds, and performance monitoring KPIs and SLAs.
- Experience with installing agents, forwarders, APIs, performance monitoring alerts, dashboards, and data trend analysis.
- Good Knowledge and understanding of Azure foundation components e.g. App GW, APIM, Virtual Network, NSG, Load Balancer, Azure VM etc. is required.
- Team-oriented, positively contributing to team morale and willing to help.
- Learning-Focused, finding ways to improve in their field and use positive constructive feedback to grow personally and professionally
- Think strategically and proactively anticipate future problems, needs or changes in the work
- Experience with Databases Azure SQL, PostgreSQL, MySQL, MongoDB, TSDB or similar databases.
- Experience with designing and implementing user-friendly web interfaces
- Experience with developing and maintaining server-side applications
- Experience with managing databases and optimizing their performance
- Hands on experience in developing MCP clients using React
- Experience with integrating the web applications with Grafana
- Ensuring application responsiveness
- Ensuring cross-platform optimization of web applications for mobile devices
- Keeping up with the latest developments in web application technology
- Experience with developing back-end web applications
- Experience with developing front-end web architectures
- Azure/GCPhands-on with details around pulling observability data from managed services
- Experience must include at least one of the following languages: React, Node JS, Javascript, HTML, CSS, Java (required), Desired--Python, Go
- Golang/Python coding or from solutioning background with experience on SRE development and Open telemetry implementation
- Design and develop standard Grafana dashboards for critical metrics for various Azure/GCP services using the observability data
- Knowledge of monitoring tools such as Log Analytics, App Dynamics, Grafana, Prometheus, Splunk, and Sitescope
- Experience in working with ServiceNow or similar Service Management tools
- Familiarity with Cloud technologies in Azure, AWS, and Google Cloud
- Experience on PCF, Docker, Kubernetes platform is required.
- Experience with DevOps and CI/CD tools and processes is required.
- Experience in high-performance and high-frequency data streaming and health confirmation techniques (using Kafka etc.) and handling large volume of batch data is strongly preferred
- In-depth advanced knowledge of current monitoring tools
- In-depth advanced knowledge of at least one major cloud platform and Service Container/Instance concepts
- In-depth advanced knowledge of querying and inspection techniques for service and other types of logs
- In-depth advanced knowledge of the full software development lifecycle and software development methodologies (Agile).
- Strong ability to understand client expectations and to resolve issues that may affect service.
- Strong ability to mentor, coach and train other application support engineers
- Self-starter, with a demonstrated ability to learn beyond formal training with a strong aptitude for delivering quality products.
- 4-year degree (Computer Science, Information Systems, or relational functional field) and/or equivalent combination of education or work experience.
- 6-9+ years of experience on integration engineering related to Observability/Monitoring framework with open source technologies such as Grafana, Mimir, Loki, Tempo, Fluentbit, Vector etc.,
- Hands-on experience with Tools and Technology is preferred.
- 5+ years of developer experience in as a Full stack engineer is required.
- Experience working with Open-source platforms and Open Telemetry libraries e.g. Grafana is preferred.