Lead
StoneX Group View all jobs
- Pune, Maharashtra
- Permanent
- Full-time
- Develop and maintain Infrastructure‑as‑Code patterns for MQ deployments (Ansible/Terraform/Kubernetes Operators depending on architecture direction).
- Implement robust observability, including metrics, logs, dashboards, and alerting (Prometheus, Grafana, Datadog, etc.).
- Partner with the architecture function on broader messaging strategy alignment, looking at features like Streaming Queues to enable bridging flow to Kafka/Redpanda
- Manage the lifecycle of global queue managers — installation, patching, upgrades, configuration, security hardening.
- Handle incident diagnosis and resolution, including hung queue managers, performance degradation, and integration issues
- Maintain consistent operational standards across all environments (Dev/Pre‑Prod/DR/Prod).
- Build pipelines for automated provisioning, validation, and deployment of MQ resources.
- Create self‑service workflows allowing application teams to safely request/configure queues without manual intervention.
- Contribute to platform SRE maturity through error‑budgeting, reliability scorecards, and toil‑reduction initiatives.
- Implement and maintain LDAP / AD integration, RBAC, TLS, mTLS, and certificate automation.
- Ensure audit, compliance, and security standards are met across all MQ assets.
- Work with network engineering on secure, low‑latency connectivity and cross‑region routing.
- Partner with globally distributed teams (including AES, Data Engineering, Networks, and Architecture).
- Contribute to technical designs, standards, and best‑practice playbooks.
- Mentor junior engineers and help shape the future internal MQ engineering capability.
- Participate in strategic IBMMQ discussions and roadmap initiatives.
- 4–10+ years working with IBM MQ in engineering, DevOps, SRE, or platform‑support roles.
- Strong understanding of MQ queue managers, channels, listeners, clustering, security exits, and message flows.
- Hands‑on experience with MQ NativeHA, HA clusters, DR patterns, replication and failover.
- Experience integrating MQ with Prometheus, Datadog, or equivalent monitoring stacks.
- Proficiency with Linux administration, scripting (Python, Shell), and system performance tuning.
- Demonstrated experience in automation, CI/CD, and Infrastructure‑as‑Code.
- Background with containerised MQ (MQ Operator, Kubernetes/OpenShift).
- Experience designing self‑service platforms or internal developer portals.
- Exposure to other messaging technologies (Kafka/Redpanda, Redhat AMQ7) to support future messaging strategy alignment.
- Familiarity with enterprise security controls, LDAP/SSO integration, certificates, and encryption.
- Experience in financial services or other highly regulated sectors.
- Strong problem‑solving mindset with a drive to modernise legacy systems.
- Comfortable owning a critical platform end‑to‑end.
- Excellent communication skills; able to influence and collaborate across engineering, operations, and architecture groups.
- Self‑starter with the ability to work independently and proactively.
- Passion for automation, reliability engineering, and designing for scale.
- A stable, fully automated MQ estate with reduced operational burdens.
- Successful rollout of NativeHA and cross‑region architectures.
- Realisation of self‑service configuration flows for application teams.
- Clear observability, security, and governance maturity across the platform.