This list contains only the countries for which job offers have been published in the selected language (e.g., in the French version, only job offers written in French are displayed, and in the English version, only those in English).
At BlackRock, technology resilience and operational excellence are foundational. As a VP of DevOps & Application Support with a strong Java engineering background, you will lead the reliability, automation, and production stability of mission‑critical enterprise-scale software. You will combine deep systems engineering, strong operational instincts, and hands‑on automation expertise with the ability to understand and troubleshoot Java‑based services & modern tech stack. This hybrid profile strengthens your capacity to drive end‑to‑end service ownership—runtime operations to incident leadership.
Job Responsibility:
Act as the senior owner for production stability and service reliability
Implement and refine monitoring, alerting, and observability using Prometheus, Grafana, Splunk, and OpenTelemetry
Ensure production readiness through capacity planning, change reviews, and disaster‑recovery exercises
Drive long‑term remediation by translating operational learnings into engineering backlog priorities
Influence service design toward operability, observability, and supportability
Lead major incident management, including triage, stakeholder communication, and root‑cause analysis
Work closely with L1/L2/L3 support teams to define escalation paths, SLAs, and handover processes
Design and promote agentic automation solutions to reduce repetitive manual tasks across DevOps and Application Support workflows
Champion a shift from reactive support toward proactive, intelligent operations
Bridge gaps between Engineering and Operations through shared technical context
Requirements:
B.S./M.S. in Computer Science, Engineering, or related discipline
8+ years of professional experience across DevOps, SRE, Product Engineering and/or Production Support roles
Strong knowledge of Java and Spring-based services, and enterprise integration patterns
Strong hands‑on experience with observability and monitoring stacks
Exposure to AI‑assisted tooling for operations, automation, or observability
Solid understanding of distributed systems and event‑driven architectures
Proven incident leadership and calm decision‑making under pressure
Excellent communication skills and ability to influence engineering and operations teams
Experience with Sybase, SQL Server, Snowflake, Cassandra, Redis, and Kafka is a plus
Nice to have:
Experience with service resiliency patterns or multi‑region deployments
Experience with Kubernetes, Docker, or cloud-native environments (AWS/GCP)
Familiarity with financial systems, accounting, or investment technology