This role will also be required to perform functional and hierarchical escalations to ensure we meet service level agreements with our customers and internal SLOs. As a Major Incident Manager, you will be required to assist in building the team's documentation and be a key participant in RCA meetings. The Major Incident Management team is part of the larger Operations Management Center, which is critical to ensuring timely resolution of production incidents and to performing continual service improvements that proactively detect and prevent customer-impacting incidents. The team actively focuses on production stability across the estate, with emphasis on reducing mean time to engage and mean time to resolve.
What you'll do:
- Support and promote the ITSM framework among teams and the wider IT organization. The Major Incident Manager is responsible for effectively managing the lifecycle of unplanned interruptions, which includes protecting the organization from reputational damage, productivity loss and regulatory failings.
- Learn new technical concepts quickly and translate technical jargon into plain language for stakeholders, filtering information quickly and tailoring communications to the recipient
- Perform initial incident assessment of technical and process deficiencies and follow up on investigations
- Take ownership of coordinating and driving technical service restoration plans across all application and infrastructure services.
- Provide leadership to multiple support teams during Major Incidents to actively drive towards immediate service restoration.
- Resolve escalated matters, provide approvals where required, and follow technical and functional escalation paths where necessary
- Follow the emergency change process to ensure corrective actions are implemented and managed effectively
- Ensure proper lifecycle transition from Incident to Problem Management processes including active participation in RCA meetings
- Contribute to the continual improvement of the Incident Management process.
- Perform Incident trend analysis and systemic Problem identification.
- Participate in specific Operations Management activities and special projects or initiatives as required.
What you'll need:
- Experience with ITIL methodologies and industry best practices.
- ITIL v3 certification, minimum foundation level.
- Excellent working knowledge of an ITSM platform - ServiceNow is preferred
- Strong background in managing financial and payments-related major incidents, with an understanding of the associated regulatory implications and time-dependent drivers.
- Experience leading bridge calls with many technical participants
- Experience and tenacity in driving resolution of complex issues
- Proven ability to multi-task and prioritise, always pushing towards timely resolution and keeping a level head while systematically driving technical teams to perform the work needed to restore service
- Proven ability to analyze and solve a wide range of technical problems, including the ability to perform first-level assessment of technical and process deficiencies and follow up on investigations
- Excellent written and verbal communication skills are a must. The ability to communicate technical concepts effectively to non-technical clients and partners is important, particularly at senior management level.
- Excellent analytical skills are required
- Experience leading teams to support meeting a contractual service level agreement
- Experience with Problem Management
- Experience with Change and Release Management
- Experience with Crisis Management and Communications