In this key role, you\xe2\x80\x99ll support the improvement of non-functional and operational characteristics such as availability, performance, efficiency, change management, monitoring, security, incident response, and capacity planning of our products and services
You\xe2\x80\x99ll enjoy significant stakeholder interaction, working in collaboration with engineers to ensure a principled approach to deliver change in a safe and secure way
This is a chance to join an inclusive team with a collaborative ethos and a commitment to innovation and professional development
We\'re offering this role at associate level
What you\'ll doAs our Site Reliability Engineer, you\xe2\x80\x99ll work alongside colleagues and feature team members to meet defined service level objectives and continually improve systems and environments. You\xe2\x80\x99ll proactively contribute new ideas and innovations to meet short term and longer term goals whilst at the same time balancing and managing risk.You\xe2\x80\x99ll also be accountable for the day-to-day health of both production and non-production environments, responding to incidents as required.A typical day will involve:
Providing structure and supporting release processes, suggesting and making improvements where possible
Supporting the clear communication and frequent update of incident status to other teams and customers
Providing technical expertise and input to establish the risk tolerance of products and services
Supporting live production services, Improve root cause analysis and identification of mitigations
The skills you\'ll needWe\xe2\x80\x99re looking for someone with strong knowledge of reliability systems thinking and experience of configuring and tuning standard observability tooling. You\xe2\x80\x99ll need at least five years of experience of taking on the support of new application and major releases into production. We\xe2\x80\x99ll also look for financial services knowledge, and the ability to identify wider business impact, risk and opportunity, and make connections across key outputs and processes.You\'ll also need:
Good knowledge of risk process
Strong troubleshooting skills using Kepner and Tregoe
Experience of utilising tools and technology across the software development lifecycle
Experience of using a data driven and scientific approach to fact finding
Strong communication skills with the ability to proactively engage with a wide range of stakeholders