Site Reliability Engineer, Avp

Year Bangalore, Karnataka, India

NatWest Group

175 Current Jobs Openings

Apply Now

Job Description

Join us as a Site Reliability Engineer

Youll be managing the provision of stable, resilient, reliable applications with the end goal of minimising disruption to Customer & Colleague Journeys (CCJ)
Well look to you to identify and automate manual tasks and implement observability solutions, ensuring a thorough understanding of CCJ across applications
This is a great chance to work in a supportive environment with opportunities to advance your personal and career development
We're offering this role at associate vice president level

What you'll doAs a Site Reliability Engineer, youll collaborate with feature teams to understand application changes, participate in delivery activities, and address production issues to assist in the delivery of change that does not negatively affect the customer experience. Youll also help to monitor and manage cloud costs, recommending optimisations and cost-saving measures.Youll be responding to, managing, and resolving incidents in a timely manner, performing root cause analysis and driving improvements to prevent recurrence. As well as this, youll automate routine operational tasks and cloud infrastructure provisioning using IaC tools.Youll also be:

Conducting capacity planning exercises to make sure cloud resources can handle anticipated traffic spikes and growth
Implementing and maintaining monitoring, logging, and alerting systems to provide insights into cloud infrastructure and applications' health and performance
Delivering automation solutions to minimise and eliminate manual tasks associated with maintaining and supporting the applications
Ensuring an in-depth understanding of the full tech stack on which the application resides and depends on
Identifying alerting and monitoring requirements for an application, based on sound understanding of customer journeys
Evaluating the resilience of the end-to-end tech stack on which the applications depend, and addressing weaknesses
Seeking to reduce frequency of hand-offs in the end-to-end resolution of customer-impacting incidents

The skills you'll needTo succeed in this role, youll need experience of supporting live production services serving customer journeys with a demonstrable knowledge of ITIL processes and IT Security principles along with tools and techniques to prevent compliance breaches.On top of this, youll bring hands on experience with Azure Cloud and full-stack observability using tools such as Log Analytics, Application Insights, Grafana, CloudWatch, Prometheus and Splunk.Youll also need:

Strong verbal and written communication skills
Strong hands on experience with cloud platforms including AWS and GCP, and their services such as S3, Lambda and Kubernetes
Experience of managing production systems and incidents with a focus on minimising downtime and improving system resilience
Strong troubleshooting skills for cloud infrastructure and application performance issues
Experience of networking in the cloud and familiarity with Chaos Engineering principles and tools

Hours 45Job Posting Closing Date: 16/04/2025

NatWest Group

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.

Job Detail

Job Id

JD3642606
Industry

Not mentioned
Total Positions

1
Job Type:

Full Time
Salary:

Not mentioned
Employment Status

Permanent
Job Location

Bangalore, Karnataka, India
Education

Not mentioned
Experience

Year

Jobs by Function

Popular Job Skills

Popular Industries

Popular Cities

Jobseekers

Employers