As a Site Reliability Engineer (SRE), you will design, implement, and maintain the infrastructure and services that power our healthcare applications and systems. You will play a crucial role in keeping these services running smoothly, with a focus on system stability, performance, and availability.

- Experience designing and measuring service level indicators (SLIs) and service level objectives (SLOs).
- Monitoring cloud-based systems (Google Cloud and/or Microsoft Azure).
- Experience handling operational issues such as production failures, infrastructure problems, security, and monitoring (ServiceNow, AppDynamics, AppInsights, GCP Operations Suite, Grafana, Splunk).
- Experience with programming languages (C#, Java, Python).
- Responsible for ensuring the availability, performance, and scalability of a website or application.
- Conduct Production Readiness Reviews (PRRs), a process that identifies the reliability needs of a service based on its specific details.
- Monitor systems, create incident response plans, and handle unexpected outages or performance issues.
- Experience with capacity planning and performance tuning so that the site can handle increased traffic without issue.
- Deep understanding of how distributed systems work, in order to troubleshoot and optimize them.
- Deep understanding of how different types of databases work, in order to troubleshoot effectively any issues that arise.