Site Reliability Engineer

Year    Hyderabad, Telangana, India

Job Description


  • Managing 24/7 production support team (L2, L2, L3) in Banking domain or credit card domain
  • Awareness in Software Development Life Cycle knowledge on waterfall and Agile methodology
  • Strong expertise in development experience in Java (banking applications, preferable credit card applications) and willing to provide the support activities in shifts.
  • Java, Java script and Cloud Support
  • Managing major incidents, Coordination incident management, problem management, infra teams & Preparing Major Incident report
  • Experience in troubleshoot, resolve, and communicate complex incidents and deep underlying issues, Performance analysis and fine tuning of real time and batch systems. Perform deep-dive analysis for all major incidents, coordinating with all support teams, Infra teams to identify root cause in both Application and Infrastructure areas.
  • Experience in Executing continuous service improvement plan, monitoring, process improvement and automation.
  • Experience in Major Change management process to make sure no critical incidents reported during weekend cutover activities.
  • Experience in managing Capacity Monitoring, Server Patching (Production environment):
  • Experience in managing Bank wide DR, BCP coordination / scheduling:
  • Good communication skills and stakeholder management
  • Follow the standards of the Bank.
  • Closely work with Dev teams.
Requirements
  • Must have at least 6+ years\xe2\x80\x99 experience in BFSI sector providing application support, supporting complex transactional system with multiple upstream-downstream systems involving online (API driven) and Offline (batch processing) interactions.
Solid experience in container image deploy and release management with OpenShift and Kubernetes.Must have strong automation and scripting skills \xe2\x80\x93 proficiency in shell, groovy & python.Good knowledge on monitoring tools \xe2\x80\x93 Prometheus, Grafana and ELKBackground in large-scale system administration and familiarity with SRE principles and Release EngineeringHave advanced Linux System Administrator skills and advanced configuration management systems skills.In-depth knowledge in infrastructure areas such as virtual server technologies, networking, firewall, internet protocols
  • Root cause analysis, management communication and client relationship management in partnership with Infrastructure Service Support team members.
  • Ensures all production changes are made in accordance with life-cycle methodology and risk guidelines.
  • Application Support, Deployment of Release, patches & fixes on Platform

RiskInsight Consulting Pvt Ltd

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD3327167
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Hyderabad, Telangana, India
  • Education
    Not mentioned
  • Experience
    Year