Site Reliability Engineer

Year    Hyderabad, Telangana, India

Job Description


  • Managing 24/7 production support team (L2, L2, L3) in Banking domain or credit card domain
  • Awareness in Software Development Life Cycle knowledge on waterfall and Agile methodology
  • Strong expertise in development experience in Java (banking applications, preferable credit card applications) and willing to provide the support activities in shifts.
  • Java, Java script and Cloud Support
  • Managing major incidents, Coordination incident management, problem management, infra teams & Preparing Major Incident report
  • Experience in troubleshoot, resolve, and communicate complex incidents and deep underlying issues, Performance analysis and fine tuning of real time and batch systems. Perform deep-dive analysis for all major incidents, coordinating with all support teams, Infra teams to identify root cause in both Application and Infrastructure areas.
  • Experience in Executing continuous service improvement plan, monitoring, process improvement and automation.
  • Experience in Major Change management process to make sure no critical incidents reported during weekend cutover activities.
  • Experience in managing Capacity Monitoring, Server Patching (Production environment):
  • Experience in managing Bank wide DR, BCP coordination / scheduling:
  • Good communication skills and stakeholder management
  • Follow the standards of the Bank.
  • Closely work with Dev teams.
Requirements
  • Must have at least 6+ years experience in BFSI sector providing application support, supporting complex transactional system with multiple upstream-downstream systems involving online (API driven) and Offline (batch processing) interactions.
Solid experience in container image deploy and release management with OpenShift and Kubernetes.Must have strong automation and scripting skills - proficiency in shell, groovy & python.Good knowledge on monitoring tools - Prometheus, Grafana and ELKBackground in large-scale system administration and familiarity with SRE principles and Release EngineeringHave advanced Linux System Administrator skills and advanced configuration management systems skills.In-depth knowledge in infrastructure areas such as virtual server technologies, networking, firewall, internet protocols
  • Root cause analysis, management communication and client relationship management in partnership with Infrastructure Service Support team members.
  • Ensures all production changes are made in accordance with life-cycle methodology and risk guidelines.
  • Application Support, Deployment of Release, patches & fixes on Platform

RiskInsight Consulting Pvt Ltd

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD3327167
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Hyderabad, Telangana, India
  • Education
    Not mentioned
  • Experience
    Year