Lead Site Reliability Engineer Aws, Automation, Observability

Year    Hyderabad, Telangana - Secunderabad, Telangana, India

Job Description


Assume a critical role in defining the future of a globally recognized firm and have a direct and significant effect in a realm tailored for top achievers in site reliability. Job Summary As a Lead Site Reliability Engineer at JPMorgan Chase within JPMC Advanced Data Ecosystem (JADE) team on production support in cloud, where you\'ll be working with big data and cloud engineers to build the platform, pipeline, and monitoring systems to ensure the application landscape is designed to take most advantage of JPMC\'s global cloud solution. Job responsibilities Leads failure analysis / root cause analysis when required in a global SRE team and ensure the highest level of SLA through operational excellence Leads the design and development of SRE-related product technology roadmap, and transition to operations Provides support to develop & improve the quality of technical engineering documentation and support to drive the maturity of the software development lifecycle Performs deployment, administration, management, configuration, testing, and integration tasks related to the big data platforms in cloud environment Supports management of relationships with technology vendors Responsible for coaching and mentoring less experienced team members. Participates in 24x7 SRE on-call rotations and escalation workflows. Required qualifications, capabilities, and skills 10+ years of applied experience with Bachelor\'s degree in Computer Science, Information Technology, or equivalent technical field Deep understanding of SRE philosophy, technologies, platforms and tools, SLA management, incident resolution, and automation Hands on experience on AWS, Azure or GCP, managing operations of large-scale internet-centric production environments for application or infrastructure services In-Depth OS experience (RHEL, Ubuntu, Windows Server) with strong debugging, troubleshooting, and problem-solving skills Experiencein site reliability engineering in one of the following languages:Python, Java, shell scripting, PowerShell or GO Hand-on experience with big data technologies(Hadoop, Spark, Airflow etc), Snowflake, Databricks and cloud-based technologies and tools especially in deployment, monitoring and operations, such as Data Dog, Prometheus, Splunk, Elasticsearch, Grafana Strong working knowledge of modern development technologies and tools such Agile, CI/CD, Git, Terraform and Jenkins. Preferred qualifications, capabilities, and skills AWS, Terraform, Kubernetes, Snowflake, Databricks certification is highly desirable Deep knowledge of Internet protocols and web services technologies such as HTTP, DNS, TCP/UDP, JSON and REST Good understanding of networking protocols and cybersecurity, Load Balancing best practices in cloud environment

foundit

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Related Jobs

Job Detail

  • Job Id
    JD3135702
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Hyderabad, Telangana - Secunderabad, Telangana, India
  • Education
    Not mentioned
  • Experience
    Year