Cloud Site Reliability Engineer (design And Implement)

Year    Pune, Maharashtra, India

Job Description


As a SRE your job entails, architecting, Implementing and managing heterogeneous & diverse tech stacks spanning multiple datacentres and across various cloud providers. Implement and manage enterprise level software, providing hosting and domain related services to millions of customers across the globe. Your role as a SRE is primarily focussed on helping business and development teams grow, roll out new features to the market with a strong commitment to quality and availability. At the same time, you will be an expert detective, diving into complex escalations involving enterprise level technical challenges, Engineering problems, customer connects and platform growth concerns etc. This role will involve the management of short & long term projects under SLA and adherence to deadlines.Key Responsibilities:

  • Architect and maintain mission critical global hybrid infrastructure spanning multiple datacenters & cloud providers, leveraging primarily open source technologies.
  • Design next generation scalable systems which are highly available, resilient and capable of handling high volume Internet facing web traffic.
  • Be responsible for downtimes and maintain the product SLA, capacity planning of the systems and overall health & performance of large scale production systems.
  • Participate in weekly 24/7 oncall rotation, solving escalated tickets, resolve outages and debug production issues.
  • Work closely with various stakeholders like Engineering, Monitoring and Operations teams, Noc / Soc, customers & business development teams.
  • Challenge the status quo. Empower development teams by transitioning legacy methodologies, platform & technologies to devops principles, cloud native technologies and newer ecosystems without much friction.
  • Strict adherence to automating routine tasks and scripting, with a low tolerance to manual processes.
  • Needs to be data & metric driven. Develop tools and platforms for better system observability & insights.
  • Writing design decision documentation and is keen on implementing overall production best practices with a strong focus on security & encourage right Devops Workflows.
  • Design, develop, and deploy modular cloud-based systems Educating teams on the implementation of new cloud technologies and initiatives
  • Develop and maintain cloud solutions in accordance with best practices.
Requirements
  • At least 5+ years of experience with Cloud SRE role (OCI, AWS, GCP) is mandatory.
  • Experience on Configuration Management tools such as Puppet, Ansible, Terraform is MUST.
  • Experience with Container such as Kubernetes, Docker is required.
  • Experience with scripting in Python, Golang to write scripts and automate routine tasks.
  • Proven work experience as a Cloud Engineer or similar role.
  • Experience in Load Balancer such as HAProxy, Nginx, F5, dnsdist, Varnish.
  • Experience in Webservers such as Apache, Nginx.
  • AWS and/or GCP certifications preferred, not a must.

Agivant Technologies India Private Limited

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD3595601
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Pune, Maharashtra, India
  • Education
    Not mentioned
  • Experience
    Year