Cloud Site Reliability Engineer (design And Implement)

Year    Pune, Maharashtra, India

Job Description


:We are looking for a highly skilled Site Reliability Engineer (SRE) with strong engineering and architectural expertise to design, implement, and manage large-scale, mission-critical infrastructure across multiple data centers and cloud providers.As an SRE, you will be responsible for architecting and optimizing our global infrastructure, enabling development teams to roll out new features efficiently while maintaining high availability and reliability. You will be hands-on with automation, performance tuning, infrastructure scalability, and cloud-native technologies to ensure a seamless user experience for millions of customers.Key Responsibilities

  • Architect and implement highly scalable, fault-tolerant, and distributed systems across multi-cloud (OCI, AWS, GCP) and on-premise environments using modern DevOps and SRE principles.
  • Design and deploy next-generation cloud infrastructure with a strong focus on automation, self-healing systems, and performance optimization.
  • Develop and maintain infrastructure-as-code (IaC) using Terraform and configuration management tools such as Ansible and Puppet for automated provisioning and orchestration.
  • Build and optimize containerized environments using Kubernetes and Docker for seamless deployment and scaling.
  • Drive performance, scalability, and security improvements across our cloud and on-prem infrastructure, ensuring high availability and disaster recovery capabilities.
  • Monitor, troubleshoot, and resolve complex system issues by implementing advanced observability solutions, logging, and real-time monitoring frameworks.
  • Develop and enforce SRE best practices, including SLI/SLO definition, capacity planning, and incident management strategies.
  • Eliminate toil and automate repetitive tasks using scripting languages such as Python, Golang, or Shell scripting to improve operational efficiency.
  • Collaborate closely with engineering, architecture, and security teams to improve system resiliency, optimize application performance, and streamline CI/CD workflows.
  • Lead the transition of legacy systems to modern, cloud-native architectures, advocating for DevOps and infrastructure automation.
  • Participate in 24/7 on-call rotations, ensuring rapid response to critical incidents and driving post-mortem analysis for continuous improvement.
Requirements
  • 7+ years of hands-on experience in a Site Reliability Engineering (SRE) role, with a strong focus on designing, implementing, and managing cloud-native infrastructure.
  • Proficient with any cloud platform (preferably OCI) xe2x80x94not just operational experience but actual design and implementation expertise.
  • Proven experience in building, deploying, and optimizing infrastructure-as-code (IaC) using Terraform.
  • Strong automation mindset with proficiency in Ansible, Puppet, or other configuration management tools.
  • Hands-on experience with container orchestration using Kubernetes, Docker, and microservices architecture.
  • Advanced scripting and automation skills in Python, Golang, or Shell scripting to eliminate manual operations.
  • Working knowledge of load balancing technologies (HAProxy, Nginx, F5, Varnish, dnsdist) and web servers (Apache, Nginx).
  • Strong understanding of networking, distributed systems, and observability tools (Prometheus, Grafana, ELK stack, Datadog).
  • Experience in designing and implementing highly available, scalable, and secure architectures across cloud and hybrid environments.
  • AWS and/or GCP certifications are a plus but not required.
This is not a support-focused rolexe2x80x94we are looking for engineers who have built, deployed, and optimized complex distributed systems from the ground up.

Agivant

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD3605505
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Pune, Maharashtra, India
  • Education
    Not mentioned
  • Experience
    Year