DevOps Lead

Delhi, Delhi, India

Job Description

We're a European company with a mission to revolutionize the way Brazilian customers engage with financial and entertainment services. We take Agile seriously and give leaders and teams a lot of autonomy to execute the best strategy for the company. We are passionate about what we do, committed to simplicity, and eager to meet people who share this vision so we can build together!
Our mission:
Provide technology and services to create unique digital entertainment experiences.
Position Overview:
We are seeking an experienced DevOps Lead to support our enterprise infrastructure. In this role, you will be responsible for supporting our systems, including participation in an on-call rotation. Beyond support, you will help improve incident response, enhance observability, and work with technical teams to strengthen the resilience and performance of our workloads. Your work will involve close cooperation with cross-functional teams, bridging the gap between development and operations and cultivating a culture of collaboration and continuous improvement.
Responsibilities:
As a key member of our team, your responsibilities will include:
  • Providing crucial support for production systems and playing a vital role in issue triage.
  • Actively participating in our on-call rotation, promptly responding to availability incidents and assisting service engineers.
  • Establishing comprehensive monitoring and alerting systems to detect and respond to issues in real-time.
  • Monitoring systems to ensure adherence to system SLO/SLA, reviewing and following up on production incidents.
  • Collaborating with cross-functional teams to enhance incident response and resolution times, conducting thorough post-mortems for continuous improvement.
  • Proactively identifying and addressing system reliability issues and performance bottlenecks, and implementing preventive measures to minimize downtime.
  • Working closely with engineering teams to identify and address system limitations.
  • Participating in the Change Management process by reviewing RFCs to ensure adherence to the "Definition of Done" and actively supporting software and hardware deployments.
  • Championing automation in workflows and tools to improve the reliability and scalability of services.
  • Developing and implementing comprehensive monitoring and alerting systems using Datadog for real-time issue detection and response.
  • Writing and reviewing code, creating documentation, and troubleshooting distributed systems.
  • Collaborating with teams to optimize incident response and resolution times, and conducting post-mortems for ongoing improvement.
What you need:
  • 3+ years of hands-on experience in DevOps, Site Reliability Engineering (SRE), or Security roles.
  • 1+ years of experience leading teams or working in a similar senior role.
  • Proven track record of building and optimizing CI/CD pipelines to streamline software delivery processes.
  • Mandatory expertise in Azure, demonstrating a comprehensive understanding of cloud infrastructure.
  • Essential experience in managing and monitoring Kubernetes applications.
  • Strong troubleshooting skills in networking and infrastructure issues, ensuring uninterrupted system operations.
  • Prior experience in successfully managing customer-facing systems in a 24/7 environment, including handling escalations with a focus on customer satisfaction.
  • Proficiency in scripting and automation using Terraform.
  • Familiarity with triaging and escalation policies/protocols using OpsGenie or PagerDuty.
  • Hands-on experience with monitoring and observability tools such as Datadog, Prometheus, Grafana, and the ELK stack.
  • Excellent communication and documentation skills to facilitate clear and effective collaboration within the team.
  • Any additional experience with Cloudflare monitoring is considered a significant plus.
What we offer:
  • Startup environment: challenging, collaborative, fast and fun, where you will have the opportunity to learn, innovate and work with colleagues of different nationalities
  • Autonomy: freedom to contribute ideas and improve processes
  • Competitive salary package
  • A remote-first culture
  • A culture of 1:1s and feedback loops
  • Access to free online courses to foster your personal growth

Job Detail

  • Job Id
    JD3332656
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type
    Full Time
  • Salary
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Delhi, Delhi, India
  • Education
    Not mentioned
  • Experience
    Year