Key Responsibilities
Monitor and manage application performance and availability using modern tools (Prometheus, Grafana, Datadog), and resolve issues proactively.
Incident management and root cause analysis: Respond to production incidents, conduct root cause analysis, and work with teams to implement long-term fixes.
Collaborate with development teams to design and implement scalable, reliable, and resilient applications and systems.
Automate routine tasks related to application monitoring, deployment, and scaling through scripting (Python, Bash) or configuration management tools (Ansible, Chef, Puppet).
Infrastructure as Code (IaC): Manage cloud infrastructure (AWS, GCP, Azure) through tools such as Terraform, CloudFormation, or equivalent.
Application performance tuning: Analyze and optimize application performance by improving response times, resource utilization, and database performance.
Build and maintain CI/CD pipelines to enable seamless deployments and integration.
Capacity planning and scaling: Predict future application capacity needs and scale infrastructure and services accordingly.
Documentation: Create and maintain runbooks, playbooks, and architecture diagrams for the support and operational needs of the application.
Qualifications:
Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent practical experience.
2+ years of experience in an SRE or DevOps role, with a focus on application support and reliability.
Experience with cloud platforms like AWS, Azure, or Google Cloud.
Proficiency in scripting languages such as Python or Bash.
Strong understanding of containerization and orchestration technologies (Docker, Kubernetes).
Experience with monitoring and logging tools such as Prometheus, Grafana, ELK Stack, and Datadog.
Knowledge of CI/CD pipelines and related tools (Jenkins, GitLab CI, CircleCI).
Understanding of web applications, databases (SQL/NoSQL), and networking fundamentals.
Excellent problem-solving skills with a strong focus on automation and efficiency.
Ability to join an on-call rotation for after-hours support as needed.
Nice to Have:
Experience with database management (MySQL, PostgreSQL, MongoDB).
Knowledge of security best practices.
Experience with performance testing tools (JMeter, Gatling).
About Virtusa
Teamwork, quality of life, professional and personal development: values that Virtusa is proud to embody. When you join us, you join a team of 30,000 people globally that cares about your growth, one that seeks to provide you with exciting projects, opportunities, and work with state-of-the-art technologies throughout your career with us.
Great minds, great potential: it all comes together at Virtusa. We value collaboration and the team environment of our company, and seek to provide great minds with a dynamic place to nurture new ideas and foster excellence.
Virtusa was founded on principles of equal opportunity for all, and so does not discriminate on the basis of race, religion, color, sex, gender identity, sexual orientation, age, non-disqualifying physical or mental disability, national origin, veteran status or any other basis covered by appropriate law. All employment is decided on the basis of qualifications, merit, and business need.