Senior Engineer High Performance Computing

Year    Bangalore, Karnataka, India

Job Description


:We are seeking a highly skilled HPC/GPU Operations Engineer to manage, optimize, and maintain high-performance computing (HPC) infrastructure, with a focus on GPU-accelerated workloads. The ideal candidate will be responsible for ensuring the reliability, efficiency, and scalability of HPC systems used for scientific computing, AI/ML, and data-intensive applications. With 6-10 years of experienceCareer Level - IC3Responsibilities:HPC & GPU System ManagementAdminister and maintain HPC clusters, GPU nodes, and high-speed interconnects.
Deploy and configure GPU-accelerated workloads for AI/ML, scientific computing, and simulations.
Monitor system performance, troubleshoot issues, and optimize resource utilization.Software & Middleware SupportInstall, configure, and maintain HPC-related software, libraries, and tools (CUDA, OpenMP, MPI, etc.).
Support containerized workflows using Docker, Singularity, or similar technologies.
Ensure compatibility of software stacks with GPU architectures (NVIDIA, AMD, Intel).Performance Optimization & Monitoring
Tune GPU and CPU performance for specific workloads, including benchmarking and profiling.
Utilize monitoring tools (e.g., Prometheus, Grafana, Slurm, Ganglia) to track system health and efficiency.
Optimize scheduling and resource allocation in workload managers (Slurm, PBS, LSF, etc.).Security & Compliance
Ensure system security and access control for HPC resources.
Apply software patches, firmware updates, and security best practices.
Assist in regulatory compliance for HPC environments.User Support & Documentation
Provide support to researchers, data scientists, and engineers using HPC resources.
Develop and maintain documentation on best practices, troubleshooting, and system usage.
Conduct training sessions or workshops on HPC/GPU computing.Required Qualifications
Technical Skills
Experience managing HPC clusters and GPU-based computing environments.
Proficiency in Linux system administration, scripting (Bash, Python), and automation (Ansible, Terraform).
Knowledge of parallel computing, GPU programming (CUDA, OpenCL), and HPC frameworks.
Familiarity with networking (Infiniband, RDMA), storage (Lustre, GPFS, NFS), and virtualization.About Us:As a world leader in cloud solutions, Oracle uses tomorrow's technology to tackle today's problems. True innovation starts with diverse perspectives and various abilities and backgrounds.When everyone's voice is heard, we're inspired to go beyond what's been done before. It's why we're committed to expanding our inclusive workforce that promotes diverse insights and perspectives.We've partnered with industry-leaders in almost every sector-and continue to thrive after 40+ years of change by operating with integrity.Oracle careers open the door to global opportunities where work-life balance flourishes. We offer a highly competitive suite of employee benefits designed on the principles of parity and consistency. We put our people first with flexible medical, life insurance and retirement options. We also encourage employees to give back to their communities through our volunteer programs.We're committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by calling +1 888 404 2494, option one.Disclaimer:Oracle is an Equal Employment Opportunity Employer*. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans' status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.

  • Which includes being a United States Affirmative Action Employer

Oracle

Beware of fraud agents! do not pay money to get a job

MNCJobsIndia.com will not be responsible for any payment made to a third-party. All Terms of Use are applicable.


Job Detail

  • Job Id
    JD3607255
  • Industry
    Not mentioned
  • Total Positions
    1
  • Job Type:
    Full Time
  • Salary:
    Not mentioned
  • Employment Status
    Permanent
  • Job Location
    Bangalore, Karnataka, India
  • Education
    Not mentioned
  • Experience
    Year