EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.
We are seeking a Lead High-Performance Computing Engineer experienced in managing and enhancing HPC environments.
The ideal candidate will bring a robust engineering background with proven experience in deploying and optimizing HPC infrastructures, who will thrive in our HPC infrastructure engineering team supporting scientific research teams.
Want more jobs like this?
Get jobs in Hyderabad, India delivered to your inbox every week.
#LI-DNI
Responsibilities
- Participate in incident resolution, software and hardware upgrades
- Support and maintain HPC infrastructure
- Implement Infrastructure as Code (IaC) automation
- Develop and review system operational procedures
- Lead troubleshooting efforts in complex systems
- Experience range of 8 to 12 years in HPC environments
- Proficiency in configuring and supporting HPC infrastructure
- Proficiency in Linux, including capabilities such as kernel modules compilation and using debugging tools like strace, coredump, tcpdump
- Background in job schedulers including IBM LSF and Slurm
- Expertise in Bright Cluster Manager including installation and configuration tasks
- Knowledge of GPFS and Lustre file systems
- Understanding of InfiniBand and OmniPath network interconnect technologies
- Familiarity with cloud-based HPC solutions
- Experience in system security and data protection best practices
- Opportunity to work on technical challenges that may impact across geographies
- Vast opportunities for self-development: online university, knowledge sharing opportunities globally, learning opportunities through external certifications
- Opportunity to share your ideas on international platforms
- Sponsored Tech Talks & Hackathons
- Unlimited access to LinkedIn learning solutions
- Possibility to relocate to any EPAM office for short and long-term projects
- Focused individual development
- Benefit package:
- Health benefits
- Retirement benefits
- Paid time off
- Flexible benefits
- Forums to explore beyond work passion (CSR, photography, painting, sports, etc.)