Chief HPC Engineer

AT EPAM Systems

San Javier, Chile

We are seeking a Chief HPC Engineer with robust technical skills in HPC infrastructure to manage day-to-day operations and engineering activities within our HPC environment. The ideal candidate will have a strong engineering background with significant hands-on experience in deployment and optimization. This leadership role requires strategic oversight and a proactive approach to maintaining and enhancing system performance and reliability.

Responsibilities

  • Support and oversee HPC infrastructure
  • Implement Infrastructure as Code (IaC) for system automation
  • Lead incident resolution efforts, as well as software and hardware upgrades
  • Guide and mentor a team of HPC engineers
  • Strategize and implement system scalability and efficiency improvements
  • Ensure system security and compliance with industry standards
  • Develop and monitor key performance indicators to assess system health
  • Foster strong vendor relationships for system hardware and software procurement
  • Lead research and adoption of new technologies to keep the infrastructure at the cutting edge
  • Facilitate collaboration across departments to align HPC strategies with organizational goals
Requirements

  • Minimum of 7 years of experience as an HPC Engineer
  • At least 2 years of relevant leadership experience
  • Proficiency in Linux (any RPM-based distribution), including compiling kernel modules and using debugging tools such as strace, tcpdump, and core dump analysis
  • Experienced in managing HPC job schedulers such as IBM LSF and Slurm
  • Skilled in configuring and implementing Bright Cluster Manager
  • Understanding of both GPFS and Lustre file systems
  • Familiarity with InfiniBand and Omni-Path network interconnect technologies
  • Fluent English communication skills at a C1 level or higher
Nice to have
  • Proficiency in hardware diagnostics, upgrades, and tuning, including InfiniBand HCAs and disk arrays from Lustre, VAST, and IBM
  • Capability to utilize infrastructure monitoring tools like Zabbix, Splunk, or Grafana
  • Understanding of EasyBuild
  • Experience working within a GxP environment
  • Familiarity with project and service management tools like Jira and ServiceNow
We offer
  • Improved medical coverage - EPAMers are eligible to participate in a supplementary health insurance program with standard industry coverage, with the Company funding 100% of the monthly premium
  • Lunch Allowance - You will receive a daily allowance of CLP $7,000 per working day. Enjoy a nice meal on us
  • Allowance for internet and electricity - You will receive an allowance of CLP $15,000 per month to cover internet and electricity expenses
  • National Holiday Bonus - We celebrate joining the Chilean market, which is why all our employees receive a bonus of CLP $86,646 in September
  • Christmas Bonus - You will receive an End of Year bonus of CLP $170,539. It will be paid during the month of December, to ensure you have a Happy Holiday!
  • Learning Culture - We want you to be the best version of yourself. That is why we offer unlimited access to learning platforms, a wide range of internal courses, and all the knowledge you need to grow professionally
  • Additional Income - Besides your regular salary, you will also have the chance to earn extra income by referring talent, serving as a technical interviewer, and in many other ways
  • Are you open to relocation? - If you want to relocate to another country and we have the right project, we will assist you every step of the way to help you and your family reach your new home

Client-provided location(s): Chile
Job ID: EPAM-epamgdo_blt260576d386ad0146_en-us_Other_Chile
Employment Type: Other