Skip to main contentA logo with &quat;the muse&quat; in dark blue text.

Chief HPC Engineer

AT EPAM Systems
EPAM Systems

Chief HPC Engineer

San Javier, Chile

We are currently seeking an experienced Chief HPC Engineer to manage the daily operations and engineering activities within our HPC environment.
The perfect candidate should be proficient in engineering with substantial expertise in setting up and enhancing HPC infrastructure. This role will involve collaboration with our L3 HPC infrastructure engineering team to facilitate the use of an HPC cluster by our Scientific research team. Priority will be given to candidates residing in India, though the position is available to candidates from any location.

#LI-DNI

Responsibilities

  • Maintenance and support of the HPC infrastructure
  • Implementation of infrastructure automation through IaC (Infrastructure as Code)
  • Participation in software and hardware upgrades while resolving incidents
  • Management of job scheduling and resource distribution with HPC job schedulers
  • Configuration and installation of Bright Cluster Manager
  • Optimization and maintenance of GPFS/Lustre file systems
  • Supervision of InfiniBand/OmniPath network interconnect configurations
Requirements

Want more jobs like this?

Get jobs in San Javier, Chile delivered to your inbox every week.

By signing up, you agree to our Terms of Service & Privacy Policy.
  • 10+ years as a general technical expert in HPC
  • Background in engineering or HPC system development
  • Experience in configuring and supporting HPC infrastructure
  • Proficiency in Linux (any rpm-based) including knowledge of kernel modules compilation and debugging tools such as strace, coredump, and tcpdump
  • Skills in managing HPC job schedulers including IBM LSF and Slurm
  • Competency in configuring and installing Bright Cluster Manager
  • Familiarity with GPFS and Lustre file systems
  • Understanding of InfiniBand and OmniPath network interconnect technologies
Nice to have
  • Understanding of hardware diagnostics, upgrades, and tuning including HCA InfiniBand and disk arrays from Lustre, Vast, IBM
  • Skills in infrastructure monitoring using Zabbix, Splunk, or Grafana
  • Familiarity with Easybuild
  • Experience in a GxP environment
  • Capability to use Jira and ServiceNow
We offer
  • Improved medical coverage - EPAMers are eligible to participate in a supplementary health insurance program that shall have the usual coverage in the industry, with the Company funding 100% of the value of the monthly premium for participation
  • Lunch Allowance - You will receive a daily allowance of CLP $ 7.000 per working day. Enjoy a nice meal on us
  • Allowance for internet and electricity - You will receive an allowance of CLP$15.000 per month to cover internet and electricity expense
  • National Holiday Bonus - We celebrate joining the Chilean Market. That is why all our employees will receive a bonus of CLP $86,646 in September
  • Christmas Bonus - You will receive an End of Year bonus of CLP $170,539. It will be paid during the month of December, to ensure you have a Happy Holiday!
  • Learning Culture - We want you to be the best version of yourself, that is why we offer unlimited access to learning platforms, a wide range of internal courses, and all the knowledge you need to grow professionally
  • Additional Income - Besides your regular salary, you will also have the chance to earn extra income by referring talent, being a technical interviewer, and many more ways
  • Are you open to relocation? - If you want to relocate to another country and we have the right project, we will assist you every step of the way, to help you and your family, reach your new home

Client-provided location(s): Chile
Job ID: EPAM-epamgdo_blt6b7d1e3c48bc2971_en-us_Other_Chile
Employment Type: Other