Metsi Technologies
Global Systems Integrator | Digital Maturity | Data Center Automation | Hybrid Multicloud | Anything-as-a-Service
Senior GenAI, High Performance Computing Delivery Engineer
Location
Texas
Posted
4 days ago
Salary
$153.9K - $199.1K / year
Bachelor Degree7 yrs expEnglishDockerKubernetesLinuxNode.js
Job Description
• Deploy, configure, and validate GPU accelerated compute clusters for AI, ML, and HPC with NVIDIA Base Command Manager (Warewulf and OpenHPC knowledge are a plus)
• Perform benchmarking with HPL GPU, HPL MxP, STREAM, NCCL, RCCL, OSU Microbenchmarks, and related tools
• Produce as-built documentation, performance reports, and share best practices amongst the team.
• Configure and secure RHEL, Ubuntu, Rocky for GenAI or HPC workloads
• Work directly with customers onsite (travel both regionally and across the U.S.)
Job Requirements
- 7+ years with HPC or GenAI clusters, GPU based systems, AI infrastructure, or related fields
- Deep hands on experience with GPU deployment, configuration, and multi-node testing using NVIDIA Base Command Manager
- Proficiency with benchmarking tools: HPL, STREAM, NCCL, RCCL, MxP, OSU Microbenchmarks
- Red Hat certification (RHCSA/RHCE) or 7+ years of relevant RH distros experience
- Experience with GenAI/HPC networking (InfiniBand and/or RoCE)
- Experience working in Linux based parallel computing environments at scale
- Experience with containers/orchestration (Docker, Singularity/Apptainer, Kubernetes, Slurm)
- Ability to travel up to 70% of the time across the U.S. as needed for projects
- Strong customer facing and communication skills
Benefits
- Health insurance
- Paid time off
- Flexible work arrangements
- Professional development
Related Guides
Related Categories
Related Job Pages
More Engineer Jobs
Engineer4 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor
Senior Electrical Engineer leading high voltage protection and control systems design
Sharepoint Engineer
Accenture Federal ServicesWe believe in the power of change, harnessed in ways that matter for our country and communities.
Engineer4 days ago
Full TimeRemoteTeam 10,001+Since 2017H1B No Sponsor
SharePoint Engineer designing and maintaining environments for federal EHRM program
District of Columbia + 1 moreAll locations: District of Columbia, Washington
$106.3K - $221.1K / year
Engineer4 days ago
Full TimeRemoteTeam 51-200Since 2020H1B No Sponsor
Go Core Client Engineer designing and implementing Go-based client code at Tailscale
Distributed SystemsGo
Engineer4 days ago
Full TimeRemoteTeam 1,001-5,000Since 2002H1B Sponsor
Lead Identity Engineer specializing in Microsoft Entra ID and Okta for Bravo Communications.
CloudPython
United States