Metsi Technologies

Global Systems Integrator | Digital Maturity | Data Center Automation | Hybrid Multicloud | Anything-as-a-Service

Senior GenAI, High Performance Computing Delivery Engineer

Engineer | Full Time | Remote | Team 51-200 | H1B No Sponsor

Location

Texas

Posted

4 days ago

Salary

$153.9K - $199.1K / year

Bachelor Degree | 7 yrs exp | English | Docker | Kubernetes | Linux | Node.js

Job Description

  • Deploy, configure, and validate GPU-accelerated compute clusters for AI, ML, and HPC with NVIDIA Base Command Manager (Warewulf and OpenHPC knowledge is a plus)
  • Perform benchmarking with HPL GPU, HPL-MxP, STREAM, NCCL, RCCL, OSU Microbenchmarks, and related tools
  • Produce as-built documentation and performance reports, and share best practices with the team
  • Configure and secure RHEL, Ubuntu, and Rocky Linux for GenAI or HPC workloads
  • Work directly with customers onsite (travel both regionally and across the U.S.)

Job Requirements

  • 7+ years with HPC or GenAI clusters, GPU-based systems, AI infrastructure, or related fields
  • Deep hands-on experience with GPU deployment, configuration, and multi-node testing using NVIDIA Base Command Manager
  • Proficiency with benchmarking tools: HPL, HPL-MxP, STREAM, NCCL, RCCL, OSU Microbenchmarks
  • Red Hat certification (RHCSA/RHCE) or 7+ years of relevant experience with Red Hat distributions
  • Experience with GenAI/HPC networking (InfiniBand and/or RoCE)
  • Experience working in Linux-based parallel computing environments at scale
  • Experience with containers, orchestration, and workload scheduling (Docker, Singularity/Apptainer, Kubernetes, Slurm)
  • Ability to travel up to 70% of the time across the U.S. as needed for projects
  • Strong customer-facing and communication skills

Benefits

  • Health insurance
  • Paid time off
  • Flexible work arrangements
  • Professional development
