Senior DL Algorithms Engineer – Inference Performance

Engineer | Full Time | Remote | Team 10,001+ | Since 1993 | H1B Sponsor

Location

California

Posted

23 days ago

Salary

$184K - $356.5K / year

Postgraduate Degree | 5 yrs exp | English | Microservices | PyTorch

Job Description

  • Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs).
  • Contribute new features, fix bugs, and deliver production code to TRT-LLM, NVIDIA’s open-source inference serving library.
  • Profile and analyze bottlenecks across the full inference stack to push the boundaries of inference performance.
  • Benchmark state-of-the-art offerings for inference across various DL models and perform competitive analysis for the NVIDIA SW/HW stack.
  • Collaborate heavily with other SW/HW co-design teams to enable the creation of the next generation of AI-powered services.

Job Requirements

  • PhD in CS, EE or CSEE or equivalent experience.
  • 5+ years of experience.
  • Strong background in deep learning and neural networks, in particular inference.
  • Experience with performance profiling, analysis and optimization, especially for GPU-based applications.
  • Proficient in C++, PyTorch or equivalent frameworks.
  • Deep understanding of computer architecture, and familiarity with the fundamentals of GPU architecture.
  • Proven experience with processor and system-level performance optimization.
  • Deep understanding of modern LLM architectures.
  • Strong fundamentals in algorithms.
  • GPU programming experience (CUDA or OpenCL) is a plus.

Benefits

  • equity
  • benefits

More Engineer Jobs

Forward Deployed Engineer

Titan AI

Building mobile RPG Games with AI ✨ Companions 🤖. First game: Hell Rush is coming in July 2024.

Engineer | 23 days ago
Full Time | Remote | Team 1-10

Forward Deployed Engineer bridging tech and banking clients

Python | TypeScript
United States
$150K - $200K / year

Redis/Valkey Contributing Engineer

Percona

Scaling, Securing, and Managing the Best Open Source Databases on the Most Popular Platforms

Engineer | 24 days ago
Full Time | Remote | Team 201-500 | Since 2006 | H1B No Sponsor

Contributing Engineer for Percona’s Redis/Valkey product in a remote capacity

Open Source | Redis | Rust | Go
United States
Full Time | Remote | Team 5,001-10,000 | H1B Sponsor

Engineer applying expertise to enhance reliability for industrial clients

Alabama, Florida

Software Engineer – Production Cost

Switzerland Global Enterprise

We support Swiss SMEs in their international business and help innovative foreign companies to establish in Switzerland.

Engineer | 24 days ago
Full Time | Remote | Team 51-200 | Since 1927 | H1B No Sponsor

Software Engineer advancing power system planning software for GE Vernova

Java | Python
United States
$94.9K - $158.1K / year