Senior DL Algorithms Engineer – Inference Performance
Location
California
Posted
23 days ago
Salary
$184K - $356.5K / year
Postgraduate Degree5 yrs expEnglishMicroservicesPy Torch
Job Description
• Implement language and multimodal model inference as part of NVIDIA Inference Microservices (NIMs).
• Contribute new features, fix bugs and deliver production code to TRT-LLM, NVIDIA’s open-source inference serving library.
• Profile and analyze bottlenecks across the full inference stack to push the boundaries of inference performance.
• Benchmark state-of-the-art offerings in various DL models inference and perform competitive analysis for NVIDIA SW/HW stack.
• Collaborate heavily with other SW/HW co-design teams to enable the creation of the next generation of AI-powered services.
Job Requirements
- PhD in CS, EE or CSEE or equivalent experience.
- 5+ years of experience.
- Strong background in deep learning and neural networks, in particular inference.
- Experience with performance profiling, analysis and optimization, especially for GPU-based applications.
- Proficient in C++, PyTorch or equivalent frameworks.
- Deep understanding of computer architecture, and familiarity with the fundamentals of GPU architecture.
- Proven experience with processor and system-level performance optimization.
- Deep understanding of modern LLM architectures.
- Strong fundamentals in algorithms.
- GPU programming experience (CUDA or OpenCL) is a plus
Benefits
- equity
- benefits
Related Guides
Related Categories
Related Job Pages
More Engineer Jobs
Forward Deployed Engineer
Titan AIBuilding mobile RPG Games with AI ✨ Companions 🤖. First game: Hell Rush is coming in July 2024.
Engineer23 days ago
Full TimeRemoteTeam 1-10
Forward Deployed Engineer bridging tech and banking clients
PythonTypeScript
Redis/Valkey Contributing Engineer
PerconaScaling, Securing, and Managing the Best Open Source Databases on the Most Popular Platforms
Engineer24 days ago
Full TimeRemoteTeam 201-500Since 2006H1B No Sponsor
Contributing Engineer for Percona’s Redis/Valkey product in a remote capacity
Open SourceRedisRustGo
United States
Engineer24 days ago
Full TimeRemoteTeam 5,001-10,000H1B Sponsor
Engineer applying expertise to enhance reliability for industrial clients
Alabama + 1 moreAll locations: Alabama, Florida
Software Engineer – Production Cost
Switzerland Global EnterpriseWe support Swiss SMEs in their international business and help innovative foreign companies to establish in Switzerland.
Engineer24 days ago
Full TimeRemoteTeam 51-200Since 1927H1B No Sponsor
Software Engineer advancing power system planning software for GE Vernova
JavaPython