Senior Deep Learning Framework Engineer
Location
California, North Carolina, Massachusetts, Texas
Posted
51 days ago
Salary
$152K - $241.5K / year
Bachelor's Degree, 5 yrs exp, Experience accepted, English, Python, PyTorch
Job Description
• Integrate new communication library features into AI frameworks, from PoC through performance analysis to production.
• Perform deep analysis of AI workloads and frameworks to identify multi-GPU communication requirements and opportunities.
• Collaborate hands-on with teams working on the latest AI models.
• Improve AI compilers to hide communications or perform automatic fusion.
• Conduct in-depth AI workload performance characterization on multi-GPU clusters.
• Design fault-tolerant and elastic solutions for large-scale or dynamic AI workloads.
• Author custom communication or fused compute-communication kernels to showcase ultimate performance on NV platforms.
• Influence the roadmap of communication libraries - NCCL & NVSHMEM.
• Collaborate with a very dynamic team across multiple time zones.
Job Requirements
- B.S., M.S., or Ph.D. in Computer Science or a related field (or equivalent experience), with 5+ years of software engineering and HPC/AI experience
- Development or integration experience with Deep Learning Frameworks such as PyTorch and JAX, and Inference Engines such as TRT-LLM, vLLM, and SGLang
- Rapid prototyping and development with Python, C++, CUDA or related DSLs (Triton, cuTe)
- Solid grasp of AI models, parallelisms, and/or compiler technologies (e.g. torch.compile)
- Experience conducting performance benchmarking on AI clusters.
- Familiarity with at least one performance profiler toolchain (PyTorch profiler, NVIDIA Nsight Systems)
- Understanding of HPC/AI communication concepts (one-sided vs. two-sided communication, elasticity, resiliency, topology discovery, etc.)
- Adaptability and passion to learn new areas and tools
- Flexibility to work and communicate effectively across different teams and timezones
Benefits
- equity
- benefits