Rackspace Technology
Realize the full value of the cloud.
AI Model Serving Specialist
Artificial IntelligenceArtificial IntelligenceFull TimeRemoteTeam 5,001-10,000Since 1998H1B No SponsorCompany SiteLinkedIn
Location
United States
Posted
89 days ago
Salary
$82.3K - $140.6K / year
Bachelor DegreeEnglishCloudDockerGrafanaKubernetesPrometheusPythonVmware
Job Description
• Enable enterprise customers to operationalize AI workloads by deploying and optimizing model-serving platforms (e.g., NVIDIA Triton, vLLM, KServe) within Rackspace’s Private Cloud and Hybrid environments.
• Package and deploy ML/LLM models on Triton, vLLM, or KServe within Kubernetes clusters.
• Tune performance for latency and throughput SLAs.
• Work with VMware VCF9, NSX-T, and vSAN ESA to ensure GPU resource allocation and multi-tenancy.
• Implement RBAC, encryption, and compliance controls for sovereign/private cloud customers.
• Integrate models with Rackspace’s Unified Inference API and API Gateway for multi-tenant routing.
• Support RAG and agentic workflows by connecting to vector databases and context stores.
• Configure telemetry for GPU utilization, request tracing, and error monitoring.
Job Requirements
- Hands-on experience with **NVIDIA Triton**, **vLLM**, or similar serving stacks.
- Strong knowledge of **Kubernetes**, **GPU scheduling**, and **CUDA/MIG**.
- Familiarity with **VMware VCF9**, NSX-T networking, and vSAN storage classes.
- Proficiency in **Python** and containerization (Docker).
- Understanding of **observability stacks** (Prometheus, Grafana) and **FinOps principles**.
- Exposure to **RAG architectures**, vector DBs, and secure multi-tenant environments.
- Excellent problem-solving and customer-facing communication skills.
Benefits
- Our compensation reflects the cost of labor across several US geographic markets.
- Compensation package may also include incentive compensation opportunities in the form of annual bonus or incentives, equity awards and an Employee Stock Purchase Plan (ESPP).
- Learn more about benefits at Rackspace.
Related Guides
Related Categories
Related Job Pages
More Artificial Intelligence Jobs
Artificial Intelligence89 days ago
InternshipRemoteTeam 10,001+Since 1996
AI Intern supporting design and development of automated workflows for finance
Artificial Intelligence89 days ago
Full TimeRemoteTeam 10,001+Since 2015H1B Sponsor
Private Cloud AI Sales Specialist specializing in AI solutions at Hewlett Packard Enterprise
Cloud
Business Growth & Development Intern, AI Product
FocusKPI, Inc.FocusKPI is a data science and technology firm specializing in predictive analytics practice and methodologies.
Artificial Intelligence89 days ago
InternshipRemoteTeam 11-50Since 2010H1B No Sponsor
Business Growth & Development Intern supporting market outreach for AI consulting firm
United States
Marketing Intern, AI Product
FocusKPI, Inc.FocusKPI is a data science and technology firm specializing in predictive analytics practice and methodologies.
Artificial Intelligence89 days ago
InternshipRemoteTeam 11-50Since 2010H1B No Sponsor
Marketing Intern supporting AI tool adoption at FocusKPI
United States