Rackspace Technology

Realize the full value of the cloud.

AI Model Serving Specialist

Artificial IntelligenceArtificial IntelligenceFull TimeRemoteTeam 5,001-10,000Since 1998H1B No SponsorCompany SiteLinkedIn

Location

United States

Posted

89 days ago

Salary

$82.3K - $140.6K / year

Bachelor DegreeEnglishCloudDockerGrafanaKubernetesPrometheusPythonVmware

Job Description

• Enable enterprise customers to operationalize AI workloads by deploying and optimizing model-serving platforms (e.g., NVIDIA Triton, vLLM, KServe) within Rackspace’s Private Cloud and Hybrid environments. • Package and deploy ML/LLM models on Triton, vLLM, or KServe within Kubernetes clusters. • Tune performance for latency and throughput SLAs. • Work with VMware VCF9, NSX-T, and vSAN ESA to ensure GPU resource allocation and multi-tenancy. • Implement RBAC, encryption, and compliance controls for sovereign/private cloud customers. • Integrate models with Rackspace’s Unified Inference API and API Gateway for multi-tenant routing. • Support RAG and agentic workflows by connecting to vector databases and context stores. • Configure telemetry for GPU utilization, request tracing, and error monitoring.

Job Requirements

  • Hands-on experience with **NVIDIA Triton**, **vLLM**, or similar serving stacks.
  • Strong knowledge of **Kubernetes**, **GPU scheduling**, and **CUDA/MIG**.
  • Familiarity with **VMware VCF9**, NSX-T networking, and vSAN storage classes.
  • Proficiency in **Python** and containerization (Docker).
  • Understanding of **observability stacks** (Prometheus, Grafana) and **FinOps principles**.
  • Exposure to **RAG architectures**, vector DBs, and secure multi-tenant environments.
  • Excellent problem-solving and customer-facing communication skills.

Benefits

  • Our compensation reflects the cost of labor across several US geographic markets.
  • Compensation package may also include incentive compensation opportunities in the form of annual bonus or incentives, equity awards and an Employee Stock Purchase Plan (ESPP).
  • Learn more about benefits at Rackspace.

Related Job Pages

More Artificial Intelligence Jobs

AI Intern

CBIZ

Trusted local advisors enhanced by specialists nationwide. (NYSE: CBZ)

Artificial Intelligence89 days ago
InternshipRemoteTeam 10,001+Since 1996

AI Intern supporting design and development of automated workflows for finance

United States
$20 - $40 / hour
Full TimeRemoteTeam 10,001+Since 2015H1B Sponsor

Private Cloud AI Sales Specialist specializing in AI solutions at Hewlett Packard Enterprise

Cloud
Connecticut + 2 moreAll locations: Connecticut, New York, Pennsylvania
$216K - $507K / year

Business Growth & Development Intern, AI Product

FocusKPI, Inc.

FocusKPI is a data science and technology firm specializing in predictive analytics practice and methodologies.

Artificial Intelligence89 days ago
InternshipRemoteTeam 11-50Since 2010H1B No Sponsor

Business Growth & Development Intern supporting market outreach for AI consulting firm

United States

Marketing Intern, AI Product

FocusKPI, Inc.

FocusKPI is a data science and technology firm specializing in predictive analytics practice and methodologies.

Artificial Intelligence89 days ago
InternshipRemoteTeam 11-50Since 2010H1B No Sponsor

Marketing Intern supporting AI tool adoption at FocusKPI

United States