ARCHIVED
This job listing has been archived and is no longer accepting applications.
MisuJob - AI Job Search Platform MisuJob

AI Model Serving Specialist

Rackspace

India - Remote Remote permanent

Posted: February 26, 2026

Interested in this position?

Create a free account to apply with AI-powered matching

Quick Summary

Enable enterprise customers to operationalize AI workloads by deploying and optimizing model-serving platforms within Rackspace’s Private Cloud and Hybrid environments.

Job Description

Role Purpose:
• Enable enterprise customers to operationalize AI workloads by deploying and optimizing model-serving platforms (e.g., NVIDIA Triton, vLLM, KServe) within Rackspace’s Private Cloud and Hybrid environments.
• This role bridges AI engineering and platform operations, ensuring secure, scalable, and cost-efficient inference services.


Key Responsibilities : - :
• Model Deployment & Optimization
• Package and deploy ML/LLM models on Triton, vLLM, or KServe within Kubernetes clusters.
• Tune performance (batching, KV-cache, TensorRT optimizations) for latency and throughput SLAs.
• Platform Integration
• Work with VMware VCF9, NSX-T, and vSAN ESA to ensure GPU resource allocation and multi-tenancy.
• Implement RBAC, encryption, and compliance controls for sovereign/private cloud customers.
• API & Service Enablement
• Integrate models with Rackspace’s Unified Inference API and API Gateway for multi-tenant routing.
• Support RAG and agentic workflows by connecting to vector databases and context stores.
• Observability & FinOps
• Configure telemetry for GPU utilization, request tracing, and error monitoring.
• Collaborate with FinOps to enable usage metering and chargeback reporting.
• Customer Engineering Support
• Assist solution architects in onboarding customers, creating reference patterns for BFSI, Healthcare, and other verticals.
• Provide troubleshooting and performance benchmarking guidance.
• Continuous Improvement
• Stay current with emerging model-serving frameworks and GPU acceleration techniques.
• Contribute to reusable Helm charts, operators, and automation scripts.


Required Skills & Experience:
• Hands-on experience with NVIDIA Triton, vLLM, or similar serving stacks.
• Strong knowledge of Kubernetes, GPU scheduling, and CUDA/MIG.
• Familiarity with VMware VCF9, NSX-T networking, and vSAN storage classes.
• Proficiency in Python and containerization (Docker).
• Understanding of observability stacks (Prometheus, Grafana) and FinOps principles.
• Exposure to RAG architectures, vector DBs, and secure multi-tenant environments.
• Excellent problem-solving and customer-facing communication skills.


Preferred Certifications:
• NVIDIA Certified Professional (AI/ML)
• Kubernetes Administrator (CKA)
• VMware VCF Specialist
• Rackspace AI Foundations (internal)


KPI's:
• Model deployment success rate and SLA compliance.
• Latency/throughput benchmarks per SKU.
• Customer satisfaction (NPS) for AI services.
• Efficiency in GPU utilization and cost optimization.


Physical Demands:
• General office environment: no special physical demands required.
• May require long periods of sitting and viewing a computer monitor.
• Schedule flexibility to include working weekends and/or evenings and holidays as required by the business for 24/7 operations.


Travel:
• As per business needs


About Rackspace Technology:
• We are the multicloud solutions experts.
• We combine our expertise with the world’s leading technologies — across applications, data and security — to deliver end-to-end solutions.
• We have a proven record of advising customers based on their business challenges, designing solutions that scale, building and managing those solutions, and optimizing returns into the future.
• Named a best place to work, year after year according to Fortune, Forbes and Glassdoor, we attract and develop world-class talent.
• Join us on our mission to embrace technology, empower customers and deliver the future.


More on Rackspace Technology:
• Though we’re all different, Rackers thrive through our connection to a central goal: to be a valued member of a winning team on an inspiring mission.
• We bring our whole selves to work every day.
• And we embrace the notion that unique perspectives fuel innovation and enable us to best serve our customers and communities around the globe.
• We welcome you to apply today and want you to know that we are committed to offering equal employment opportunity without regard to age, color, disability, gender reassignment or identity or expression, genetic information, marital or civil partner status, pregnancy or maternity status, military or veteran status, nationality, ethnic or national origin, race, religion or belief, sexual orientation, or any legally protected characteristic.
• If you have a disability or special need that requires accommodation, please let us know.

Why Apply Through MisuJob?

AI-Powered Job Matching: MisuJob uses advanced artificial intelligence to analyze your skills, experience, and career goals. Our matching algorithm compares your profile against thousands of job requirements to find positions where you have the highest chance of success. This saves you hours of manual job searching and ensures you only see relevant opportunities.

One-Click Applications: Once you create your profile, applying to jobs is effortless. Your resume and cover letter are automatically tailored to highlight the most relevant experience for each position. You can apply to multiple jobs in minutes, not hours.

Career Intelligence: Beyond job matching, MisuJob provides valuable career insights. See how your skills compare to market demands, identify skill gaps to address, and understand salary benchmarks for your experience level. Make data-driven decisions about your career path.

Frequently Asked Questions

How do I apply for this position?

Click the "Register to Apply" button above to create a free MisuJob account. Once registered, you can apply with one click and track your application status in your dashboard.

Is MisuJob free for job seekers?

Yes, MisuJob is completely free for job seekers. Create your profile, get matched with jobs, and apply without any cost. We help you find your dream job without any hidden fees.

How does AI matching work?

Our AI analyzes your resume, skills, and experience to understand your professional profile. It then compares this against job requirements using natural language processing to calculate a match percentage. Higher matches mean better fit for the role.

Can I apply to jobs in other countries?

Absolutely. MisuJob features jobs from companies worldwide, including remote positions. Filter by location or look for remote opportunities to find jobs that match your preferences.

Ready to Apply?

Join thousands of job seekers using MisuJob's AI to find and apply to their dream jobs automatically.

Register to Apply