MisuJob - AI Job Search Platform MisuJob

Jobs

Browse 250+ jobs updated daily

Latest Job Openings

US, CA, Santa Clara permanent
Computer SciencePythonGoRustCUDAGPU ProgrammingPerformance EngineeringLLM InferenceML techniquesDSLsContainerization

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-perf...

April 2, 2026 View Details
U.S. Remote Remote permanent
Technical LeadershipTeam ManagementInference OptimizationFramework MasteryEvaluating AI OutputvLLMKV cache reuseSpeculative DecodingContinuous BatchingLMCacheNIXL

WEKA is architecting a new approach to the enterprise data stack built for the age of reasoning. NeuralMesh by WEKAsets the standard for agentic AI data infrastructure with a cloud and AI-native softw...

March 30, 2026 View Details
US, CA, Santa Clara permanent
Go-To-Market StrategyEcosystem EngagementEnablementRoadmap InfluenceRevenue GrowthEnterprise SalesPartner ManagementTechnical EnablementPipeline ManagementWritten Feedback

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. To...

March 30, 2026 View Details
U.S. Remote Remote permanent
Technical LeadershipTeam ManagementInference OptimizationFramework MasteryDeep Domain ExpertiseStack KnowledgeBackend EngineeringInfrastructure

WEKA is architecting a new approach to the enterprise data stack built for the age of reasoning. NeuralMesh by WEKAsets the standard for agentic AI data infrastructure with a cloud and AI-native softw...

March 26, 2026 View Details
Germany, Munich permanent
C++CUDAAI InferencePyTorchMLGPU-Centric Performance EngineeringAutomotive SystemsLinux DevelopmentGPU ProgrammingTensorRT

NVIDIA is synonymous with innovation, boasting trailblazers who are shaping the world with their forward-thinking approaches. This is your chance to be part of a vibrant community that's redefining th...

March 20, 2026 View Details
Canada, Toronto permanent
Computer SciencePythonGPU ProgrammingCUDAPerformance EngineeringLLM InferencevLLMSGLangDistributed Systems

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-perf...

March 17, 2026 View Details
Remote job permanent
C++Problem Solving TechniquesJob Schedule OptimizationMemory managementSystem engineeringPredictable ResultsTechnical ownershipInfra trust

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

March 11, 2026 View Details
Remote job permanent
C++Problem Solving TechniquesSystem ArchitecturePerformance OptimizationMemory ManagementProduction DeploymentEnglish CommunicationAI Inference

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

March 11, 2026 View Details
US, CA, Santa Clara permanent
GPU Performance Engineeringdeepset CloudTensorRT-LLMvLLMSGLangSoftware DevelopmentPython ProgrammingC++ ProgrammingPerformance OptimizationDistributed Inference

We optimize and benchmark GenAI inference on NVIDIA's latest accelerators, defining the industry’s performance standards across language models, video generation, and speech workloads. We work directl...

March 9, 2026 View Details
US, CA, Santa Clara permanent
Performance OptimizationBenchmarkingTensorRT-LLMSGLangvLLMDeep LearningQuantizationSchedulingMemory ManagementGPU Performance EngineeringDistributed Inference

We optimize and benchmark GenAI inference on NVIDIA's latest accelerators, defining the industry’s performance standards across language models, video generation, and speech workloads. We work directl...

March 5, 2026 View Details
Dallas, TX permanent
Performance OptimizationConcurrencyMultithreadingMemory ManagementSpeedBenchmarkingReliabilityAPI ArchitectureImage ProcessingC++OpenCV

54,000 new photos are taken every second, and 600 hours of video are uploaded every minute. At Topaz Labs, we help over 1 million paying customers (including teams at Google, Nvidia, and NASA) maximiz...

March 5, 2026 View Details
Archived 5 Locations permanent
Computer ScienceCompiler TechnologiesPythonDeep Learning ModelsPerformance AnalysisGPU ArchitectureCUDALLM Inference

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited moder...

February 25, 2026 View Details
Archived US, CA, Santa Clara permanent
Computer SciencePythonCUDAGPU ProgrammingPerformance EngineeringLLM RetrievalvLLMSGLangDockerKubernetesML Compilers

We are seeking highly skilled and motivated software engineers to join us and build AI inference systems that serve large-scale models with extreme efficiency. You’ll architect and implement high-perf...

February 24, 2026 View Details
Archived 6 Locations permanent
Computer ScienceEfficient ExecutionDeep LearningGPU ArchitecturePython ProgrammingPerformance AnalysisAPI DesignParallel ComputingCollaboration

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited moder...

February 24, 2026 View Details
Archived 2 Locations permanent
Systems EngineeringLLM BenchmarksvLLMSGLangGPU WorkloadsCUDARustC++PythonDistributed SystemsConcurrencyProfiling

NVIDIA is the platform for every new AI-powered application. We seek a Principal Software Engineer - AI Inference to advance open-source LLM serving. This role involves contributing to upstream infere...

February 23, 2026 View Details
Archived 5 Locations permanent
Computer ScienceDeep LearningRate OptimizationGPU ArchitecturePerformance AnalysisPython ProgrammingCUDADeep Learning Models

NVIDIA's invention of the GPU 1999 sparked the growth of the PC gaming market, redefined modern computer graphics, and revolutionized parallel computing. More recently, GPU deep learning ignited moder...

February 23, 2026 View Details

AI Inference Engineer

quadric, Inc

Archived Burlingame, California, United States Hybrid permanent
Model OptimizationModel QuantizationModel ConversionFull-Stack ProficiencyPython ProgrammingCollaborationProblem SolvingTechnical Support

Quadric has created an innovative general purpose neural processing unit (GPNPU) architecture. Quadric's co-optimized software and hardware is targeted to run neural network (NN) inference workloads i...

November 24, 2025 View Details
Archived Remote job permanent
AIInferenceMachine LearningEdge DevicesFrameworksCollaborationCommunicationProduct DevelopmentProduction-Ready Systems

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 18, 2026 View Details
Archived Remote job permanent
AIInferenceMachine LearningEdge DevicesLlama-indexggmlONNXCross-functional CollaborationProduct DevelopmentEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
AIMachine LearningEdge AIC++JavaScript

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
AIInferenceEdge DevicesMachine LearningGGMLONNXCollaborationResearchProduct DevelopmentLlama.cpp

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
AIMachine LearningC++JavaScriptEdge ComputingProduct ManagementCross-functional CollaborationProduction-ready SystemsEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
AIInferenceMachine LearningEdge DevicesOnnxLlama.cppggmlCollaborationTeam CoordinationProduction-Ready Systems

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
Remote WorkMachine LearningEdge AILlama.cppggmlONNXCross-functional CollaborationTeam CoordinationProduction-ready Systems

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
AIInferenceMachine LearningEdge DevicesFrameworksCollaborationResearchProduct CoordinationProduction-ready Systems

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
Inference PipelinesMachine LearningEdge DevicesC++JavaScriptTeam LeadershipProduction Systems

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
Machine LearningC++InferenceDeep LearningTransformersLLMsGPUModel DeploymentProgramming SkillsResearch Collaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
Machine LearningDeep LearningC++JavaScriptLlama.cppggmlOnnxTransformersLLMsDiffusion ModelsAPI Integration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
C++LLama.cppggmlOnnxDeep LearningTransformersLLMsDiffusion Models

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
C++PythonLlama.cppggmlOnnxDeep LearningTransformersLLMsDiffusion ModelsAPI Integration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
Machine LearningC++Deep LearningTransformersLLMsInference EnginesJavascriptAPIsCollaborationEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
Machine LearningDeep LearningC++JavascriptLlama.cppggmlOnnxTransformersLLMsDiffusion Models

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
C++LLama.cppggmlOnnxAIMachine LearningDeep LearningTransformersLLMsDiffusion ModelsInference Engines

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
C++Llama.cppggmlONNXInference PipelinesDeep LearningTransformersLLMsDiffusion ModelsCollaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
C++PythonLLMDeep LearningTransformersInferenceEdge ComputingGPU ArchitectureAPI IntegrationCollaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
Machine LearningDeep LearningModel DeploymentC++JavaScriptLlama.cppggmlOnnxTransformersLLMsDiffusion Models

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
Machine LearningInference PipelinesC++Llama.cppggmlOnnxDeep LearningTransformersLLMsDiffusion Models

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
C++Llama.cppggmlOnnxDeep LearningTransformersLLMsDiffusion Models

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
C++Llama.cppggmlONNXDeep LearningTransformersLLMsDiffusion Models

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
C++Llama.cppggmlOnnxDeep LearningTransformersLLMsDiffusion Models

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
C++Llama.cppggmlOnnxDeep LearningTransformersLLMsDiffusion ModelsEdge DevicesProgramming Skills

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
Machine LearningDeep LearningTransformersLLMsDiffusion ModelsC++JavascriptInference EnginesGPU ArchitecturesModel Deployment

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
C++Llama-indexOnnxDeep LearningTransformersLLMsDiffusion ModelsModel DeploymentAI Integrationggml

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived Remote job permanent
Machine LearningInferenceC++JavaScriptLlama-indexggmlOnnxTransformersLLMsDiffusion Models

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

February 17, 2026 View Details
Archived United States (Remote) Remote permanent
Machine LearningPythonPyTorchInferenceDiffusion Transformer ModelsGPU OptimizationDockerKubernetesAWSGCPAzureOpenCV

Generative AI Inference Engineer <Remote> About the role: We are seeking passionate Machine Learning Engineers to join our Inference team, focusing on the creative applications of generative ...

November 17, 2025 View Details
Archived Sydney, NSW, Australia permanent
Applied ScienceAI InferenceDistributed SystemsModel ServingInference OptimizationQuantizationHardware AccelerationAlgorithm OptimizationBenchmarkingLLM Deployment

We invite you to join NinjaTech AI as an Applied Scientist specialized in AI inference and distributed systems to help optimize and scale our AI models for production environments. You will work at t...

September 14, 2025 View Details
Archived San Francisco, California, United States permanent
PythonSoftware DevelopmentProduct ManagementTechnical Customer SuccessPre-sales Solution EngineeringDockerProduction Deployment

ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, ...

November 4, 2025 View Details
Archived Bangalore, Karnataka permanent
Enterprise SalesCloudAIInferenceGPU InfrastructureLLMsData PlatformsCustomer Relationship ManagementContract NegotiationValue/Benefit Communication

About us: Paytm is India's payment Super App offering consumers and merchants most comprehensive payment services. Pioneer of mobile QR payments revolution in India, today, Paytm is India’s largest pa...

December 19, 2025 View Details
Archived Location not specified Remote
C++AI InferenceEdge DevicesOptimizationRuntimeStabilityCollaborationResearchEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 22, 2026 View Details

Lead AI Inference Engineer

Tether Operations Limited

Archived Location not specified Remote
AI SystemsMachine LearningEdge ComputingC++JavaScriptCollaborationTeam LeadershipProduction SystemsEdge AI

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 22, 2026 View Details