MisuJob - AI Job Search Platform

Jobs

Browse 250+ jobs updated daily

Latest Job Openings

San Francisco, California, United States permanent
AMD GPU Enablement, GPU Kernels, HIP, CUDA, Triton, Distributed Inference, Performance Optimization, Communication Libraries, RCCL, GPUs

About the Team Our Inference team brings OpenAI’s most capable research and technology to the world through our products. We empower consumers, enterprises and developers alike to use and access our s...

October 8, 2025

6 Locations permanent
Computer Science, Artificial Intelligence, Deep Learning, Machine Learning, Software Systems, GPU Optimization, CUDA, Quantization, Sparsity, Inference Optimization

NVIDIA is at the forefront of the generative AI revolution! The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diff...

January 3, 2026

Menlo Park, California, United States permanent
Machine Learning, Inference, Snowflake, LLMs, Inference Engines, TensorFlow, PyTorch, Docker, Kubernetes, Terraform

Snowflake is about empowering enterprises to achieve their full potential — and people too. With a culture that’s all in on impact, innovation, and collaboration, Snowflake is the sweet spot for build...

August 21, 2025

San Francisco, California, United States permanent
Rust, Distributed Systems, Load Balancing, Traffic Routing, Sticky Routing, Low-Latency Connection Management, Observability, Debugging, Observability Tools, System Lifecycle

About the Team Our Inference team brings OpenAI’s most capable research and technology to the world through our products. We empower consumers, enterprises and developers alike to use and access our ...

October 15, 2025

San Francisco, California, United States permanent
Python, PyTorch, NVidia GPUs, CUDA, HPC technologies, InfiniBand, MPI, NVLink, Distributed systems, Performance-critical systems

About the Team Our Inference team brings OpenAI’s most capable research and technology to the world through our products. We empower consumers, enterprises and developers alike to use and access our s...

February 6, 2025

San Francisco, California, United States permanent
GPU Inference, Model Serving, Inference Performance, System Efficiency, Kernel Optimization, Data Movement, Low-level Performance Tuning, Serving Infrastructure, Multimodal Capabilities, AI System Scaling

About the Team The Sora team is pioneering multimodal capabilities for OpenAI’s foundation models. We’re a hybrid research and product team focused on integrating multimodal functionalities into our ...

April 21, 2025

Sunnyvale, CA / Bellevue, WA (Bellevue, WA, Sunnyvale, CA) Remote permanent
Cloud, Kubernetes, GPU Inference, Triton, vLLM, TensorRT-LLM, Autoscaling, Micro-batching, Developer Experience, Observability, Cost Optimization, Team Leadership

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. T...

December 22, 2025

Champaign-Urbana, USA (Distributed) Remote permanent
Python, Production service operation, Cloud Platforms (GCP), Docker, Containerized deployments, Kubernetes, ML Model Deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 20, 2025

Boise, USA (Distributed) Remote permanent
Python, Machine Learning, Production Deployment, Cloud Platforms (GCP), Docker, Kubernetes, ML Model Deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 20, 2025

Portland, USA (Distributed) Remote permanent
Python, Production service operation, Cloud Platforms (GCP), Docker, Containerized deployments, Kubernetes, ML model deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 20, 2025

Detroit-Ann Arbor, USA (Distributed) Remote permanent
Python, Production service operation, Cloud Platforms (GCP), Docker, Containerized deployments, Kubernetes, ML model deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 20, 2025

Austin, USA (Distributed) Remote permanent
Python, Machine Learning, Inference, Production Deployment, Kubernetes, Docker, Cloud Platforms (GCP), High-availability Applications, ML Model Deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 20, 2025

Remote Remote permanent
Python, FastAPI, PyTorch, HuggingFace, Kubeflow, Airflow, Ray, MLflow, TensorRT, ONNX Runtime, vllm, Triton

MARA is redefining the future of sovereign, energy-aware AI infrastructure. We’re building a modular platform that unifies IaaS, PaaS, and SaaS which will enable governments, enterprises, and AI innov...

December 17, 2025

Palo Alto Remote permanent
Backend Development, ML infrastructure, Scalable API Development, serving embedding models, System Skills, inference runtimes, Vector Search, Technical leadership

We’re looking for a Lead Engineer, Inference Platform to join our team building the inference platform for embedding models that power semantic search, retrieval, and AI-native features across MongoDB...

December 12, 2025

Sunnyvale, CA / Bellevue, WA (Bellevue, WA, Sunnyvale, CA) Remote permanent
Python, Go, Kubernetes, CI/CD, SLOs/SLIs, Batch Job Schedulers, GPU resource isolation, Metrics-driven Improvement

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. T...

December 12, 2025

Sunnyvale, CA / Bellevue, WA (Bellevue, WA, Sunnyvale, CA) Remote permanent
Distributed Systems, Real-time Inference, Batch Job Schedulers, GPU resource isolation, KV caching, speculative decoding, mixed precision inference, PyTorch serving internals, CUDA Optimization

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. T...

December 12, 2025

Sunnyvale, CA / San Francisco, CA / Bellevue, WA (Bellevue, WA, San Francisco, CA, Sunnyvale, CA) Remote permanent
Program Management, Distributed Systems, ML Infrastructure, GPU Optimization, Model Onboarding, Performance Optimization, Roadmap Delivery, Cross-Functional Collaboration, Metric Development, Process Standardization

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. T...

December 12, 2025

Sunnyvale, CA / Bellevue, WA (Bellevue, WA, Sunnyvale, CA) Remote permanent
Python, Go, C++, Triton, vllm, TensorRT, Ray Serve, Linux, Git, Kubernetes, PyTorch

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. T...

December 12, 2025

Palo Alto Remote permanent
Backend Development, Cloud Native, Distributed Systems, Multi-Tenant, Model Serving, inference runtimes, Vector Search, Observability, Performance optimization

About the Role We’re looking for a Senior Engineer to help build the next-generation inference platform that supports embedding models used for semantic search, retrieval, and AI-native experiences i...

December 12, 2025

London (London, United Kingdom) permanent
Machine Learning, Distillation, Quantization, Pruning, Data Compression, PyTorch, Edge Computing, Innovation

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief...

December 11, 2025

London (London, United Kingdom) permanent
C++ Programming, GPU Optimization, System-Level Performance Tuning, CUDA, OpenCL, Low-Level System Management, Data Transfer Efficiency, Real-Time Inference, GPU Programming APIs, Performance Profiling

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief...

December 11, 2025

London (Israel, London, United Kingdom) permanent
Machine Learning, Distillation, Quantization, Pruning, Data Compression, PyTorch, Edge Computing, Innovation, Low resource optimization

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief...

December 11, 2025

Palo Alto, CA; San Francisco, CA (Palo Alto, CA, San Francisco, CA) Remote permanent
Kubernetes, Terraform, Pulumi, CI/CD, Docker, Basic Debugging, Testing

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineeri...

December 10, 2025

Palo Alto, CA; San Francisco, CA (Palo Alto, CA, San Francisco, CA) permanent
Python, Rust, PyTorch, JAX, CUDA, Kubernetes, Systems, Optimization, Testing, Communication

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineeri...

December 10, 2025

Remote - USA (Remote - NAMER) Remote permanent
PhD, Master's, Quantitative, Statistics, Python, SQL, Causal Inference, Quasi-experimental Methods, Technical Mentorship, Product Strategy, Communication

Ready to be pushed beyond what you think you’re capable of? At Coinbase, our mission is to increase economic freedom in the world. It’s a massive, ambitious opportunity that demands the best of us, e...

December 10, 2025

Ann Arbor, USA (Distributed) Remote permanent
Python, Production service operation, Kubernetes

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

State College, USA (Distributed) Remote permanent
Python, Docker, Kubernetes, ML, Film Production, GCP, AWS, Code, Inference

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Seattle, USA (Distributed) Remote permanent
Python, ML, Kubernetes, Docker, GCP, AWS, ML Model Development

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

San Francisco, USA (Distributed) Remote permanent
Python, Inference, Production service operation, Docker, Container-based Deployment, Kubernetes, Model Deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Salt Lake City, USA (Distributed) Remote permanent
Python, Production service operation, Kubernetes

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Raleigh-Durham, USA (Distributed) Remote permanent
Python, Inference, Production service operation, Docker, Container-based Deployment, Kubernetes, Model Deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Philadelphia, USA (Distributed) Remote permanent
Python, Production service operation, Public Cloud, Docker, Container-based Deployment, Kubernetes, Model Deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Madison, USA (Distributed) Remote permanent
Python, Film Production, Clouds, Docker, Kubernetes, ML Model Development

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Denver, USA (Distributed) Remote permanent
Python, Production service operation, Kubernetes, Model Deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Chicago, USA (Distributed) Remote permanent
Python, Production service operation, Kubernetes

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Boston, USA (Distributed) Remote permanent
Python, Production service operation, Kubernetes app deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Atlanta, USA (Distributed) Remote permanent
Python, Film Production, Clouds, Docker, Kubernetes, ML, Inference

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Remote (Distributed) Remote permanent
Python, Machine Learning, Film Production, Clouds, Docker, Kubernetes, ML Model Development

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Pittsburgh, USA (Distributed) Remote permanent
Python, Machine Learning, Production Deployment, Kubernetes, Docker, GCP

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Nashville, USA (Distributed) Remote permanent
Python, Production service operation, Kubernetes

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Minneapolis-St. Paul, USA (Distributed) Remote permanent
Python, ML, Clouds, Docker, Kubernetes, ML Model Development

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Ithaca, USA (Distributed) Remote permanent
Python, Production service operation, Docker, Container-based Deployment, Kubernetes, Model Deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Houston, USA (Distributed) Remote permanent
Python, Film Production, Clouds, Docker, Kubernetes, ML Model Development

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Dallas, USA (Distributed) Remote permanent
Python, Production service operation

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Columbus, USA (Distributed) Remote permanent
Python, Inference, Production service operation, Docker, Container-based Deployment, Kubernetes, Model Deployment

The mission of Speechify is to make sure that reading is never a barrier to learning. Over 50 million people use Speechify’s text-to-speech products to turn whatever they’re reading – PDFs, books, Go...

December 8, 2025

Belgrade, Serbia (Belgrade) Hybrid permanent
Backend Development, API Design, Performance optimization, Software Architecture, Docker, Python, Linux

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must...

December 8, 2025

LATAM; NAMER Remote permanent
Python, Inference, Agentic AI systems, AI-powered video features, GPU Kernel Optimization, Kubernetes, Cloud Infrastructure, LLM Integration, LLM orchestration

Please note that we will never request payment or bank account information at any stage of the recruitment process. As we continue to grow our teams, we urge you to be cautious of fraudulent job posti...

November 26, 2025

München, Germany Freelance
Python, Docker, Kubernetes, CI/CD, Containerization, Data Orchestration, Observability, GPU, LLM Optimization

For a client from the Telecom sector we are currently looking for a Senior AI Infrastructure & Inference Engineer (m/f/d). a) Project Details: Duration: 05.01.2026 – 30.06.2026 (with option to extend)...

November 25, 2025

München, Germany Remote Freelance
AI Infrastructure, Inference, GPU, LLMs, Multi-GPU Scheduling, Distributed Systems, Containerization, Kubernetes, CI/CD, Observability

For our client we are looking for an AI Infrastructure & Inference Engineer (m/f/d) with a focus on GPU & LLM. Duration: 5.1.26 Workload: Full-time Location: Remote • Design, implemen...

November 25, 2025