MisuJob - AI Job Search Platform

Jobs

Browse 50+ jobs updated daily

Latest Job Openings

Hong Kong (United States) Remote, permanent
GPU Clusters, Distributed Systems, PyTorch, DeepSpeed, Megatron-LM, InfiniBand, RDMA, Checkpointing, Recovery Model, SLURM

We are seeking a highly skilled LLM Pre-training & Distributed Systems Engineer. This role is essential for orchestrating large-scale machine learning training runs and optimizing distributed infr...

April 24, 2026

Australia (United States) Remote, permanent
Strong systems engineering background, GPU-Accelerated Platforms, Systems Engineering Support, System Optimization

We are seeking a highly skilled LLM Pre-training & Distributed Systems Engineer. This role is essential for orchestrating large-scale machine learning training runs and optimizing distributed infr...

April 24, 2026

Boston, USA (United States) Remote, permanent
SLURM, Kubernetes, C++, CUDA, Python

We are seeking a highly skilled LLM Pre-training & Distributed Systems Engineer. This role is essential for orchestrating large-scale machine learning training runs and optimizing distributed infr...

April 24, 2026

China (United States) Remote, permanent
GPU-Accelerated Platforms, Strong systems engineering background

We are seeking a highly skilled LLM Pre-training & Distributed Systems Engineer. This role is essential for orchestrating large-scale machine learning training runs and optimizing distributed infr...

April 24, 2026

Singapore (United States) Remote, permanent
Strong systems engineering background, GPU-Accelerated Platforms, Systems Engineering Support, System Optimization, Checkpointing, Recovery Model

We are seeking a highly skilled LLM Pre-training & Distributed Systems Engineer. This role is essential for orchestrating large-scale machine learning training runs and optimizing distributed infr...

April 24, 2026

San Francisco Bay Area, USA (United States) Remote, permanent
Strong systems engineering background, GPU-Accelerated Platforms, Systems Engineering Support, System Optimization, Checkpointing, Recovery Model

We are seeking a highly skilled LLM Pre-training & Distributed Systems Engineer. This role is essential for orchestrating large-scale machine learning training runs and optimizing distributed infr...

April 24, 2026

Oregon, USA (United States) Remote, permanent
GPU clusters, Distributed Systems, Strong systems engineering background

We are seeking a highly skilled LLM Pre-training & Distributed Systems Engineer. This role is essential for orchestrating large-scale machine learning training runs and optimizing distributed infr...

April 24, 2026

Seattle, USA (United States) Remote, permanent
Python, Kubernetes, Machine Learning, C++

We are seeking a highly skilled LLM Pre-training & Distributed Systems Engineer. This role is essential for orchestrating large-scale machine learning training runs and optimizing distributed infr...

April 24, 2026