MisuJob - AI Job Search Platform MisuJob

Jobs

Browse 250+ jobs updated daily

Latest Job Openings

2 Locations permanent
Software EngineeringSystems ProgrammingData StructuresAlgorithmsLinux AdministrationCluster ManagementKubernetesGoPythonGPU Deep LearningGPU Asset ProvisioningDatacenter Operations

NVIDIA is hiring engineers to scale up its AI Infrastructure. We expect you to have a strong programming background, knowledge of datacenter hardware, operations, and networking, familiarity with soft...

January 6, 2026 View Details
4 Locations permanent
Hybrid Quantum-HPC Platform EngineeringHPC Systems & OperationsVendor & Partner Collaboration

NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. To...

January 6, 2026 View Details

IT Projektmanager (m/w/d) HPC/DataCenter, (ID: 30023)

Firmenname für EXPERT-Mitglieder sichtbar

IT-Projektmanagement -Erfahrung im Remote
IT ProjektmanagementHigh Performance ComputingStrategieberatungKonzeptentwicklungProviderauswahlDeutschkenntnisseEnglischkenntnisse

Projektbeschreibung Für unseren Kunden suchen wir derzeit einen IT Projektmanager (m/w/d) High Performance Computing in Köln. Sie unterstützen unseren Kunden beim Aufbau des HPC und sind dabei verant...

January 5, 2026 View Details
US, CA, Santa Clara permanent
Performance EngineeringHPCParallel ProgrammingNCCLUCXNVSHMEMMPIKubernetesPython

NVIDIA is leading the way in groundbreaking developments in Artificial Intelligence, High Performance Computing and Visualization. The GPU, our invention, serves as the visual cortex of modern compute...

January 5, 2026 View Details
San Francisco, Seattle, California, Washington, United States Remote permanent
HPCCluster ManagementOperating SystemsFirmwareSoftwareNetworkingTroubleshootingLinuxSLURMKubernetesAttention to DetailProblem Solving

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mi...

November 4, 2025 View Details
San Francisco, Seattle, California, Washington, United States Remote permanent
LeadershipHPCGPUCluster DeploymentCross-functional CollaborationAutomationProcess ImprovementMetricsVisibilityTechnology Awareness

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mi...

October 9, 2025 View Details
United States Remote permanent
HPCCloudKubernetesSlurmGPUCUDAInfiniBandIncident ManagementIncident ResponseCustomer ServiceTechnical ExpertiseRunbooks

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mi...

October 13, 2025 View Details
United States Remote permanent
LeadershipTechnical ExpertiseCustomer AdvocacyIncident ManagementOperationsProcess ImprovementCollaborationMetricsHands-On

Lambda, The Superintelligence Cloud, is a leader in AI cloud infrastructure serving tens of thousands of customers. Our customers range from AI researchers to enterprises and hyperscalers. Lambda's mi...

October 13, 2025 View Details
Canada Remote permanent
GPU InfrastructureHPCKubernetesCloudOptimizationTroubleshootingObservabilityInnovationMentorship

Who are we? Our mission is to scale intelligence to serve humanity. We’re training and deploying frontier models for developers and enterprises who are building AI systems to power magical experience...

November 18, 2025 View Details
San Francisco, California, United States permanent
System-Level TroubleshootingAutomationMonitoringPerformance OptimizationPythonGoLinuxNetworkingServer HardwareSQL

About the team The Fleet team at OpenAI supports the computing environment that powers our cutting-edge research and product development. We oversee large-scale systems that span data centers, GPUs, ...

June 3, 2025 View Details
San Francisco, California, United States permanent
HPC SystemsArchitectureDeploymentOperationWorkload ManagementNCLSFSlurmWorkload BalancingCluster FederationMulti-SchedulerLinux Administration

ABOUT THE TEAM The Consumer Products Infrastructure team builds and operates the high-performance computing platforms that support product design, simulation, and validation across OpenAI’s consumer-...

December 15, 2025 View Details
6 Locations permanent
Linux AdministrationNetworkingStorageJob SchedulersCluster AdministrationAutomation SolutionsCluster EfficiencyIncident ResponseLinux Distributions (Centos/RHEL, Ubuntu)Python Programming

NVIDIA is a pioneer in accelerated computing, known for inventing the GPU and driving breakthroughs in gaming, computer graphics, high-performance computing, and artificial intelligence. Our technolog...

January 3, 2026 View Details
Derbyshire, United Kingdom Contract
High Performance ComputingKubernetesOpenFoamSoftware Development Life Cycle (SDLC)Job SchedulingRun:AIGPU slicingJupyter ecosystemDockerSlurm

HPC Consultant: High Performance Computing, Kubernetes, OpenFoam - 5 day Contract Our enterprise client is looking for a HPC Principal Consultant to join their team. Start Date: ASAP Duration...

January 2, 2026 View Details
Hawthorne, CA permanent
LinuxHPCDockerPythonSlurmTerraformPuppetGPUC++Kubernetes

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologi...

December 29, 2025 View Details
Oslo, Norway Freelance
HPCAIInfrastructure ManagementArchitectureNVIDIAInfiniBandEthernetBase Command ManagerDockerKubernetesLinuxScripting

Primary Duties: Design and architect scalable, high-performance compute and network infrastructures for HPC/AI clusters Lead the implementation of advanced networking solutions, including NVIDIA Infin...

December 18, 2025 View Details
IT-Teams Remote
HPC SystemSoftware EngineeringLinuxPythonBashC/C++MPIOpenMPCUDAAnsibleJob SchedulingParallel Computing

Projektbeschreibung Zur Entlastung des Global Engineering Computing Operations Teams (GECOT) werden externe HPC System- bzw. Software Engineers gesucht. Ziel ist die Unterstützung des laufenden Betrie...

December 18, 2025 View Details

HPC Data Center Manager

Corescientific

Denton, TX permanent
LeadershipTeam BuildingSupervisionProject ManagementBudgetingResource AllocationIncident ManagementCommunicationTechnical ExpertiseOperations

Who We Are Core Scientific is a leading provider of infrastructure for high-performance compute in North America. Our mission is to accelerate digital innovation by scaling high-value compute rapidly...

December 15, 2025 View Details
Austin, Texas, United States; Santa Clara, California, United States; Toronto, Ontario, Canada (Austin, Santa Clara, United States) Hybrid permanent
Container TechnologiesLinux AdministrationHPC NetworkingStorage PlatformsInfrastructure-as-CodeAI/automationTroubleshootingHardware CollaborationIBM Spectrum LSFDockerSingularityPodmanLinux system administrationStorage architecturesJob scheduling

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must...

December 12, 2025 View Details
Austin, Texas, United States; Santa Clara, California, United States; United States (Austin, Santa Clara, Toronto) Hybrid permanent
Linux AdministrationAnsibleBare-metal provisioningRHELUbuntu administrationHPC Cluster ManagementInfrastructure as Code (IaC)kernel tuningPerformance optimization

Tenstorrent is leading the industry on cutting-edge AI technology, revolutionizing performance expectations, ease of use, and cost efficiency. With AI redefining the computing paradigm, solutions must...

December 12, 2025 View Details
Livingston, NJ / New York, NY / Sunnyvale, CA / Bellevue, WA (Bellevue, WA, Livingston, NJ , New York, NY, San Francisco, CA, Sunnyvale, CA) Remote permanent
KubernetesProof of ConceptsTechnical leadershipCustomer CollaborationComputer HardwareCloud InfrastructureWorkload BalancingProduct EnhancementTechnical Review

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. T...

December 12, 2025 View Details
New York, NY / Sunnyvale, CA / Bellevue, WA (Bellevue, WA, New York, NY, Sunnyvale, CA) Remote permanent
HPC EngineerBenchmarkingVirtualizationLinux kernel modulesContainerizationDockerKubernetesMPI WorkloadsInfiniBandTelemetry

CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. T...

December 12, 2025 View Details
Palo Alto, CA (Dublin, IE, Memphis, TN, Palo Alto, CA) permanent
InfiniBandNCCL expertisePython automationPerformance optimizationMetric dashboard creationOn-call rotationsSoftware engineering mindset

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineeri...

December 10, 2025 View Details
Dublin, IE (Dublin, IE, Memphis, TN, Palo Alto, CA) permanent
LargeScaleNetworkDesignEthernetAIHPCCongestionControlAIWorkloadsNCCLDebuggingPerformanceOptimizationPythonAutomationPerformanceMetricsCrossCountryTravelOnCallResponsibility

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineeri...

December 10, 2025 View Details
Memphis, TN (Dublin, IE, Memphis, TN, Palo Alto, CA) permanent
PythonNCCL expertiseEthernetRoCEv2InfiniBandPerformance optimizationMetric dashboard creationAI Educationinference infrastructureOn-call rotations

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineeri...

December 10, 2025 View Details
Bundesweit, Germany Remote Freelance
C++PythonBashShellLinuxNvidia CUDAParallel ProcessingMPIOpenMP

For our customer, we are looking for an experienced Software Engineer or System Engineer specializing in High Performance Computing HPC (m/f/d). Project goal: Optimization and ensuring reliabi...

December 9, 2025 View Details
Location not specified
Software DevelopmentProject ManagementCyber SecurityAgileAutomotive SystemsInfotainment ArchitectureSimulationData ProcessingLeadershipCommunication

We are searching intelligent and innovative employees. If you are interested in working for a dynamic company with flat hierarchy, we look forward to receiving your application. Your tasks: Experien...

December 9, 2025 View Details
Location not specified Remote permanent
RDMA NetworkingHPC NetworkingCloud Computing (AWS)Data StoreVirtualization

Company Description Guardant Health is a leading precision oncology company focused on guarding wellness and giving every person more time free from cancer. Founded in 2012, Guardant is transforming ...

December 9, 2025 View Details
Seoul, South Korea (Seoul) permanent
Data-Driven Decision MakingPrice OptimizationProcess AutomationSystem ManagementTeam LeadershipProject ManagementData AnalysisCustomer-Centric PricingCross-Functional Collaboration

회사 소개 쿠팡은 고객 감동 실현을 위해 존재합니다. 고객들이 "쿠팡 없이 그동안 어떻게 살았을까?" 라고 말할 때, 비로소 우리의 미션을 실현하고 있음을 알 수 있습니다. 고객들의 쇼핑과 식사, 생활 전반을 편하게 만들겠다는 유일한 집념으로 쿠팡은 수억 달러 규모의 커머스 산업 전반의 혁신을 이끌고 있습니다. 쿠팡은 가장 빠르게 성장하는 리테일 기업 중...

December 9, 2025 View Details
Seattle, USA (Seattle) Remote permanent
ProgrammingLanguages_CCloudPlatformsHighPerformanceComputingAIMachineLearningProduct ManagementRoadmapTechnical leadership

*note this role can be located in either Seattle or Mountain View Company Introduction  We exist to wow our customers. We know we’re doing the right thing when we hear our customers say, “How did we...

December 9, 2025 View Details
Seoul, South Korea (Seoul) permanent
GPUHPC EnvironmentsAIContainer OrchestrationCapacity planningProduct ManagementGPU ComputingDistributed SystemsAI FrameworksProduct Strategy

회사 소개 쿠팡은 고객 감동 실현을 위해 존재합니다. 고객들이 "쿠팡 없이 그동안 어떻게 살았을까?" 라고 말할 때, 비로소 우리의 미션을 실현하고 있음을 알 수 있습니다. 고객들의 쇼핑과 식사, 생활 전반을 편하게 만들겠다는 유일한 집념으로 쿠팡은 수억 달러 규모의 커머스 산업 전반의 혁신을 이끌고 있습니다. 쿠팡은 가장 빠르게 성장하는 리테일 기업 중...

December 9, 2025 View Details
Austin, TX, United States; Chicago, Illinois, United States; London, United Kingdom; New York, NY, United States; Seattle, Washington, United States (New York City) Remote permanent
NetworkDesignRoutingConfigurationsTroubleshootingCapacity planningScriptwritingCross-disciplinary teamworkDocumentation collectionMentoring

Hudson River Trading’s High Performance Computing (HPC) Network Engineering team designs and engineers the low-latency communications infrastructure that underpins our incredibly large GPU and CPU com...

December 8, 2025 View Details
Boston, Massachusetts, United States; Las Vegas, Nevada, United States; Pittsburgh, Pennsylvania, United States (Pittsburgh) permanent
ProcurementStrategic SourcingVendor ManagementRFX EventsContract Negotiationslegal_termsSupply Chain ManagementAutomotive IndustryCost ManagementCommunication

Reporting to the Director, Direct Procurement, the Sr. Strategic Sourcing Manager plays a critical role in the development of Motional's self-driving vehicle hardware. This role will manage the strate...

December 8, 2025 View Details
Chicago, Illinois, United States; London, United Kingdom; New York, NY, United States (All Offices) Remote permanent
HPE storageDistributed data storageNAS StorageLinux troubleshootingPythonBash ScriptingFile SystemsLinux Software RAIDZFS

The R&D team at Hudson River Trading (HRT) builds and maintains the computers, networks, data storage, operating systems, and software that allow our trading strategies and research environment to...

December 8, 2025 View Details
München, Germany Freelance
Android Automotive OSSystem ArchitectSystem Boundary DefinitionAudio-Video-Display PipelinesHigh Performance Computing

Projektbeschreibung Für ein strategisches Automotive-Projekt im Bereich High-Performance-Computing (HPC) suchen wir einen erfahrenen Lead Infotainment Architect. Der Experte unterstützt während der t...

December 5, 2025 View Details

Software / System Engineer in HPC (m/f/d)

Firmenname für EXPERT-Mitglieder sichtbar

Berlin Remote
Software-DevelopmentSystems EngineerHigh Performance ComputingLinuxPythonBashAnsibleMPIOpenMPCUDAParallel Processing

Projektbeschreibung For our customer, we are looking for an experienced Software Engineer or System Engineer specializing in High Performance Computing HPC (m/f/d). Project goal: Optimization and en...

December 5, 2025 View Details
Sydney, Australia (Sydney) permanent
HPC EngineerSystem designPerformance optimizationParallel ProcessingStorage Systems ManagementHPC ProcessorsGPU

At IMC, we harness cutting-edge technology to power world-class trading and research. We’re looking for a HPC Systems Engineer to design, implement, and optimise our global high-performance computing ...

December 4, 2025 View Details
San Francisco Remote permanent
storage_engineeringMulti-petabyte SystemsWekaFS IntegrationCephLustre OptimizationKubernetes storageNetwork PerformanceRDMA & InfiniBandMonitoring

About the Role In this role, you will design and deliver multi-petabyte storage systems purpose-built for the world’s largest AI training and inference workloads. You’ll architect high-performance pa...

December 3, 2025 View Details

Systems Engineer, HPC

Helionenergy

Everett, WA permanent
HPC EnvironmentsLinuxFortran

About Helion We are a fusion power company based in Everett, WA, with the mission to build the world's first fusion power plant, enabling a future with unlimited clean electricity. Our vision is a wo...

December 3, 2025 View Details
Home Based - Americas; Home based - EMEA (Home Based - Americas, Home Based - EMEA) Remote permanent
PythonHPC EnvironmentsDockerPublic CloudDocker image designInfiniBandRDMA NetworkingCUDAMPISlurmLustre

Canonical is a leading provider of open source software and operating systems to the global enterprise and technology markets. Our platform, Ubuntu, is very widely used in breakthrough enterprise init...

December 1, 2025 View Details
Location not specified
ProgrammingLanguages_CDistributed SystemsHighPerformanceComputingGPUProgrammingParallelProgrammingContainerizationAutomation & OrchestrationNonLinearOptimizationCommunicationSkillsAlgorithmDesign

At Lace Lithography, we’re scaling up our team and looking for a passionate and sharp talent to join us. In our company we build chip lithography technology that will enable the next 100 years of chip...

November 27, 2025 View Details
Bothell, Washington, United States; College Park, Maryland, United States (College Park, Maryland) Remote permanent
C++GoPythonQuantum software stacksMPIDockerKubernetesSlurm

IonQ is developing the world's most powerful full-stack quantum computer based on trapped-ion technology. We are pushing past the limits of classical physics and current supercomputing technology to u...

November 26, 2025 View Details
Hawthorne, CA permanent
Windows scriptingLinuxHPC Cluster ManagementDockerAnsibleMonitoringML-FrameworksGPUContainer

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologi...

November 17, 2025 View Details

HPC Infrastructure Support Engineer

Xtxmarketstechnologies

Finland permanent
HPC NetworkingData Centre PerformanceServer ManagementSwitch InstallationCable managementCluster OperationsGPU workload administrationStorage Systems ManagementInfrastructureOn-call

The Firm XTX Markets is a leading algorithmic trading firm which uses state-of-the-art machine learning technology to produce price forecasts for over 50,000 financial instruments across equities, fi...

November 14, 2025 View Details
Remote freelance
LustreCrayLinuxInfiniBandScripting & AutomationAnsiblePuppeteerSaltChefPythonPerl

My client are looking for an experienced HPC specialist for a large scale HPC project. The consultant will require an indepth understanding and have deployment experience of Lustre file system Archite...

November 12, 2025 View Details
Hawthorne, CA permanent
LinuxHPC EnvironmentsWindows scriptingHPC Cluster ManagementCluster resource managersMonitoringScientific ComputingML-FrameworksGPUContainer

SpaceX was founded under the belief that a future where humanity is out exploring the stars is fundamentally more exciting than one where we are not. Today SpaceX is actively developing the technologi...

November 5, 2025 View Details

HPC & Infrastructure Engineer

Towerresearchcapital

Gurgaon Remote permanent
HPC EnvironmentsInfrastructureCloudsDistributed data storageBatch Job SchedulersContainer TechnologiesKubernetesInfrastructure as Code (IaC)Enterprise Monitoring ToolsNetwork_Fundamentals

Tower Research Capital is a leading quantitative trading firm founded in 1998. Tower has built its business on a high-performance platform and independent trading teams. We have a 25+ year track recor...

October 13, 2025 View Details
Nuremberg freelance
HPC EngineerHigh Performance ComputingBashPythonDevOpsAnsibleSaltPuppeteerSlurmGPFSLustre

Our client are a global consultancy, working on a project in the Pharma space, they are on the lookout for a HPC Engineer to come in on a contract basis. Key Skills/Requirements: Strong exper...

October 11, 2025 View Details
India - Gurgaon Hybrid permanent
LinuxHPC clustersSystem AdministrationPerformance OptimizationCluster Management ToolsNetworkingStorage SystemsSecurityDevOpsUser Support

Responsibilities: System Administration & Maintenance: Install, configure, and maintain HPC clusters (hardware, software, operating systems), perform regular updates/patching, manage user accounts an...

October 6, 2025 View Details
Rockville, MD (Axle Informatics LLC) Remote permanent
HPC ProcessorsAWSTerraformAnsibleJupyterHub/JupyterLabSlurmOpenSCAPOpenHPCWarewulf

(ID: 2025-0683) Axle is a bioscience and information technology company that offers advancements in translational research, biomedical informatics, and data science applications to research centers a...

September 3, 2025 View Details