MisuJob - AI Job Search Platform

Jobs

Browse 113+ jobs updated daily

Latest Job Openings

Menlo Park, California, United States permanent
Python, TensorFlow, PyTorch, Reinforcement Learning, Large Language Models, RLHF, DPO, PPO, Multi-agent Systems, Post-training Optimization

Snowflake is about empowering enterprises to achieve their full potential — and people too. With a culture that’s all in on impact, innovation, and collaboration, Snowflake is the sweet spot for build...

September 6, 2025
Munich, Germany; Singapore (Munich (DE-MUC-ARP), Singapore (SG-SIN-FUS)) permanent
Reinforcement Learning, JAX, Python, C++, TensorFlow, PyTorch, Robotics, Industrial Assembly, Production Systems, Diffusion Policies

Intrinsic is Alphabet’s bet aiming to reimagine the potential of industrial robotics. Our team believes that advances in AI, perception and simulation will redefine what’s possible for industrial robo...

December 19, 2025
London (London, United Kingdom) permanent
PhD, Masters, Reinforcement Learning, Research, Synthetic Data, Representation Learning, Offline RL, Temporal Credit Assignment, Policy Optimization

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief...

December 11, 2025
Vancouver (Vancouver, Canada) Remote permanent
Machine Learning, Reinforcement Learning, Reward Modeling, Large-Scale Systems, Data Processing Agreements, Data Annotation, Lean Tools Development, Reward Model Building, Database Integration, Ablation Studies

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief...

December 11, 2025
Austin, TX (HQ) Remote permanent
Reinforcement Learning, Python, C++, Simulators, Distributed Training, Robotics, Computer Hardware, Mentoring, Code Reviews, System Analysis

Apptronik is building robots for the real world to improve human quality of life and to help solve the ever-increasing labor shortage problem. Our team has been building some of the most advanced robo...

December 11, 2025
Palo Alto, CA permanent
Reinforcement Learning, Computer Vision, LLM Agents, Software Development, Testing, Data Acquisition, Evaluation Suites, Product Experience, Research, Communication

About xAI: xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineeri...

December 10, 2025

Master's Thesis (f/m/d) - Reinforcement Learning-Based Flight Control

Deutsches Zentrum für Luft- und Raumfahrt e. V. (DLR)

Location not specified
IT/Software, Programming Languages, Python, Actor-Critic, Reinforcement Learning, Simulation, Uncertainty, Gaussian Processes, Safety Filter, Uncertainty Estimation

Reference number: 3616; Location: Braunschweig, Cochstedt; Start date: April 2026; Career level: Student research project & thesis; Employment type: Part-time, full-time; Duration of employment: by arrangement...

December 9, 2025
Munich - Berlin - London - Paris (Berlin, London, Munich, Paris) Remote permanent
ProgrammingLanguages_C, Python, Rust, Java, C++, Reinforcement Learning, Multi-Agent Systems, Deep Learning, Data Engineering, Production Deployment

Who we are: Helsing is a defence AI company. Our mission is to protect our democracies. We aim to achieve technological leadership, so that open societies can continue to make sovereign decisions and ...

December 8, 2025
Edmonton, Alberta, Canada permanent
Reinforcement Learning, Large Language Models, LLM, RLHF, GRPO, Reward-free Methods, Agentic Reinforcement Learning, Agentic Evaluation, Multi-turn Tasks, Python, PyTorch, DeepSpeed

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher. About the team: Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organizati...

January 30, 2026
San Jose, CA (HQ) permanent
PyTorch, Reinforcement Learning, PPO, SACS, Hyperparameter Tuning, Domain Randomization, Curriculum Knowledge, Reward Modeling, TensorBoard, Weights & Biases

Figure is an AI Robotics company developing a general purpose humanoid. Our Humanoid is designed for corporate tasks targeting labor shortages and jobs that are undesirable or unsafe. We are based in ...

November 14, 2025

Logistics Automation

Reinforcement Learning for Adaptive Control

Markkleeberg, Sachsen - Freelance - Onsite freelance
Technical Automation, Simulation, Computer-Aided Design, 3D Data Visualization, Agile Project Methods, AI, Algorithms, C++, Continuous Integration, Python, Scrum, Robotics

Keywords: Automation, Simulations, Computer-Aided Design, 3D Visualization, Agile Methodology, Artificial Intelligence, Algorithms, C++ (Programming Language), Continuous Integration, Recruitment, Python (Progra...

September 3, 2025
Palo Alto, California, United States permanent
Python, C++, PyTorch, Isaac Sim, MuJoCo, Reinforcement Learning, Simulation, Sim-to-Real, Product Deployment

AI Research Engineer, Reinforcement Learning | AI & Robotics. Location: Palo Alto, CA (on-site). About 1X: We build humanoid robots that work alongside people to solve labor shortages and create abunda...

January 30, 2026