MisuJob - AI Job Search Platform

Jobs

Browse 97+ jobs updated daily

Latest Job Openings

Chile (Worldwide - Remote) Remote permanent
Python, JavaScript, PyTorch, TensorFlow, Reinforcement Learning, FastAPI, React, Docker, AWS, GCP, Backend Systems, Frontend Systems

What You’ll Do Support projects by designing and implementing reinforcement learning systems that bridge research and deployment. Work across the stack to contribute to both backend services and front...

December 17, 2025
Brazil (Worldwide - Remote) Remote permanent
Reinforcement Learning, Full Stack Development, Python, JavaScript, PyTorch, TensorFlow, APIs, Backend Services, Frontend Interfaces, Data Exploration

What You’ll Do Support projects by designing and implementing reinforcement learning systems that bridge research and deployment. Work across the stack to contribute to both backend services and front...

December 17, 2025
Argentina (Worldwide - Remote) Remote permanent
Reinforcement Learning, Model Development, Pipeline Design, Backend Services, Frontend Interfaces, Data Exploration, Python, JavaScript, PyTorch, React

What You’ll Do Support projects by designing and implementing reinforcement learning systems that bridge research and deployment. Work across the stack to contribute to both backend services and front...

December 17, 2025
LATAM - Remote (Worldwide - Remote) Remote permanent
Reinforcement Learning, Python, JavaScript, PyTorch, TensorFlow, JAX, OpenAI Gym, FastAPI, React, Distributed Training, Experiment Tracking

What You’ll Do Support projects by designing and implementing reinforcement learning systems that bridge research and deployment. Work across the stack to contribute to both backend services and front...

December 17, 2025
Munich (DEU) permanent
Deep knowledge of RL theory, Simulation-based learning, Python proficiency, Mathematics and statistics foundation, Publications at top conferences

Your mission:
• Research and prototype novel RL algorithms (e.g. exploration, POMDPs, multi-agent systems)
• Design and implement use-cases for DRL on edge devices
• Translate theory into scalable s...

January 6, 2026
Waterloo, Ontario, Canada permanent
Reinforcement Learning, Python, Research, Artificial Intelligence, Robotics, Self-adaptive Systems, State-of-the-Art Algorithms, Collaboration, Mentorship, Industry Experience

Huawei Canada has an immediate 12-month contract opening for an Engineer. About the team: The Intelligent Complex Systems Team, currently a part of the Waterloo Research Centre, examines recent adv...

January 30, 2026
Location not specified Remote
Reinforcement Learning, AI Model Development, Advanced Model Architectures, Resource-Efficient Models, Multi-Modal Architectures, Cross-System Optimization, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 22, 2026
Zurich, Switzerland permanent
Multimodal, Reinforcement Learning, RL Algorithms, Environment Scaling, Strategic Application, Experimentation & Analysis

Snapshot Are you a Research Engineer with a passion for Reinforcement Learning and Multimodality? Join Google DeepMind’s Frontier AI Unit! We are seeking a researcher to help us make learning efficie...

January 21, 2026
Remote job permanent
Reinforcement Learning, AI Model Development, Advanced Model Architectures, Resource-Efficient Systems, Multi-Modal Architectures, English Communication, Global Collaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI, Model Development, Advanced Models, Resource-efficient Models, Multi-modal Architectures, Data Integration, System Optimization, Research-driven Approach

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
AI, Reinforcement Learning, Model Development, Model Architecture, Decision Making, Resource Efficiency, Multi-modal Systems

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI Model Development, Advanced Model Architectures, Resource-Efficient Systems, Multi-Modal Architectures, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
AI, Reinforcement Learning, Model Development, Resource Efficiency, Multi-modal Architectures, English Communication, Collaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
AI, Reinforcement Learning, Model Development, Advanced Model Architectures, Resource-Efficient Systems, Multi-Modal Architectures, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
AI, Reinforcement Learning, Model Development, Advanced Model Architectures, Research-Driven Approach, Resource-Efficient Models, Multi-Modal Architectures

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI Model Development, Advanced Model Architectures, Resource-Efficient Models, Multi-Modal Architectures, Domain-Specific Capabilities, Decision-Making Optimization, Adaptive Behavior, Research-Driven Approach, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI Research, Advanced Model Architectures, Resource-efficient Models, Multi-modal Architectures, English Communication, Collaboration, Fintech Innovation

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, Advanced Model Architectures, Resource-Efficient Models, Multi-Modal Architectures, Domain-Specific Capabilities, Decision-Making Optimization, Adaptive Behavior, Resource Management, Multi-Hardware Systems, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI, Advanced Model Architectures, Resource-efficient Models, Multi-modal Architectures, Domain-specific Capabilities, Decision-making Optimization, Adaptive Behavior, English Communication, Remote Collaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
AI, Reinforcement Learning, Model Development, System Optimization, Advanced Model Architectures, Hands-on Research, Data Integration, Resource Efficiency, Multi-modal Systems, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI Model Development, Advanced Model Architectures, Resource-Efficient Models, Multi-Modal Architectures, Domain-Specific Capabilities, Resource Optimization, Decision-Making Optimization, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI, Model Development, Advanced Model Architectures, Resource-Efficient Systems, Multi-Modal Architectures, Programming, Research

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI, Model Development, Advanced Model Architectures, Resource-Efficient Systems, Multi-Modal Architectures, Data Integration, English Communication, Collaboration, Fintech Innovation

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
AI, Reinforcement Learning, Decision Making, Adaptive Behavior, Resource-Efficient Models, Multi-Modal Architectures, Advanced Model Architectures

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
AI, Reinforcement Learning, Model Development, Advanced Models, Resource-efficient Models, Multi-modal Architectures, Hands-on Research

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI Model Development, Domain-Specific Capabilities, Resource-Efficient Models, Multi-Modal Architectures

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI Model Development, Advanced Model Architectures, Resource-Efficient Models, Multi-Modal Architectures, English Communication, Global Collaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI, Model Development, Advanced Algorithms, Resource Efficiency, Multi-modal Architectures, Data Integration, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
AI, Reinforcement Learning, Research, Model Development, Decision Making, Resource Efficiency, Multi-Modal Architectures, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI Model Development, Advanced Model Architectures, Resource-Efficient Models, Multi-Modal Architectures, Decision Optimization

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI Model Development, Advanced Model Architectures, Resource-Efficient Systems, Multi-Modal Architectures, Domain-Specific Capabilities, Research-Driven Approach, English Communication, Fintech Innovation, Global Collaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, Advanced Model Architectures, Resource-Efficient Models, Multi-Modal Architectures, Data Integration, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
AI, Reinforcement Learning, Model Development, Advanced Models, Resource-Efficient Systems, Multi-Modal Architectures, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Remote job permanent
Reinforcement Learning, AI Model Development, Advanced Model Architectures, Resource-Efficient Systems, Multi-Modal Architectures, English Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026
Menlo Park, California, United States permanent
Python, TensorFlow, PyTorch, Reinforcement Learning, Large Language Models, RLHF, DPO, PPO, Multi-agent Systems, Post-training Optimization

Snowflake is about empowering enterprises to achieve their full potential — and people too. With a culture that’s all in on impact, innovation, and collaboration, Snowflake is the sweet spot for build...

September 6, 2025
Munich, Germany; Singapore (Munich (DE-MUC-ARP), Singapore (SG-SIN-FUS)) permanent
Reinforcement Learning, JAX, Python, C++, TensorFlow, PyTorch, Robotics, Industrial Assembly, Production Systems, Diffusion Policies

Intrinsic is Alphabet’s bet aiming to reimagine the potential of industrial robotics. Our team believes that advances in AI, perception and simulation will redefine what’s possible for industrial robo...

December 19, 2025
Austin, TX (HQ) Remote permanent
Reinforcement Learning, Python, C++, Simulators, Distributed training, Robotics, Computer Hardware, Mentoring, Code-Reviews, SystemAnalysis

Apptronik is building robots for the real world to improve human quality of life and to help solve the ever-increasing labor shortage problem. Our team has been building some of the most advanced robo...

December 11, 2025
London (London, United Kingdom) permanent
PhD, Masters, Reinforcement Learning, Research, Synthetic Data, Representation Learning, Offline RL, Temporal Credit Assignment, Policy Optimization

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief...

December 11, 2025
Vancouver (Vancouver, Canada) Remote permanent
MachineLearning, ReinforcementLearning, reward modeling, LargeScaleSystems, DataProcessingAgreements, Data Annotation, Lean Tools Development, RewardModelBuilding, DatabaseIntegration, AblationStudies

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief...

December 11, 2025
Palo Alto, CA permanent
Reinforcement Learning, Computer Vision, LLM Agents, Software Development, Testing, Data Acquisition, Evaluation Suites, Product Experience, Research, Communication

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineeri...

December 10, 2025

Master's Thesis (f/m/d) - Reinforcement Learning-Based Flight Control

Deutsches Zentrum für Luft- und Raumfahrt e. V. (DLR)

Location not specified
IT/Software, Programming languages, Python, Actor-Critic, Reinforcement Learning, Simulation, Uncertainty, Gaussian Processes, Safety Filter, Uncertainty Estimation

Reference number: 3616 Work location: Braunschweig, Cochstedt Start date: April 2026 Career level: Student project & thesis Employment type: Part-time, full-time Duration of employment: by arrangement...

December 9, 2025
Munich - Berlin - London - Paris (Berlin, London, Munich, Paris) Remote permanent
ProgrammingLanguages_C, Python, Rust, Java, C++, ReinforcementLearning, MultiAgentSystems, DeepLearning, DataEngineering, ProductionDeployment

Who we are Helsing is a defence AI company. Our mission is to protect our democracies. We aim to achieve technological leadership, so that open societies can continue to make sovereign decisions and ...

December 8, 2025
Edmonton, Alberta, Canada permanent
Reinforcement Learning, Large Language Models, LLM, RLHF, GRPO, Reward-free Methods, Agentic Reinforcement Learning, Agentic Evaluation, Multi-turn Tasks, Python, PyTorch, DeepSpeed

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher. About the team: Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organizati...

January 30, 2026
San Jose, CA (HQ) permanent
PyTorch, Reinforcement Learning, PPO, SACS, Hyperparameter Tuning, Domain Randomization, Curriculum knowledge, reward modeling, TensorBoard, Weights&Biases

Figure is an AI Robotics company developing a general purpose humanoid. Our Humanoid is designed for corporate tasks targeting labor shortages and jobs that are undesirable or unsafe. We are based in ...

November 14, 2025

Logistics Automation

Reinforcement Learning for Adaptive Control

Markkleeberg, Saxony - Freelance - Onsite freelance
Technical Automation, Simulation, Computer-Aided Design, 3D Data Visualization, Agile Project Methods, AI, Algorithms, C++, Continuous Integration, Python, Scrum, Robotics

Keywords: Automation, Simulations, Computer-Aided Design, 3D Visualization, Agile Methodology, Artificial Intelligence, Algorithms, C++ (Programming Language), Continuous Integration, Recruitment, Python (Progra...

September 3, 2025
Palo Alto, California, United States permanent
Python, C++, PyTorch, Isaac Sim, MuJoCo, Reinforcement Learning, Simulation, Sim-to-Real, Product Deployment

AI Research Engineer, Reinforcement Learning | AI & Robotics Location: Palo Alto, CA (on-site) About 1X We build humanoid robots that work alongside people to solve labor shortages and create abunda...

January 30, 2026