Jobs

Full Stack Reinforcement Learning (RL) Engineer Specialist - Freelance Project

Agency

Chile (Worldwide - Remote) Remote permanent

PythonJavaScriptPyTorchTensorFlowReinforcement LearningFastAPIReactDockerAWSGCPBackend SystemsFrontend Systems

What You’ll Do Support projects by designing and implementing reinforcement learning systems that bridge research and deployment. Work across the stack to contribute to both backend services and front...

December 17, 2025 View Details

Full Stack Reinforcement Learning (RL) Engineer Specialist - Freelance Project

Agency

Brazil (Worldwide - Remote) Remote permanent

Reinforcement LearningFull Stack DevelopmentPythonJavaScriptPyTorchTensorFlowAPIsBackend ServicesFrontend InterfacesData Exploration

What You’ll Do Support projects by designing and implementing reinforcement learning systems that bridge research and deployment. Work across the stack to contribute to both backend services and front...

December 17, 2025 View Details

Full Stack Reinforcement Learning (RL) Engineer Specialist - Freelance Project

Agency

Argentina (Worldwide - Remote) Remote permanent

Reinforcement LearningModel DevelopmentPipeline DesignBackend ServicesFrontend InterfacesData ExplorationPythonJavaScriptPyTorchReact

What You’ll Do Support projects by designing and implementing reinforcement learning systems that bridge research and deployment. Work across the stack to contribute to both backend services and front...

December 17, 2025 View Details

Full Stack Reinforcement Learning (RL) Engineer Specialist - Freelance Project

Agency

LATAM - Remote (Worldwide - Remote) Remote permanent

Reinforcement LearningPythonJavaScriptPyTorchTensorFlowJAXOpenAI GymFastAPIReactDistributed TrainingExperiment Tracking

What You’ll Do Support projects by designing and implementing reinforcement learning systems that bridge research and deployment. Work across the stack to contribute to both backend services and front...

December 17, 2025 View Details

Reinforcement Learning Research Engineer – Exploration & Decision Intelligence (m/w/d)

Autonomous Teaming

Munich (DEU) permanent

Deep knowledge of RL theorySimulation-based learningPython proficiencyMathematics and statistics foundationPublications at top conferences

Your mission: • Research and prototype novel RL algorithms (e.g. exploration, POMDPs, multi-agent systems) • Design and implement use-cases for DRL on edge devices • Translate theory into scalable s...

January 6, 2026 View Details

Engineer - Reinforcement Learning

Confidential

Waterloo, Ontario, Canada permanent

Reinforcement LearningPythonResearchArtificial IntelligenceRoboticsSelf-adaptive SystemsState-of-the-Art AlgorithmsCollaborationMentorshipIndustry Experience

Huawei Canada has an immediate a 12-month contract opening for an Engineer. About the team: The Intelligent Complex Systems Team, currently a part of the Waterloo Research Centre, examines recent adv...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Tether Operations Limited

Location not specified Remote

Reinforcement LearningAI Model DevelopmentAdvanced Model ArchitecturesResource-Efficient ModelsMulti-Modal ArchitecturesCross-System OptimizationEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 22, 2026 View Details

Research Engineer, Multimodal Reinforcement Learning

Deepmind

Zurich, Switzerland permanent

MultimodalReinforcement LearningRL AlgorithmsEnvironment ScalingStrategic ApplicationExperimentation & Analysis

Snapshot Are you a Research Engineer with a passion for Reinforcement Learning and Multimodality? Join Google DeepMind’s Frontier AI Unit! We are seeking a researcher to help us make learning efficie...

January 21, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAI Model DevelopmentAdvanced Model ArchitecturesResource-Efficient SystemsMulti-Modal ArchitecturesEnglish CommunicationGlobal Collaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAIModel DevelopmentAdvanced ModelsResource-efficient ModelsMulti-modal ArchitecturesData IntegrationSystem OptimizationResearch-driven Approach

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

AIReinforcement LearningModel DevelopmentModel ArchitectureDecision MakingResource EfficiencyMulti-modal Systems

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAI Model DevelopmentAdvanced Model ArchitecturesResource-Efficient SystemsMulti-Modal ArchitecturesEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

AIReinforcement LearningModel DevelopmentResource EfficiencyMulti-modal ArchitecturesEnglish CommunicationCollaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

AIReinforcement LearningModel DevelopmentAdvanced Model ArchitecturesResource-Efficient SystemsMulti-Modal ArchitecturesEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

AIReinforcement LearningModel DevelopmentAdvanced Model ArchitecturesResearch-Driven ApproachResource-Efficient ModelsMulti-Modal Architectures

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAI Model DevelopmentAdvanced Model ArchitecturesResource-Efficient ModelsMulti-Modal ArchitecturesDomain-Specific CapabilitiesDecision-Making OptimizationAdaptive BehaviorResearch-Driven ApproachEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAI ResearchAdvanced Model ArchitecturesResource-efficient ModelsMulti-modal ArchitecturesEnglish CommunicationCollaborationFintech Innovation

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAdvanced Model ArchitecturesResource-Efficient ModelsMulti-Modal ArchitecturesDomain-Specific CapabilitiesDecision-Making OptimizationAdaptive BehaviorResource ManagementMulti-Hardware SystemsEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAIAdvanced Model ArchitecturesResource-efficient ModelsMulti-modal ArchitecturesDomain-specific CapabilitiesDecision-making OptimizationAdaptive BehaviorEnglish CommunicationRemote Collaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

AIReinforcement LearningModel DevelopmentSystem OptimizationAdvanced Model ArchitecturesHands-on ResearchData IntegrationResource EfficiencyMulti-modal SystemsEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAI Model DevelopmentAdvanced Model ArchitecturesResource-Efficient ModelsMulti-Modal ArchitecturesDomain-Specific CapabilitiesResource OptimizationDecision-Making OptimizationEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAIModel DevelopmentAdvanced Model ArchitecturesResource-Efficient SystemsMulti-Modal ArchitecturesProgrammingResearch

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAIModel DevelopmentAdvanced Model ArchitecturesResource-Efficient SystemsMulti-Modal ArchitecturesData IntegrationEnglish CommunicationCollaborationFintech Innovation

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

AIReinforcement LearningDecision MakingAdaptive BehaviorResource-Efficient ModelsMulti-Modal ArchitecturesAdvanced Model Architectures

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

AIReinforcement LearningModel DevelopmentAdvanced ModelsResource-efficient ModelsMulti-modal ArchitecturesHands-on Research

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAI Model DevelopmentDomain-Specific CapabilitiesResource-Efficient ModelsMulti-Modal Architectures

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAI Model DevelopmentAdvanced Model ArchitecturesResource-Efficient ModelsMulti-Modal ArchitecturesEnglish CommunicationGlobal Collaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAIModel DevelopmentAdvanced AlgorithmsResource EfficiencyMulti-modal ArchitecturesData IntegrationEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

AIReinforcement LearningResearchModel DevelopmentDecision MakingResource EfficiencyMulti-Modal ArchitecturesEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAI Model DevelopmentAdvanced Model ArchitecturesResource-Efficient ModelsMulti-Modal ArchitecturesDecision Optimization

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAI Model DevelopmentAdvanced Model ArchitecturesResource-Efficient SystemsMulti-Modal ArchitecturesDomain-Specific CapabilitiesResearch-Driven ApproachEnglish CommunicationFintech InnovationGlobal Collaboration

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAdvanced Model ArchitecturesResource-Efficient ModelsMulti-Modal ArchitecturesData IntegrationEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

AIReinforcement LearningModel DevelopmentAdvanced ModelsResource-Efficient SystemsMulti-Modal ArchitecturesEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Engineer - Reinforcement Learning (100% Remote)

Confidential

Remote job permanent

Reinforcement LearningAI Model DevelopmentAdvanced Model ArchitecturesResource-Efficient SystemsMulti-Modal ArchitecturesEnglish Communication

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

January 30, 2026 View Details

AI Research Scientist - Reinforcement Learning

Snowflake

Menlo Park, California, United States permanent

PythonTensorFlowPyTorchReinforcement LearningLarge Language ModelsRLHFDPOPPOMulti-agent SystemsPost-training Optimization

Snowflake is about empowering enterprises to achieve their full potential — and people too. With a culture that’s all in on impact, innovation, and collaboration, Snowflake is the sweet spot for build...

September 6, 2025 View Details

Staff/Senior Reinforcement Learning Engineer (Industrial Assembly)

Intrinsicrobotics

Munich, Germany; Singapore (Munich (DE-MUC-ARP), Singapore (SG-SIN-FUS)) permanent

Reinforcement LearningJAXPythonC++TensorFlowPyTorchRoboticsIndustrial AssemblyProduction SystemsDiffusion Policies

Intrinsic is Alphabet’s bet aiming to reimagine the potential of industrial robotics. Our team believes that advances in AI, perception and simulation will redefine what’s possible for industrial robo...

December 19, 2025 View Details

Senior Reinforcement Learning Engineer

Apptronik

Austin, TX (HQ) Remote permanent

Reinforcement LearningPythonC++SimulatorsDistributed trainingRoboticsComputer HardwareMentoringCode-ReviewsSystemAnalysis

Apptronik is building robots for the real world to improve human quality of life and to help solve the ever-increasing labor shortage problem. Our team has been building some of the most advanced robo...

December 11, 2025 View Details

Research Scientist Intern, Reinforcement Learning

Wayve

London (London, United Kingdom) permanent

PhDMastersReinforcement LearningResearchSynthetic DataRepresentation LearningOffline RLTemporal Credit AssignmentPolicy Optimization

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief...

December 11, 2025 View Details

Machine Learning Engineer, Reinforcement Learning & Reward Modeling

Wayve

Vancouver (Vancouver, Canada) Remote permanent

MachineLearningReinforcementLearningreward modelingLargeScaleSystemsDataProcessingAgreementsData AnnotationLean Tools DevelopmentRewardModelBuildingDatabaseIntegrationAblationStudies

At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief...

December 11, 2025 View Details

Member of Technical Staff - Macrohard / Computer Control - Reinforcement Learning

Xai

Palo Alto, CA permanent

Reinforcement LearningComputer VisionLLM AgentsSoftware DevelopmentTestingData AcquisitionEvaluation SuitesProduct ExperienceResearchCommunication

About xAI xAI’s mission is to create AI systems that can accurately understand the universe and aid humanity in its pursuit of knowledge. Our team is small, highly motivated, and focused on engineeri...

December 10, 2025 View Details

Masterarbeit (w/m/d) - Reinforcement Learning basierte Flugregelung

Deutsches Zentrum für Luft- und Raumfahrt e. V. (DLR)

Location not specified

IT/SoftwareProgramming languagesPythonActor-CriticReinforcement LearningSimulationUncertaintyGaussian ProcessesSafety FilterUncertainty Estimation

Kennziffer: 3616 Arbeitsort: Braunschweig, Cochstedt Eintrittsdatum: April 2026 Karrierestufe: Studien- & Abschlussarbeit Beschäftigungsgrad: Teilzeit, Vollzeit Dauer der Beschäftigung: nach Absprache...

December 9, 2025 View Details

AI Research Engineer - Reinforcement Learning

Helsing

Munich - Berlin - London - Paris (Berlin, London, Munich, Paris) Remote permanent

ProgrammingLanguages_CPythonRustJavaC++ReinforcementLearningMultiAgentSystemsDeepLearningDataEngineeringProductionDeployment

Who we are Helsing is a defence AI company. Our mission is to protect our democracies. We aim to achieve technological leadership, so that open societies can continue to make sovereign decisions and ...

December 8, 2025 View Details

Researcher - Reinforcement Learning

Confidential

Edmonton, Alberta, Canada permanent

Reinforcement LearningLarge Language ModelsLLMRLHFGRPOReward-free MethodsAgentic Reinforcement LearningAgentic EvaluationMulti-turn TasksPythonPyTorchDeepSpeed

Huawei Canada has an immediate 12-month contract opening for a Reinforcement Learning Researcher. About the team: Founded in 2012, the Noah’s Ark lab has evolved into a prominent research organizati...

January 30, 2026 View Details

Senior Reinforcement Learning Engineer, Helix

Figureai

San Jose, CA (HQ) permanent

PyTorchReinforcement LearningPPOSACSHyperparameter TuningDomain RandomizationCurriculum knowledgereward modelingTensorBoardWeights&Biases

Figure is an AI Robotics company developing a general purpose humanoid. Our Humanoid is designed for corporate tasks targeting labor shortages and jobs that are undesirable or unsafe. We are based in ...

November 14, 2025 View Details

PhD Autonomy Engineer Intern - Planning & Controls (Reinforcement Learning)

Skydio

Zurich, Switzerland (HQ) Remote permanent

Reinforcement LearningPythonPyTorchJAXRay RLlibC++CUDARoboticsSim2Real Techniques

Skydio is the leading US drone company and the world leader in autonomous flight, the key technology for the future of drones and aerial mobility. The Skydio team combines deep expertise in artificial...

October 1, 2025 View Details

Logistik-Automatisierung

Reinforcement Learning für adaptive Steuerung

Markkleeberg, Sachsen ‐ Freelance ‐ Onsite freelance

Technical AutomationSimulationComputer-Aided Design3D Data VisualizationAgile Project MethodsAIAlgorithmsC++Continuous IntegrationPythonScrumRobotics

Keywords Automation Simulations Computer-Aided Design 3D Visualization Agile Methodology Artificial Intelligence Algorithms C++ (Programming Language) Continuous Integration Recruitment Python (Progra...

September 3, 2025 View Details

AI Research Engineer, Reinforcement Learning

Confidential

Palo Alto, California, United States permanent

PythonC++PyTorchIsaac SimMuJoCoReinforcement LearningSimulationSim-to-RealProduct Deployment

AI Research Engineer, Reinforcement Learning | AI & Robotics Location: Palo Alto, CA (on-site) About 1X We build humanoid robots that work alongside people to solve labor shortages and create abunda...

January 30, 2026 View Details

Latest Job Openings