Jobs

Research Intern – Reinforcement Learning (RL)

Levelai

Noida permanent

Computer Science, Reinforcement Learning, RL Environments, Training Curriculum, Probability, Maths, Optimization, Agent-Based Systems, Large-Scale Datasets

🚀 Build the next generation of Agentic AI with us Our platform combines conversation intelligence, multimodal understanding, and agentic AI systems to power both human agents and autonomous AI agent...

March 31, 2026

Research Intern – Reinforcement Learning (RL) - Onsite

Levelai

Bay Area, California permanent

Computer Science, Reinforcement Learning, RL environments, PGL Workshops, Probability, Math, Optimization, Agent-based systems, Large-scale datasets, Reinforcement Learning fundamentals

🚀 Build the next generation of Agentic AI with us Our platform combines conversation intelligence, multimodal understanding, and agentic AI systems to power both human agents and autonomous AI agent...

March 28, 2026

AI Engineer (Reinforcement Learning)

Robotec.ai

Warsaw, Masovian Voivodeship, Poland Hybrid permanent

AI, Reinforcement Learning, Robotics, Simulation, Computer Vision, Deep Learning, Machine Learning, Python, PyTorch, Git

Robotec.ai is a software company that develops hi-tech solutions for robotics and automotive industries. We help our customers build state-of-the-art robotic simulations and testing tools to ensure th...

March 27, 2026

Reinforcement Learning Research Engineer – Exploration & Decision Intelligence (m/f/d)

Autonomous Teaming

Munich (DEU) permanent

Reinforcement Learning, Theory, Practice, Quick Iteration, Productionizing ML infrastructure, Numerical Simulation, Gazebo, Policy Gradients, Q-Learning, IsaacSim, Mujoco, Carla

What we offer: • Opportunity to work on a new solution from scratch in a technical complex environment • Work in an international, agile, cross-functional team creating the future of autonomous system...

March 20, 2026

Senior/Staff Deep Reinforcement Learning Engineer

Doordashusa

San Francisco, CA (San Francisco) Remote permanent

GPU Deep Learning, JAX, Policy Design, Simulation, Distributed Training, GPU Acceleration, Autonomous Driving, Production Deployment, Reward Shaping, Model-Based RL

About the Team Our DD Labs team builds real-time autonomous delivery systems. The Planning & Decision-Making group is investing heavily in deep reinforcement learning to move beyond classical pla...

March 25, 2026

Attack Path Discovery with Reinforcement Learning (F/M)

Thales

Gennevilliers permanent

Reinforcement Learning, Cybersecurity, Threat Modeling, SysML, Python, Docker, Git, BloodHound, MITRE D3FEND

Location: Gennevilliers, France. Let's build a future we can all trust, together. Thales is a global high-technology leader specializing in three business sectors: Defense & Security, Aeronau...

March 25, 2026

Reinforcement Learning Infrastructure (Cybersecurity)

Bugcrowd

Remote - US (Remote US) Remote permanent

Reinforcement Learning, Systems Engineering, Security, Fuzzing, Containerization, Linux Environments, Python, Rust, Systems Architecture, AI Integration, Binary Exploitation

We are Bugcrowd. Since 2012, we’ve been empowering organizations to take back control and stay ahead of threat actors by uniting the collective ingenuity and expertise of our customers and trusted all...

March 16, 2026

Research Scientist, Reinforcement Learning

Deepmind

London, UK Remote permanent

Machine Learning, Reinforcement Learning, Algorithm Development, Experiment Design, Data Analysis, Collaboration, Code Implementation, Scientific Communication, Infrastructure Management, First-Principles Thinking

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportuni...

March 14, 2026

AI Research Engineer - Reinforcement Learning (100% remote Worldwide)

Confidential

Remote job permanent

AI, Reinforcement Learning, Model Development, Resource Efficiency, The Composable Architecture

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

March 11, 2026

Master Thesis: Building an Uncertainty-Robust Reinforcement Learning-based model for UAV self-separation under Uncertainty

Confidential

Amsterdam, Noord-Holland, Nederland permanent

Python, ML, RL, Simulation, Aerospace, Control Systems

Background The autonomous operation of unmanned aerial vehicles (UAVs) plays an increasingly important role in research and commercial applications. These vehicles can assist with crucial application...

March 6, 2026

Reinforcement Learning Engineer

MLabs

New York, New York, United States permanent

Production Experience, Risk Management, Systems Adaptability, Technical Leadership, Objective Function Design, Validation Frameworks, Evaluation Loop Mastery

Reinforcement Learning (RL) Engineer Location: New York (Office) On-site | Full-time Compensation: Competitive Our client is an elite development firm and a high-growth software company responsibl...

March 5, 2026

Research Engineer - Reinforcement Learning and Agentic AI (f/m/div.)

BoschGroup

Renningen, BW, Germany Hybrid permanent

Deep Reinforcement Learning, Agentic AI, Agentic GenAI systems, Machine Learning Systems, Data-driven Approaches, Generative Models, Encryption Algorithms, Safe RL, Offline RL, Multi-modal Large Models

At Bosch, we shape the future by inventing high-quality technologies and services that spark enthusiasm and enrich people’s lives. Our promise to our associates is rock-solid: We grow together, we enj...

March 4, 2026

AI/ML Scientist Intern (Simulation Optimization and Reinforcement Learning)

TWG Global AI

Santa Monica, California, United States temporary

Python, Time Series Forecasting, Reinforcement Learning, Optimization, Conversational AI, AI Reasoning, Documentation, Cross-functional Collaboration, Emerging Technologies

The Organization At TWG Group Holdings, LLC (“TWG Global”), we drive innovation and business transformation across a diverse portfolio—including investments, finance, insurance, and media—by leveragi...

March 3, 2026

Research Scientist, Science of Post-Training and Reinforcement Learning

Deepmind

London, UK permanent

Research, Implementation, Experimentation, Evaluation, Communication, Collaboration, First-principles, PhD

Snapshot We are starting a small team aimed at building a real science of post-training for agents. This involves reinforcement learning for LLM-based systems, rigorous experimentation, and a focus o...

March 3, 2026

Reinforcement Learning and World Model for Autonomous Driving Intern - 2026

NVIDIA

2 Locations permanent

Reinforcement Learning, Simulation Models, State-of-the-art Algorithms, Neural Rendering, Robotics, High-throughput data pipelines, Publishing, Multi-Modal World Simulation Model, Model-Centric RL, World Simulation

We are in search of a hardworking intern with expertise in Reinforcement Learning and Multi-Modal World Simulation Model to propel the evolution of ML-centric autonomous driving and Physical AI soluti...

March 3, 2026

Senior AI Researcher - Reinforcement Learning (f/m/d)

AlephAlpha

Heidelberg, Baden-Württemberg, Germany Remote permanent

Reinforcement Learning, Large-scale Training, LLM Training, Distributed Algorithms, Statistical Evaluation, Experiment Design, Cross-functional Collaboration, Model Improvement, Innovation Exploration, Production Implementation

Our Mission Aleph Alpha is one of the few companies in Europe with end-to-end in-house model development including pre- and post-training. We’re building models that have general-purpose capabilities...

March 3, 2026

Senior Expert/VP Reinforcement Learning

Confidential

Munich Hybrid permanent

Reinforcement Learning, AI Testing, AI Validation, Embedded AI Inference, Trust Development, Proximal Policy Optimization, Requirements Gathering, Stakeholder Communication, Solution Scoping, Bayesian Machine Learning, Probabilistic Models

Resaro builds advanced AI testing software to help organizations verify, validate, and trust their most critical AI systems — from computer vision to generative AI and autonomous systems. Our mission ...

March 1, 2026

Senior Machine Learning / Reinforcement Learning Engineer

Sleek

India Remote permanent

Machine Learning, Reinforcement Learning, Production-Grade Systems, Domain Model, Agentic Systems, Method Optimization, Model Evaluation, Deployment, Production Feedback, Efficiency, Controls Architecture, Test-Time Reinforcement Learning

Through proprietary software and AI, along with a focus on customer delight, Sleek makes the back-office easy for micro SMEs. We give Entrepreneurs time back to focus on what they love doing - growin...

February 5, 2026

Senior Reinforcement Learning Engineer

Apptronik

Austin, TX (HQ) Remote permanent

Reinforcement Learning, Simulation, Building Physics Software, Distributed Training, C++, Python, Motion Retargeting, Robotics, Collaboration, State-of-the-art Algorithms

Apptronik is a human-centered robotics company developing AI-powered robots to support humanity in every facet of life. Our flagship humanoid robot, Apollo, is built to collaborate thoughtfully with p...

February 26, 2026

Reinforcement Learning Engineer

Apptronik

Austin, TX (HQ) Remote permanent

Reinforcement Learning, PyTorch, JAX, MuJoCo, Python, C++, Distributed Training, Robotics, IsaacGym, Motion Retargeting

Apptronik is a human-centered robotics company developing AI-powered robots to support humanity in every facet of life. Our flagship humanoid robot, Apollo, is built to collaborate thoughtfully with p...

February 26, 2026

Machine Learning Engineer, Reinforcement Learning

Doordashusa

San Francisco, CA; Sunnyvale, CA; Seattle WA (San Francisco) Remote permanent

Applied Machine Learning, Reinforcement Learning, Multi-armed Bandits, Contextual Bandits, Data Informed Insights, Scalable Solutions, User-Centric Design, Production Machine Learning, Markov Decision Processes, Deep Reinforcement Learning

About the Team Come help us build the world's most reliable local e-commerce platform for on-demand last-mile grocery and retail delivery! We're looking for an experienced senior machine learning eng...

February 26, 2026

Senior Research Scientist - Reinforcement Learning, MoEs

Canva

London, United Kingdom Hybrid permanent

Reinforcement Learning, Scalable Vision, Post-training, Agent Systems, Planning, Retrieval, Data Pipelines, Experiment Design, Mixture of Experts, Multimodal Tool Use

Join the team redefining how the world experiences design Hiya, g'day, mabuhay, kia ora, 你好, hallo, vítejte! Thanks for stopping by. We know job hunting can be a little time consuming and you're pro...

February 25, 2026

Senior Research Scientist - Reinforcement Learning, MoEs

Canva

Vienna, Vienna, Austria Hybrid permanent

Reinforcement Learning, Agentic Systems, Post-Training, Demand Modeling, Training Loops, Data Mapping and Transformation, Experiment Design, Scalable Systems, Product Alignment, Mixture of Experts

Join the team redefining how the world experiences design. Servus, hey, g'day, mabuhay, kia ora, 你好, hallo, vítejte! Thanks for stopping by. We know job hunting can be a little time consuming and yo...

February 25, 2026

Sr Data Scientist - Mar Tech (Applied ML, Recommender Systems, Reinforcement Learning)

Target

7000 Target Pkwy N,NCD-0375 Brooklyn Park,MN 55445 permanent

Data Science, GPU-based Machine Learning, Machine Learning, Model Development, Software Development, AI Model Integration, Guest Card Management, Team Collaboration, Retail Domain Knowledge, GenAI Techniques

The pay range is $98,000.00 - $176,000.00 Pay is based on several factors which vary based on position. These include labor markets and in some instances may include education, work experience and ce...

February 25, 2026

Researcher - Reinforcement Learning & LLM

Confidential

Montréal, Quebec, Canada permanent

Research, Reinforcement Learning, Large Language Models, Graph Learning, Continual Learning, Scientific Publications, English Communication, Agentic Self-Improvement

Huawei Canada has an immediate 12-month contract opening for a Researcher. About the team: Welcome to the Advanced Wireless Technology Wireless Lab, an epitome of innovation located in Ottawa, Canad...

February 25, 2026

Reinforcement Learning Research Engineer – Exploration & Decision Intelligence (m/f/d)

Autonomous Teaming

Munich (DEU) permanent

Python proficiency, Simulation-based learning, Math and statistics foundation, Publications at top conferences

Your mission: • Research and prototype novel RL algorithms (e.g. exploration, POMDPs, multi-agent systems) • Design and implement use-cases for DRL on edge devices • Translate theory into scalable s...

February 12, 2026

Reinforcement Learning Engineer

Code Metal

San Francisco, California, United States Remote permanent

Distributed Training, PyTorch, Data Curation, Quality Assurance, Orchestration Tools, Reinforcement Learning, Evaluation Frameworks, Code Generation

At Code Metal AI, you’ll be part of a world class team with talent from MIT, OpenAI and other top companies, focused on pioneering work in large language models (LLMs) and code generation. Our project...

August 11, 2025

Senior Engineering Manager, Reinforcement Learning Environments (RLE)

Handshake

San Francisco, California, United States permanent

Leadership, Team Management, Engineering Management, Technical Direction, Roadmap Development, Platform Architecture, Scalable Systems, Plugins development, Reliability, Observability, Performance, Engineering Best Practices

About Handshake Handshake is the career network for the AI economy. 20 million knowledge workers, 1,600 educational institutions, 1 million employers (including 100% of the Fortune 50), and every fou...

February 18, 2026

Data Scientist - Reinforcement Learning (F/M)

Thales

Issy-les-Moulineaux Hybrid permanent

Data Scientist, Reinforcement Learning, Machine Learning, AI, Cybersecurity, Cloud, Innovation

Location: Issy-les-Moulineaux, France. Let's build a future we can all trust, together. Thales is a global high-technology leader specializing in three business sectors: Defense & Security, A...

February 16, 2026

Research Engineer - Reinforcement Learning, Self-Driving

Appliedintuition

Sunnyvale, California, United States (Sunnyvale) Remote permanent

Reinforcement Learning, Proof-of-Concept Driving, Autonomous Systems, Robotics, AI, Large-scale Simulation, Data Access, System Deployment, Reinforcement Learning Infrastructure

About Applied Intuition Applied Intuition, Inc. is powering the future of physical AI. Founded in 2017 and now valued at $15 billion, the Silicon Valley company is creating the digital infrastructure...

February 13, 2026

Research Intern - Reinforcement Learning, Robotics

Appliedintuition

Sunnyvale, California, United States (Sunnyvale) Remote permanent

Reinforcement Learning, Robotics, Research, Data Analysis, Python, Machine Learning, Programming, Mathematics, Algorithms, Problem Solving

About Applied Intuition Applied Intuition, Inc. is powering the future of physical AI. Founded in 2017 and now valued at $15 billion, the Silicon Valley company is creating the digital infrastructure...

February 12, 2026

Research Engineer, Machine Learning (Reinforcement Learning)

Anthropic

London, UK Remote permanent

Reinforcement Learning, Machine Learning, Code Generation, Inference Frameworks, Agentic AI Workflow, Tool Use, Performance Optimization, Distributed Systems, Automated Testing Frameworks, Autonomous Software Generation

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickl...

February 11, 2026

Research Scientist - Reinforcement Learning (Shanghai)_CR

BoschGroup

Shanghai, Shanghai, China permanent

Python, PyTorch, Deep Learning, Reinforcement Learning, Optimization, English Fluency, Doctorate degree in Psychology

Success stories don’t just happen. They are made. Openness, tolerance and integrity shape our work climate, which promote the efficiency of every employee. Strengthen our innovative power by acceptin...

March 6, 2025

Research Scientist - Reinforcement Learning (Beijing)_CR

BoschGroup

Beijing, Beijing, China permanent

Python, PyTorch, Deep Learning, Reinforcement Learning, Optimization, English Fluency, PhD, Publication

Do you want beneficial technologies being shaped by your ideas? Whether in the areas of mobility solutions, consumer goods, industrial technology or energy and building technology - with us, you will ...

March 17, 2025

Energy & Materials Intern - Generative Models and Reinforcement Learning

Tri

Los Altos, CA Hybrid internship

Computer Science, Materials Science, Diffusion Models, Reinforcement Learning, High-Performance Computing, Data Pipelines

At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience. To lead this transformative sh...

February 3, 2026

Senior Machine Learning Engineer, Reinforcement Learning

Pathrobotics

Columbus, Ohio (Path Robotics) permanent

Machine Learning, Reinforcement Learning, Python, PyTorch, TensorFlow, Simulation Environments, MuJoCo, Isaac Gym, Optimization

Build the Path Forward At Path Robotics, we’re building the future of embodied intelligence. Our AI-driven systems enable robots to adapt, learn, and perform in the real world closing the skilled lab...

January 23, 2026

Full Stack Software Engineer, Reinforcement Learning

Anthropic

San Francisco, CA Hybrid permanent

Full-Stack Development, Web Development, API Development, Data Collection Platforms, Observability Systems, Vendor Interfaces, Human Data Collection, Quality Assurance, Feedback Mechanisms, Evaluation Dashboards

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickl...

January 30, 2026

AI Scientist (Reinforcement Learning)

Confidential

Singapore Hybrid permanent

Reinforcement Learning, Bayesian Machine Learning, Trust Region methods, Policy Optimization, Automated Testing, Red-team Testing, Multi-Agent RL, Python, NumPy

Resaro was founded on the belief that AI will change the world in ways we cannot even imagine, but every new technology needs safeguards to advance. About the Role We are looking for an AI Scientist...

January 30, 2026

Reinforcement Learning Engineer

Batoncorporation

New York, New York, US permanent

Reinforcement Learning, Production Systems, Risk Management, Policy Evaluation, Simulation, Data Modeling, Autonomous Systems, System Deployment, Tradeoff Analysis

Who We Are Baton Corporation is the development company that builds and operates the entire technology stack behind pump.fun, the largest memecoin launchpad in production today. The systems are low l...

January 20, 2026

Member of Technical Staff - Post Training, Reinforcement Learning

Liquid Ai

United States Remote permanent

Python, PyTorch, Reinforcement Learning, Optimization, DeepSpeed, FSDP, vLLM, Data Pipelines, Open-source

Work With Us At Liquid, we’re not just building AI models—we’re redefining the architecture of intelligence itself. Spun out of MIT, our mission is to build efficient AI systems at every scale. Our L...

November 7, 2025

Reinforcement Learning Engineering Intern

Persona.ai

Pensacola, Florida, United States internship

Python, C/C++, Machine Learning, Reinforcement Learning, PyTorch, Physics Simulators, Controls Software, Reinforcement Learning Algorithms, Robot Hardware, Vision or Localization

Persona AI is developing and commercializing rugged, multi-purpose humanoid robots that perform real work. Persona’s founding team has a decades-long history in humanoid robotics, bionics, and product...

January 8, 2026

Reinforcement learning engineer

Dexmate

Santa Clara, California, United States permanent

Reinforcement Learning, Robotics, Python, PyTorch, TensorFlow, JAX, Robot Kinematics, GPU-based Simulation, Distributed Systems

Role Overview We're seeking Reinforcement Learning experts to develop and deploy cutting-edge RL algorithms that enhance our robots' capabilities. Responsibilities • Design and implement reinforcem...

January 19, 2026

Research Scientist, Reinforcement Learning

Basis Research

New York, New York, United States permanent

Reinforcement Learning, Planning, Model-Based RL, Exploration Strategies, Optimal Control, Bayesian Optimization, Neural-Symbolic Integration, Foundational AI Research, Modeling, Abstraction

About Basis Basis is a nonprofit applied AI research organization with two mutually reinforcing goals. The first is to understand and build intelligence. This means to establish the mathematical pri...

November 23, 2025

Machine Learning Engineer: Imitation and Reinforcement Learning for Robotics

Bedrock Robotics

San Francisco, California, United States permanent

Machine Learning, Deep Learning, PyTorch, Python, Systems Programming, Behavior Cloning, Reinforcement Learning, Data Ingestion, Model Deployment, Metrics Development

We’re looking for a Machine Learning Engineer with a focus on behavior learning, specifically data-driven behavior policies and robust data infrastructure. In this role, you'll be responsible for deve...

July 12, 2025

Senior Machine Learning Engineer (Reinforcement Learning)

Datatonic

Montreal, Quebec, Canada Remote permanent

Reinforcement Learning, Python, Machine Learning, Data Science, Model Optimization, Productionization, Communication, Consulting, Data Engineering, MLOps

Shape the Future of AI & Data with Us At Datatonic, we are Google Cloud's premier partner in AI, driving transformation for world-class businesses. We push the boundaries of technology with expertise...

November 10, 2025

Lead Engineer, Reinforcement Learning & Scenario Generation

Serverobotics

Redwood City, California, United States Remote permanent

Machine Learning, Reinforcement Learning, Simulation, Distributed Systems, Containerization, Curriculum Learning, Domain Randomization, Multi-Agent Systems, API Development, Debugging, 3D Asset Generation

At Serve Robotics, we’re reimagining how things move in cities. Our personable sidewalk robot is our vision for the future. It’s designed to take deliveries away from congested streets, make deliverie...

December 18, 2025

PhD Autonomy Engineer Intern - Planning & Controls (Reinforcement Learning)

Skydio

Zurich, Zurich, Switzerland Remote internship

Reinforcement Learning, Python, PyTorch, C++, Robotics Simulation, Sim2Real, Safety, Control Theory, Motion Planning, Collaboration

Skydio is the leading US drone company and the world leader in autonomous flight, the key technology for the future of drones and aerial mobility. The Skydio team combines deep expertise in artificial...

September 30, 2025

AI Engineer - Reinforcement Learning (Senior)

Rivr

Zurich permanent

Deep Learning, Reinforcement Learning, Supervised Learning, Self-Supervised Learning, Robotics, Autonomy, Manipulation, Sim-to-Real Transfer, Python, C++, Deep Neural Networks, Neural Network Architectures

RIVR is a Swiss robotics company pioneering Physical AI and robotic solutions to revolutionize last-mile delivery, giving 1 human the power of 1000. Through the combination of artificial neural networ...

August 28, 2024

Research Scientist - Reinforcement Learning

Ifm Us

Sunnyvale, CA permanent

Reinforcement Learning, Foundation Models, Self-Play, Agentic Tasks, Algorithm Design, Full-stack Engineering, Technical Publications, Open-source Community Engagement, Scalable Training Systems, Interdisciplinary Collaboration

About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next g...

July 31, 2025

Full Stack Reinforcement Learning (RL) Engineer Specialist - Freelance Project

Agency

Bosnia and Herzegovina (Worldwide - Remote) Remote permanent

Reinforcement Learning, Python, JavaScript, PyTorch, TensorFlow, OpenAI Gym, FastAPI, React, Distributed Systems, Experiment Tracking

What You’ll Do Support projects by designing and implementing reinforcement learning systems that bridge research and deployment. Work across the stack to contribute to both backend services and front...

December 17, 2025
