MisuJob - AI Job Search Platform MisuJob

Jobs

Browse 250+ jobs updated daily

Latest Job Openings

Noida permanent
Computer ScienceReinforcement LearningRL EnvironmentsTraining CurriculumProbabilityMathsOptimizationAgent-Based SystemsLarge-Scale Datasets

🚀 Build the next generation of Agentic AI with us Our platform combines conversation intelligence, multimodal understanding, and agentic AI systems to power both human agents and autonomous AI agent...

March 31, 2026 View Details
Bay Area, California permanent
Computer ScienceReinforcement LearningRL environmentsPGL WorkshopsProbabilityMathOptimizationAgent-based systemsLarge-scale datasetsReinforcement Learning fundamentals

🚀 Build the next generation of Agentic AI with us Our platform combines conversation intelligence, multimodal understanding, and agentic AI systems to power both human agents and autonomous AI agent...

March 28, 2026 View Details
Warsaw, Masovian Voivodeship, Poland Hybrid permanent
AIReinforcement LearningRoboticsSimulationComputer VisionDeep LearningMachine LearningPythonPyTorchGit

Robotec.ai is a software company that develops hi-tech solutions for robotics and automotive industries. We help our customers build state-of-the-art robotic simulations and testing tools to ensure th...

March 27, 2026 View Details
Munich (DEU) permanent
Reinforcement LearningTheoryPracticeQuick IterationProductionizing ML infrastructureNumerical SimulationGazeboPolicy GradientsQ-LearningIsaacSimMujocoCarla

What we offer: • Opportunity to work on a new solution from scratch in a technical complex environment • Work in an international, agile, cross-functional team creating the future of autonomous system...

March 20, 2026 View Details
San Francisco, CA (San Francisco) Remote permanent
GPU Deep LearningJAXPolicy DesignSimulationDistributed TrainingGPU AccelerationAutonomous DrivingProduction DeploymentReward ShapingModel-Based RL

About the Team Our DD Labs team builds real-time autonomous delivery systems. The Planning & Decision-Making group is investing heavily in deep reinforcement learning to move beyond classical pla...

March 25, 2026 View Details
Gennevilliers permanent
Reinforcement LearningCybersecurityThreat ModelingSysMLPythonDockerGitBloodHoundMITRE D3FEND

Lieu : Gennevilliers, France Construisons ensemble un avenir de confiance Thales est un leader mondial des hautes technologies spécialisé dans trois secteurs d’activité : Défense & Sécurité, Aéronau...

March 25, 2026 View Details
Remote - US (Remote US) Remote permanent
Reinforcement LearningSystems EngineeringSecurityFuzzingContainerizationLinux EnvironmentsPythonRustSystems ArchitectureAI IntegrationBinary Exploitation

We are Bugcrowd. Since 2012, we’ve been empowering organizations to take back control and stay ahead of threat actors by uniting the collective ingenuity and expertise of our customers and trusted all...

March 16, 2026 View Details
London, UK Remote permanent
Machine LearningReinforcement LearningAlgorithm DevelopmentExperiment DesignData AnalysisCollaborationCode ImplementationScientific CommunicationInfrastructure ManagementFirst-Principles Thinking

At Google DeepMind, we value diversity of experience, knowledge, backgrounds and perspectives and harness these qualities to create extraordinary impact. We are committed to equal employment opportuni...

March 14, 2026 View Details
Remote job permanent
AIReinforcement LearningModel DevelopmentResource EfficiencyThe Composable Architecture

Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...

March 11, 2026 View Details
New York, New York, United States permanent
Production ExperienceRisk ManagementSystems AdaptabilityTechnical LeadershipObjective Function DesignValidation FrameworksEvaluation Loop Mastery

Reinforcement Learning (RL) Engineer Location: New York (Office) On-site | Full-time Compensation: Competitive Our client is an elite development firm and a high-growth software company responsibl...

March 5, 2026 View Details
Renningen, BW, Germany Hybrid permanent
Deep Reinforcement LearningAgentic AIAgentic GenAI systemsMachine Learning SystemsData-driven ApproachesGenerative ModelsEncryption AlgorithmsSafe RLOffline RLMulti-modal Large Models

At Bosch, we shape the future by inventing high-quality technologies and services that spark enthusiasm and enrich people’s lives. Our promise to our associates is rock-solid: We grow together, we enj...

March 4, 2026 View Details
Santa Monica, California, United States temporary
PythonTime Series ForecastingReinforcement LearningOptimizationConversational AIAI ReasoningDocumentationCross-functional CollaborationEmerging Technologies

The Organization At TWG Group Holdings, LLC (“TWG Global”), we drive innovation and business transformation across a diverse portfolio—including investments, finance, insurance, and media—by leveragi...

March 3, 2026 View Details
London, UK permanent
ResearchImplementationExperimentationEvaluationCommunicationCollaborationFirst-principlesPhD

Snapshot We are starting a small team aimed at building a real science of post-training for agents. This involves reinforcement learning for LLM-based systems, rigorous experimentation, and a focus o...

March 3, 2026 View Details
2 Locations permanent
Reinforcement LearningSimulation ModelsState-of-the-art AlgorithmsNeural RenderingRoboticsHigh-throughput data pipelinesPublishingMulti-Modal World Simulation ModelModel-Centric RLWorld Simulation

We are in search of a hardworking intern with expertise in Reinforcement Learning and Multi-Modal World Simulation Model to propel the evolution of ML-centric autonomous driving and Physical AI soluti...

March 3, 2026 View Details
Heidelberg, Baden Würtemberg, Germany Remote permanent
Reinforcement LearningLarge-scale TrainingLLM TrainingDistributed AlgorithmsStatistical EvaluationExperiment DesignCross-functional CollaborationModel ImprovementInnovation ExplorationProduction Implementation

Our Mission Aleph Alpha is one of the few companies in Europe with end-to-end in-house model development including pre- and post-training. We’re building models that have general-purpose capabilities...

March 3, 2026 View Details
Munich Hybrid permanent
Reinforcement LearningAI TestingAI ValidationEmbedded AI InferenceTrust DevelopmentProximal Policy OptimizationRequirements GatheringStakeholder CommunicationSolution ScopingBayesian Machine LearningProbabilistic Models

Resaro builds advanced AI testing software to help organizations verify, validate, and trust their most critical AI systems — from computer vision to generative AI and autonomous systems. Our mission ...

March 1, 2026 View Details
India Remote permanent
Machine LearningReinforcement LearningProduction-Grade SystemsDomain ModelAgentic SystemsMethod OptimizationModel EvaluationDeploymentProduction FeedbackEfficiencyControls ArchitectureTest-Time Reinforcement Learning

Through proprietary software and AI, along with a focus on customer delight, Sleek makes the back-office easy for micro SMEs. We give Entrepreneurs time back to focus on what they love doing - growin...

February 5, 2026 View Details
Austin, TX (HQ) Remote permanent
Reinforcement LearningSimulationBuilding Physics SoftwareDistributed TrainingC++PythonMotion RetargetingRoboticsCollaborationState-of-the-art Algorithms

Apptronik is a human-centered robotics company developing AI-powered robots to support humanity in every facet of life. Our flagship humanoid robot, Apollo, is built to collaborate thoughtfully with p...

February 26, 2026 View Details
Austin, TX (HQ) Remote permanent
Reinforcement LearningPyTorchJAXMuJoCoPythonC++Distributed TrainingRoboticsIsaacGymMotion Retargeting

Apptronik is a human-centered robotics company developing AI-powered robots to support humanity in every facet of life. Our flagship humanoid robot, Apollo, is built to collaborate thoughtfully with p...

February 26, 2026 View Details
San Francisco, CA; Sunnyvale, CA; Seattle WA (San Francisco) Remote permanent
Applied Machine LearningReinforcement LearningMulti-armed BanditsContextual BanditsData Informed InsightsScalable SolutionsUser-Centric DesignProduction Machine LearningMarkov Decision ProcessesDeep Reinforcement Learning

About the Team Come help us build the world's most reliable local e-commerce platform for on-demand last-mile grocery and retail delivery! We're looking for an experienced senior machine learning eng...

February 26, 2026 View Details
London, , United Kingdom Hybrid permanent
Reinforcement LearningScalable VisionPost-trainingAgent SystemsPlanningRetrievalData PipelinesExperiment DesignMixture of ExpertsMultimodal Tool Use

Join the team redefining how the world experiences design Hiya, g'day, mabuhay, kia ora, 你好, hallo, vítejte! Thanks for stopping by. We know job hunting can be a little time consuming and you're pro...

February 25, 2026 View Details
Vienna, Vienna, Austria Hybrid permanent
Reinforcement LearningAgentic SystemsPost-TrainingDemand ModelingTraining LoopsData Mapping and TransformationExperiment DesignScalable SystemsProduct AlignmentMixture of Experts

Join the team redefining how the world experiences design. Servus, hey, g'day, mabuhay, kia ora, 你好, hallo, vítejte! Thanks for stopping by. We know job hunting can be a little time consuming and yo...

February 25, 2026 View Details
7000 Target Pkwy N,NCD-0375 Brooklyn Park,MN 55445 permanent
Data ScienceGPU-based Machine LearningMachine LearningModel DevelopmentSoftware DevelopmentAI Model IntegrationGuest Card ManagementTeam CollaborationRetail Domain KnowledgeGenAI Techniques

The pay range is $98,000.00 - $176,000.00 Pay is based on several factors which vary based on position. These include labor markets and in some instances may include education, work experience and ce...

February 25, 2026 View Details
Montréal, Quebec, Canada permanent
ResearchReinforcement LearningLarge Language ModelsGraph LearningContinual LearningScientific PublicationsEnglish CommunicationAgentic Self-Improvement

Huawei Canada has an immediate 12-month contract opening for a Researcher. About the team: Welcome to the Advanced Wireless Technology Wireless Lab, an epitome of innovation located in Ottawa, Canad...

February 25, 2026 View Details
San Francisco, California, United States Remote permanent
Distributed TrainingPyTorchData CurationQuality AssuranceOrchestration ToolsReinforcement LearningEvaluation FrameworksCode Generation

At Code Metal AI, you’ll be part of a world class team with talent from MIT, OpenAI and other top companies, focused on pioneering work in large language models (LLMs) and code generation. Our project...

August 11, 2025 View Details
San Francisco, California, United States permanent
LeadershipTeam ManagementEngineering ManagementTechnical DirectionRoadmap DevelopmentPlatform ArchitectureScalable SystemsPlugins developmentReliabilityObservabilityPerformanceEngineering Best Practices

About Handshake Handshake is the career network for the AI economy. 20 million knowledge workers, 1,600 educational institutions, 1 million employers (including 100% of the Fortune 50), and every fou...

February 18, 2026 View Details
Issy-les-Moulineaux Hybrid permanent
Data ScientistReinforcement LearningMachine LearningAICybersecurityCloudInnovation

Lieu : Issy-les-Moulineaux, France Construisons ensemble un avenir de confiance Thales est un leader mondial des hautes technologies spécialisé dans trois secteurs d’activité : Défense & Sécurité, A...

February 16, 2026 View Details
Sunnyvale, California, United States (Sunnyvale) Remote permanent
Reinforcement LearningProof-of-Concept DrivingAutonomous SystemsRoboticsAILarge-scale SimulationData AccessSystem DeploymentReinforcement Learning Infrastructure

About Applied Intuition Applied Intuition, Inc. is powering the future of physical AI. Founded in 2017 and now valued at $15 billion, the Silicon Valley company is creating the digital infrastructure...

February 13, 2026 View Details
Sunnyvale, California, United States (Sunnyvale) Remote permanent
Reinforcement LearningRoboticsResearchData AnalysisPythonMachine LearningProgrammingMathematicsAlgorithmsProblem Solving

About Applied Intuition Applied Intuition, Inc. is powering the future of physical AI. Founded in 2017 and now valued at $15 billion, the Silicon Valley company is creating the digital infrastructure...

February 12, 2026 View Details
London, UK Remote permanent
Reinforcement LearningMachine LearningCode GenerationInference FrameworksAgentic AI WorkflowTool UsePerformance OptimizationDistributed SystemsAutomated Testing FrameworksAutonomous Software Generation

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickl...

February 11, 2026 View Details
Shanghai, Shanghai, China permanent
PythonPyTorchDeep LearningReinforcement LearningOptimizationEnglish FluencyDoctorate degree in Psychology

Success stories don’t just happen. They are made. Openness, tolerance and integrity shape our work climate, which promote the efficiency of every employee. Strengthen our innovative power by acceptin...

March 6, 2025 View Details
Beijing, Beijing, China permanent
PythonPyTorchDeep LearningReinforcement LearningOptimizationEnglish FluencyPhDPublication

Do you want beneficial technologies being shaped by your ideas? Whether in the areas of mobility solutions, consumer goods, industrial technology or energy and building technology - with us, you will ...

March 17, 2025 View Details
Los Altos, CA Hybrid internship
Computer ScienceMaterials ScienceDiffusion ModelsReinforcement LearningHigh-Performance ComputingData Pipelines

At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience. To lead this transformative sh...

February 3, 2026 View Details
Columbus, Ohio (Path Robotics) permanent
Machine LearningReinforcement LearningPythonPyTorchTensorFlowSimulation EnvironmentsMuJoCoIsaac GymOptimization

Build the Path Forward At Path Robotics, we’re building the future of embodied intelligence. Our AI-driven systems enable robots to adapt, learn, and perform in the real world closing the skilled lab...

January 23, 2026 View Details
San Francisco, CA Hybrid permanent
Full-Stack DevelopmentWeb DevelopmentAPI DevelopmentData Collection PlatformsObservability SystemsVendor InterfacesHuman Data CollectionQuality AssuranceFeedback MechanismsEvaluation Dashboards

About Anthropic Anthropic’s mission is to create reliable, interpretable, and steerable AI systems. We want AI to be safe and beneficial for our users and for society as a whole. Our team is a quickl...

January 30, 2026 View Details
Singapore Hybrid permanent
Reinforcement LearningBayesian Machine LearningTrust Region methodsPolicy OptimizationAutomated TestingRed-team TestingMulti-Agent RLPythonNumPy

Resaro was founded on the belief that AI will change the world in ways we cannot even imagine, but every new technology needs safeguards to advance. About the Role We are looking for an AI Scientist...

January 30, 2026 View Details
New York, New York, US permanent
Reinforcement LearningProduction SystemsRisk ManagementPolicy EvaluationSimulationData ModelingAutonomous SystemsSystem DeploymentTradeoff Analysis

Who We Are Baton Corporation is the development company that builds and operates the entire technology stack behind pump.fun, the largest memecoin launchpad in production today. The systems are low l...

January 20, 2026 View Details
United States Remote permanent
PythonPyTorchReinforcement LearningOptimizationDeepSpeedFSDPvLLMData PipelinesOpen-source

Work With Us At Liquid, we’re not just building AI models—we’re redefining the architecture of intelligence itself. Spun out of MIT, our mission is to build efficient AI systems at every scale. Our L...

November 7, 2025 View Details
Pensacola, Florida, United States internship
PythonC/C++Machine LearningReinforcement LearningPytorchPhysics SimulatorsControls SoftwareReinforcement Learning AlgorithmsRobot HardwareVision or Localization

Persona AI is developing and commercializing rugged, multi-purpose humanoid robots that perform real work. Persona’s founding team has a decades-long history in humanoid robotics, bionics, and product...

January 8, 2026 View Details
Santa Clara, California, United States permanent
Reinforcement LearningRoboticsPythonPyTorchTensorFlowJAXRobot KinematicsGPU-based SimulationDistributed Systems

Role Overview We're seeking Reinforcement Learning experts to develop and deploy cutting-edge RL algorithms that enhance our robots' capabilities. Responsibilities • Design and implement reinforcem...

January 19, 2026 View Details
New York, New York, United States permanent
Reinforcement LearningPlanningModel-Based RLExploration StrategiesOptimal ControlBayesian OptimizationNeural-Symbolic IntegrationFoundational AI ResearchModelingAbstraction

About Basis Basis is a nonprofit applied AI research organization with two mutually reinforcing goals. The first is to understand and build intelligence. This means to establish the mathematical pri...

November 23, 2025 View Details
San Francisco, California, United States permanent
Machine LearningDeep LearningPyTorchPythonSystems ProgrammingBehavior CloningReinforcement LearningData IngestionModel DeploymentMetrics Development

We’re looking for a Machine Learning Engineer with a focus on behavior learning, specifically data-driven behavior policies and robust data infrastructure. In this role, you'll be responsible for deve...

July 12, 2025 View Details
Montreal, Quebec, Canada Remote permanent
Reinforcement LearningPythonMachine LearningData ScienceModel OptimizationProductionizationCommunicationConsultingData EngineeringMLOps

Shape the Future of AI & Data with Us At Datatonic, we are Google Cloud's premier partner in AI, driving transformation for world-class businesses. We push the boundaries of technology with expertise...

November 10, 2025 View Details
Redwood City , California , United States Remote permanent
Machine LearningReinforcement LearningSimulationDistributed SystemsContainerizationCurriculum LearningDomain RandomizationMulti-Agent SystemsAPI DevelopmentDebugging3D Asset Generation

At Serve Robotics, we’re reimagining how things move in cities. Our personable sidewalk robot is our vision for the future. It’s designed to take deliveries away from congested streets, make deliverie...

December 18, 2025 View Details
Zurich, Zurich, Switzerland Remote internship
Reinforcement LearningPythonPyTorchC++Robotics SimulationSim2RealSafetyControl TheoryMotion PlanningCollaboration

Skydio is the leading US drone company and the world leader in autonomous flight, the key technology for the future of drones and aerial mobility. The Skydio team combines deep expertise in artificial...

September 30, 2025 View Details
Zurich permanent
Deep LearningReinforcement LearningSupervised LearningSelf-Supervised LearningRoboticsAutonomyManipulationSim-to-Real TransferPythonC++Deep Neural NetworksNeural Network Architectures

RIVR is a Swiss robotics company pioneering Physical AI and robotic solutions to revolutionize last-mile delivery, giving 1 human the power of 1000. Through the combination of artificial neural networ...

August 28, 2024 View Details
Sunnyvale, CA permanent
Reinforcement LearningFoundation ModelsSelf-PlayAgentic TasksAlgorithm DesignFull-stack EngineeringTechnical PublicationsOpen-source Community EngagementScalable Training SystemsInterdisciplinary Collaboration

About the Institute of Foundation Models We are a dedicated research lab for building, understanding, using, and risk-managing foundation models. Our mandate is to advance research, nurture the next g...

July 31, 2025 View Details
Bosnia and Herzegovina (Worldwide - Remote) Remote permanent
Reinforcement LearningPythonJavaScriptPyTorchTensorFlowOpenAI GymFastAPIReactDistributed SystemsExperiment Tracking

What You’ll Do Support projects by designing and implementing reinforcement learning systems that bridge research and deployment. Work across the stack to contribute to both backend services and front...

December 17, 2025 View Details