ARCHIVED
This job listing has been archived and is no longer accepting applications.
MisuJob - AI Job Search Platform MisuJob

Senior Expert/VP Reinforcement Learning 

Confidential

Munich Hybrid permanent

Posted: March 1, 2026

Interested in this position?

Create a free account to apply with AI-powered matching

Quick Summary

Resaro builds advanced AI testing software to help organizations verify, validate, and trust their most critical AI systems — from computer vision to generative AI and autonomous systems.

Job Description

Resaro builds advanced AI testing software to help organizations verify, validate, and trust their most critical AI systems — from computer vision to generative AI and autonomous systems. Our mission is to ensure that AI technologies deployed in real-world, high-stakes environments are robust, explainable, and secure.

We work closely with our customers through embedded delivery teams who operate on-site or in close collaboration. These teams tailor solutions to specific mission needs, helping organizations — especially in the public safety and national security sectors — evaluate and improve the performance of their AI-enabled systems.

About the Role:

As the Senior Expert/ VP Reinforcement Learning, you will be the primary architect of our AI Test, Evaluation, Verification, and Validation (TEVV) product suite for reinforcement learning systems. You will lead the development of next-generation AI testing and assurance frameworks with applications in Autonomous Driving and Robotics. Your mission is to scale our capabilities in Reinforcement Learning, to ensure autonomous agents are safe, robust, and explainable in the field.

Key Responsibilities

Independently implement Resaro’s RL validation prototype to expose agent instability and vulnerability in a mission-critical and complex environment.

Scale, lead and mentor a global, cross-functional, high-performing team of AI researchers and engineers, drawing on experience steering organizations of 30+ experts.

Define the long-term vision and technical roadmap for RL TEVV, focusing on validating RL algorithms and learned policies in complex environments with mission-critical applications across system control, autonomous vehicles, and robotics.

Advance methods for learning probabilistic reward functions from human feedback (RLHF) to align AI behavior with mission goals.

Partner with Product Management to translate product vision, customer problems, and market opportunities into end‑to‑end solution architecture and technical roadmaps that support a product-led growth strategy.

Must-Have Skills and Experience

Master / Ph.D. in Robot Reinforcement Learning or a closely related field.

Proven track record in developing and implementing novel RL and ML algorithms, e.g. research or commercial implementation.

Demonstrated deep theoretical understanding of and practical experience with the RL framework, including bandit setting, (in-)finite horizon setting, on- and off-policy RL, and trust-region RL approaches.

Experience in Bayesian Machine Learning and probabilistic models.

Understanding of AI/ML/RL lifecycle and the state-of-the-art approaches and limitations  of testing and validating complex use cases.

Strong skills in requirements gathering, stakeholder communication, and solution scoping.

Nice-to-Have

Experience with fully differentiable deep learning for highly unstable systems.

Experience with Active Learning and RLHF.

Background in model compression and pruning for deploying large RL models onto edge devices.

Hands-on experience with Bayesian Meta-Learning to reduce training time and absolute error in complex models.

A strong portfolio of innovation, including multiple successful paper submissions at conferences like NeurIPS, ICML, ICLR, IROS, ICRA, CoRL, and a deep patent history (e.g., 17+ patents).

Experience spearheading global AI initiatives and delivering AI solutions for both B2G (Unmanned Systems) and B2B (IoT) sectors.

Demonstrated success in leading cross-functional teams to deliver technical solutions.

Knowledge of deployment constraints in high-security or classified environments.

Prior exposure or experience with directly engaging senior stakeholders from Director to C-suite level.

Prior security clearance at Government CONFIDENTIAL and above.

Why Join Resaro

Work on mission-critical AI systems in defence, aerospace, and public safety.

Help define the future of AI testing and assurance in real-world environments.

Collaborate with a tight-knit, expert team working at the intersection of AI, systems engineering, and policy.

Shape product direction while being close to the operational reality of AI deployments.

Resaro is an Equal Opportunity Employer. We respect each individual and support the diverse cultures, perspectives, skills and experiences within our teams.

Why Apply Through MisuJob?

AI-Powered Job Matching: MisuJob uses advanced artificial intelligence to analyze your skills, experience, and career goals. Our matching algorithm compares your profile against thousands of job requirements to find positions where you have the highest chance of success. This saves you hours of manual job searching and ensures you only see relevant opportunities.

One-Click Applications: Once you create your profile, applying to jobs is effortless. Your resume and cover letter are automatically tailored to highlight the most relevant experience for each position. You can apply to multiple jobs in minutes, not hours.

Career Intelligence: Beyond job matching, MisuJob provides valuable career insights. See how your skills compare to market demands, identify skill gaps to address, and understand salary benchmarks for your experience level. Make data-driven decisions about your career path.

Frequently Asked Questions

How do I apply for this position?

Click the "Register to Apply" button above to create a free MisuJob account. Once registered, you can apply with one click and track your application status in your dashboard.

Is MisuJob free for job seekers?

Yes, MisuJob is completely free for job seekers. Create your profile, get matched with jobs, and apply without any cost. We help you find your dream job without any hidden fees.

How does AI matching work?

Our AI analyzes your resume, skills, and experience to understand your professional profile. It then compares this against job requirements using natural language processing to calculate a match percentage. Higher matches mean better fit for the role.

Can I apply to jobs in other countries?

Absolutely. MisuJob features jobs from companies worldwide, including remote positions. Filter by location or look for remote opportunities to find jobs that match your preferences.

Ready to Apply?

Join thousands of job seekers using MisuJob's AI to find and apply to their dream jobs automatically.

Register to Apply