MisuJob - AI Job Search Platform MisuJob

Senior Machine Learning Engineer - Learned Planning/Reinforcement Learning

Torcrobotics

Remote - U.S, Ann Arbor, MI (Ann Arbor, MI , Remote - US) Remote permanent

Posted: April 6, 2026

Interested in this position?

Create a free account to apply with AI-powered matching

Quick Summary

A Senior Machine Learning Engineer is responsible for designing and implementing complex machine learning models, including reinforcement learning for autonomous driving

Job Description

About the Company

At Torc, we have always believed that autonomous vehicle technology will transform how we travel, move freight, and do business.

A leader in autonomous driving since 2007, Torc has spent over a decade commercializing our solutions with experienced partners. Now a part of the Daimler family, we are focused solely on developing software for automated trucks to transform how the world moves freight.

Join us and catapult your career with the company that helped pioneer autonomous technology, and the first AV software company with the vision to partner directly with a truck manufacturer.

Meet the Team
As a Senior Machine Learning Engineer – Learned Planner / Reinforcement Learning, you will develop and deploy machine learning models that drive decision-making for autonomous trucks. Working closely with teams across perception, prediction, planning, and safety, you will build learned behavior systems that enable safe, efficient, and human-like driving in real-world freight environments.

This role focuses on owning model development and delivery for scoped problem areas, contributing to architecture decisions, and driving improvements in model performance, reliability, and iteration speed within the autonomy stack.

What You’ll Do

• Design, develop, and deploy learned behavior models using approaches such as reinforcement learning, behavior cloning, and imitation learning

• Own end-to-end model development for scoped problem areas, from data ingestion and training to evaluation and deployment

• Write production-quality ML code to support scalable training, evaluation, and inference workflows

• Analyze model performance, identify failure modes, and iterate to improve robustness and generalization across driving scenarios

• Contribute to training pipelines, data workflows, and infrastructure, including working with large-scale datasets from simulation, fleet logs, and on-vehicle data

• Collaborate with simulation, validation, and autonomy teams to test and evaluate learned behavior models across diverse environments

• Support integration of learned planning models into simulation and validation frameworks, enabling faster iteration and improved coverage

• Contribute to model architecture discussions and technical decision-making within the team

• Mentor junior engineers on implementation, experimentation, and best practices

What You’ll Need to Succeed

• Bachelor’s degree in Computer Science, Robotics, Electrical Engineering, Machine Learning, or related technical field with 6+ years of industry experience, OR Master’s degree with 3+ years OR PhD with 1+ years of experience

• Experience applying reinforcement learning, imitation learning, or sequence modeling to robotics, autonomous systems, or complex control problems

• Strong programming skills in Python and PyTorch, with experience writing production-quality ML code

• Experience training, evaluating, and improving models using large-scale datasets and distributed compute environments

• Solid understanding of ML architectures used in autonomy systems (e.g., transformers, RNNs, graph neural networks, policy networks)

• Experience debugging model behavior, analyzing performance metrics, and improving model reliability

• Ability to translate ambiguous problems into structured ML solutions and deliver results independently

• Experience collaborating cross-functionally to integrate ML models into larger autonomy systems

Bonus Points:

• Experience in autonomous driving, robotics, or simulation-based training environments

• Experience with reinforcement learning frameworks or distributed training systems (e.g., Ray)

• Experience working with simulation environments, scenario generation, or large-scale behavior datasets

• Familiarity with vehicle dynamics, motion planning, or multi-agent decision-making systems

• Experience deploying ML models into production or real-world robotics systems

• Experience with learned planning systems or policy learning in real-world or simulation environments

• Experience integrating learned behavior models into validation and V&V workflows

• Background in multi-agent modeling, driver behavior modeling, or long-horizon decision-making systems

Work Location: For this position, we are open to hiring in either the Ann Arbor, MI OR Blacksburg, VA (U.S.) office work locations in a hybrid capacity. We are also open to hiring Remote in the United States

Perks of Being a Full-time Torc’r

Torc cares about our team members and we strive to provide benefits and resources to support their health, work/life balance, and future. Our culture is collaborative, energetic, and team focused. Torc offers:

• A competitive compensation package that includes a bonus component and stock options

• 100% paid medical, dental, and vision premiums for full-time employees

• 401K plan with a 6% employer match

• Flexibility in schedule and generous paid vacation (available immediately after start date)

• Company-wide holiday office closures

• AD+D and Life Insurance

At Torc, we’re committed to building a diverse and inclusive workplace. We celebrate the uniqueness of our Torc’rs and do not discriminate based on race, religion, color, national origin, gender (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender identity, gender expression, age, veteran status, or disabilities.
Even if you don’t meet 100% of the qualifications listed for this opportunity, we encourage you to apply.

Our compensation reflects the cost of labor across several geographic markets. Pay is based on a number of factors and may vary depending on job-related knowledge, skills, and experience. Torc's total compensation package will also include our corporate bonus and stock option plan. Dependent on the position offered, sign-on payments, relocation, and other forms of compensation may be provided as part of a total compensation package, in addition to a full range of medical, financial, and/or other benefits.

Job ID: 102603

Hiring Range for Job Opening
US Pay Range
$226,400—$271,700 CAD

Why Apply Through MisuJob?

AI-Powered Job Matching: MisuJob uses advanced artificial intelligence to analyze your skills, experience, and career goals. Our matching algorithm compares your profile against thousands of job requirements to find positions where you have the highest chance of success. This saves you hours of manual job searching and ensures you only see relevant opportunities.

One-Click Applications: Once you create your profile, applying to jobs is effortless. Your resume and cover letter are automatically tailored to highlight the most relevant experience for each position. You can apply to multiple jobs in minutes, not hours.

Career Intelligence: Beyond job matching, MisuJob provides valuable career insights. See how your skills compare to market demands, identify skill gaps to address, and understand salary benchmarks for your experience level. Make data-driven decisions about your career path.

Frequently Asked Questions

How do I apply for this position?

Click the "Register to Apply" button above to create a free MisuJob account. Once registered, you can apply with one click and track your application status in your dashboard.

Is MisuJob free for job seekers?

Yes, MisuJob is completely free for job seekers. Create your profile, get matched with jobs, and apply without any cost. We help you find your dream job without any hidden fees.

How does AI matching work?

Our AI analyzes your resume, skills, and experience to understand your professional profile. It then compares this against job requirements using natural language processing to calculate a match percentage. Higher matches mean better fit for the role.

Can I apply to jobs in other countries?

Absolutely. MisuJob features jobs from companies worldwide, including remote positions. Filter by location or look for remote opportunities to find jobs that match your preferences.

Ready to Apply?

Join thousands of job seekers using MisuJob's AI to find and apply to their dream jobs automatically.

Register to Apply