ARCHIVED
This job listing has been archived and is no longer accepting applications.

VLM Research Engineer (m/f/d)

Deltia

Berlin, Germany (permanent)

Posted: December 18, 2025

Job Description

We’re looking for a Research Engineer to push the limits of vision-language models for real-world video understanding. You’ll work on applied, state-of-the-art multimodal models and turn them into production pipelines used by customers.

Your role

• Design and adapt vision-language and video models for scene understanding, temporal reasoning and activity / action recognition

• Build and maintain large-scale training and evaluation pipelines on GPU clusters

• Curate and augment video-text and action datasets, including synthetic labels and retrieval-based augmentation

• Develop robust benchmarks for video QA, instruction following and temporal understanding, and use them to drive iterative model improvements

• Slim down and refactor model architectures for efficiency and deployability (compression, pruning, distillation)

• Deliver production-ready inference pipelines to product and customer teams, working closely with CV, platform and robotics engineers

You bring

• Completed PhD (or equivalent research track record) in computer vision, machine learning, robotics or a related field

• Strong background in video-centric deep learning: scene understanding, temporal / activity / action recognition, or video generation

• Experience training and adapting large vision models or VLMs (e.g. InternVL, Qwen-VL, DeepSeek-VL, similar stacks)

• Proven experience with multi-GPU training (PyTorch, distributed training, mixed precision) and large-scale datasets

• Solid engineering habits: clean Python, reproducible experiments, reliable data and training pipelines

• Track record of moving research into usable systems (demos, internal tools, or productised features) in fast-moving teams

Nice to have

• Publications at top-tier venues (CVPR, ICCV, ECCV, NeurIPS, ICLR, etc.) on video, multimodal learning or scene understanding

• Experience with 3D/4D scene representations, action generation or embodied / sense-plan-act style projects

• Inference optimisation: quantisation, TensorRT, model distillation, or deployment on constrained hardware

• Prior experience in a startup or applied research lab environment

What we offer

A competitive salary & stock options*

Be at the forefront of defining what artificial intelligence means in manufacturing

Gain hands-on experience working at an AI-first software company

Supportive and inclusive culture that values diversity and promotes the advancement of underrepresented groups within the company

Collaborate with a diverse (currently more than 10 nationalities) and talented team, working on cutting-edge projects with real-world impact

Network with professionals and leaders in the field, opening doors to potential future career opportunities

We have a very flat hierarchy, open 360° feedback, and flexible working hours

Ethics⚖: We are committed to developing ethical AI software

Don't meet all the requirements?

Deltia is committed to creating a workplace that is diverse, fair, and inclusive. We encourage candidates from all backgrounds, even if they do not meet every qualification, to submit their application. We firmly believe that having a team with diverse perspectives only strengthens our company and drives innovation. Our commitment also extends to providing an accessible environment for everyone, including those with disabilities. Please let us know if you require any accommodations during the application process or while working with us, and we will do our best to support you.

*Only full-time, permanent roles are eligible for stock options. Part-time, contract, working-student, internship and freelance roles are not eligible.
