ARCHIVED
This job listing has been archived and is no longer accepting applications.

Senior Performance Engineer - Pretraining

Aleph Alpha

Heidelberg, Baden-Württemberg, Germany Remote permanent

Posted: February 26, 2026

Quick Summary

This role involves engineering the systems used to pre-train foundation models at scale at Aleph Alpha, whose customers in finance, manufacturing, and public administration need models that understand German and meet European regulatory requirements. The ideal candidate will have expertise in PyTorch, the CUDA programming model, and distributed training. The successful candidate will work closely with AI researchers to maximize hardware utilization and training throughput on large GPU clusters.

Job Description

Our Mission

Aleph Alpha is one of the few companies in Europe doing serious foundation model pre-training. Our customers - in finance, manufacturing, public administration - need models that understand German, meet European regulatory requirements, and work reliably in high-stakes settings. We're building that in Heidelberg.

We are hiring a Performance Engineer to grow our pre-training efficiency team. If you are excited about making models fast, this is the role for you!

Team Culture

At Aleph Alpha, we foster a culture built on ownership, autonomy, and empowerment. Teams and individual contributors are trusted to take responsibility for their work and drive meaningful impact. We maintain a flat organizational structure with efficient, supportive management that enables quick decision‑making, open communication, and a strong sense of shared purpose.

About the role:

You will engineer the systems required to train foundation models at scale. Your objective is to maximize hardware utilization and training throughput on our large-scale GPU clusters (thousands of NVIDIA Blackwell GPUs). You will work at the intersection of deep learning frameworks, distributed systems, and GPU microarchitecture, eliminating bottlenecks from the Python layer down to the GPU kernel.
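As an illustration of what "hardware utilization" can mean in practice, the sketch below estimates Model FLOPs Utilization (MFU) for a dense transformer. It relies on the common approximation of about 6 FLOPs per parameter per token for a combined forward and backward pass, which ignores attention-specific FLOPs. The function name and every number in the example are hypothetical placeholders, not figures from Aleph Alpha or NVIDIA.

```python
def model_flops_utilization(n_params, tokens_per_second, n_gpus, peak_flops_per_gpu):
    """Estimate MFU as achieved training FLOP/s divided by aggregate peak FLOP/s.

    Assumes a dense transformer and the ~6 * n_params FLOPs-per-token rule of
    thumb (forward + backward); attention FLOPs are not counted.
    """
    achieved_flops_per_s = 6.0 * n_params * tokens_per_second
    peak_flops_per_s = n_gpus * peak_flops_per_gpu
    return achieved_flops_per_s / peak_flops_per_s


# Hypothetical example values, chosen only to make the arithmetic easy to follow.
mfu = model_flops_utilization(
    n_params=7e9,             # 7B-parameter model (assumption)
    tokens_per_second=2.0e6,  # cluster-wide throughput (assumption)
    n_gpus=256,               # cluster size (assumption)
    peak_flops_per_gpu=1.0e15,  # placeholder peak, not a vendor specification
)
print(f"MFU: {mfu:.1%}")
```

Tracking a ratio like this over time is one way to tell whether a change improved throughput or merely shifted a bottleneck.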

This role is for Aleph Alpha Research.

Your responsibilities:

• End-to-End Optimization: Profile training loops using PyTorch Profiler, Nsight Systems and Nsight Compute to identify system- and kernel-level bottlenecks in order to maximize model throughput.

• Distributed Strategy and Topology: Configure and tune composite parallelism strategies (e.g. TP, DP, HSDP/FSDP, EP), optimizing load balance, minimizing critical-path bottlenecks, and managing communication-to-computation trade-offs for large-scale LLM training.

• Hardware-Aware Modeling: Partner with AI Researchers to define model architectures that are hardware-efficient without compromising convergence.
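The communication-to-computation trade-off mentioned in the responsibilities above can be reasoned about with a simple cost model. The sketch below uses the standard bandwidth-only estimate for a ring all-reduce, in which each rank transfers 2 * (p - 1) / p times the message size, and then computes how much of that time is left exposed after overlapping it with compute. It ignores latency and network topology, and all names and numbers are hypothetical assumptions for illustration.

```python
def ring_allreduce_seconds(message_bytes, n_ranks, link_bandwidth_bytes_per_s):
    """Bandwidth-only estimate of ring all-reduce time (latency ignored)."""
    if n_ranks < 2:
        return 0.0  # nothing to communicate with a single rank
    traffic_factor = 2.0 * (n_ranks - 1) / n_ranks
    return traffic_factor * message_bytes / link_bandwidth_bytes_per_s


def exposed_comm_seconds(comm_seconds, overlappable_compute_seconds):
    """Communication time that cannot be hidden behind concurrent compute."""
    return max(0.0, comm_seconds - overlappable_compute_seconds)


# Hypothetical example: gradients for 1e9 parameters in 2-byte precision,
# reduced across 8 ranks over links assumed to sustain 100 GB/s.
comm = ring_allreduce_seconds(
    message_bytes=2 * 1e9,
    n_ranks=8,
    link_bandwidth_bytes_per_s=100e9,
)
hidden_case = exposed_comm_seconds(comm, overlappable_compute_seconds=0.050)
exposed_case = exposed_comm_seconds(comm, overlappable_compute_seconds=0.020)
print(f"all-reduce: {comm * 1e3:.1f} ms")
print(f"exposed with 50 ms of compute: {hidden_case * 1e3:.1f} ms")
print(f"exposed with 20 ms of compute: {exposed_case * 1e3:.1f} ms")
```

A model like this is only a first approximation; profiling traces remain the source of truth for where time actually goes.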

You could be a great fit if you:

• Are proficient in Python and the PyTorch library.

• Have a strong engineering background in parallel and/or distributed systems with a proven track record of excellence.

• Have hands-on experience with modern machine learning techniques (especially large language models and their life cycle).

• Deeply understand the CUDA programming model.

• Have experience in distributed programming with APIs like NCCL or MPI.

• Have experience analysing profiling traces with tools such as PyTorch Profiler and NVIDIA Nsight.

Please note: this role requires regular on-site collaboration in Heidelberg as a member of the Training Efficiency Team.

Strong candidates may also have:

• Contributions to modern distributed training frameworks (e.g., TorchTitan, Megatron-LM, DeepSpeed).

• Familiarity with low-precision training formats (MXFP4, MXFP8) and their impact on numerical stability and throughput.

• A deep understanding of NCCL communication primitives, NVSHMEM, or CUDA IPC, and their performance characteristics.

• A proven track record of implementing and optimising modern transformer-based model training.

• A proven track record working on the NVIDIA Blackwell architecture.

Compensation and Benefits

• Competitive salary and equity package

• 30 days of paid vacation

• Access to a variety of fitness & wellness offerings via Wellhub

• Mental health support through nilo.health

• JobRad® Bike Lease

• Substantially subsidized company pension plan for your future security

• Subsidized Germany-wide transportation ticket

• Budget for additional technical equipment

• Flexible working hours for better work-life balance and hybrid working model
