ARCHIVED
This job listing has been archived and is no longer accepting applications.
MisuJob - AI Job Search Platform MisuJob

Machine Learning Engineer — Training Optimization

Featherlessai

Remote (world) permanent

Posted: January 22, 2026

Interested in this position?

Create a free account to apply with AI-powered matching

Quick Summary

Optimize large-scale model training pipelines to scale and improve efficiency.

Job Description

About the Role

We’re looking for an ML Engineer focused on training optimization to help us scale and improve large-scale model training. You’ll work at the intersection of research and production, optimizing training pipelines for speed, stability, and cost—while collaborating closely with researchers pushing model architecture and capability forward.

This is a high-impact role with real ownership: your work directly affects how fast we can iterate, how large we can scale, and how efficiently we deploy new models.

What You’ll Do

• Optimize large-scale model training pipelines (throughput, convergence, stability, and cost)

• Improve distributed training strategies (data, model, and pipeline parallelism)

• Tune optimizers, schedulers, batch sizing, and precision (bf16 / fp16 / fp8)

• Reduce training time and compute cost via profiling, bottleneck analysis, and systems-level improvements

• Collaborate with researchers on architecture-aware training strategies

• Build and maintain robust training infrastructure (checkpointing, fault tolerance, reproducibility)

• Evaluate and integrate new training techniques (e.g. gradient checkpointing, ZeRO, FSDP, custom kernels)

• Own training performance metrics and continuously push them forward

What We’re Looking For

• Strong experience training large neural networks (LLMs or similarly large models)

• Hands-on experience with training optimization (not just model usage)

• Solid understanding of:

• Backpropagation, optimization algorithms, and training dynamics

• Distributed systems for ML training

• Experience with PyTorch (required)

• Comfort working close to hardware (GPUs, memory, networking constraints)

• Ability to move fluidly between research ideas and production-ready code

Nice to Have

• Experience with large-scale distributed training (multi-node, multi-GPU)

• Familiarity with DeepSpeed, FSDP, Megatron, or custom training stacks

• Experience optimizing training on AMD or NVIDIA GPUs

• Contributions to open-source ML infrastructure or research codebases

• Exposure to non-Transformer architectures (RNNs, hybrid models, etc.)

Why Join Us

• Real ownership at Series-A stage — your work shapes the company’s trajectory

• Work on cutting-edge models and training systems at scale

• Small, highly technical team with fast feedback loops

• Strong emphasis on engineering quality and research rigor

• Competitive compensation + meaningful equity

Why Apply Through MisuJob?

AI-Powered Job Matching: MisuJob uses advanced artificial intelligence to analyze your skills, experience, and career goals. Our matching algorithm compares your profile against thousands of job requirements to find positions where you have the highest chance of success. This saves you hours of manual job searching and ensures you only see relevant opportunities.

One-Click Applications: Once you create your profile, applying to jobs is effortless. Your resume and cover letter are automatically tailored to highlight the most relevant experience for each position. You can apply to multiple jobs in minutes, not hours.

Career Intelligence: Beyond job matching, MisuJob provides valuable career insights. See how your skills compare to market demands, identify skill gaps to address, and understand salary benchmarks for your experience level. Make data-driven decisions about your career path.

Frequently Asked Questions

How do I apply for this position?

Click the "Register to Apply" button above to create a free MisuJob account. Once registered, you can apply with one click and track your application status in your dashboard.

Is MisuJob free for job seekers?

Yes, MisuJob is completely free for job seekers. Create your profile, get matched with jobs, and apply without any cost. We help you find your dream job without any hidden fees.

How does AI matching work?

Our AI analyzes your resume, skills, and experience to understand your professional profile. It then compares this against job requirements using natural language processing to calculate a match percentage. Higher matches mean better fit for the role.

Can I apply to jobs in other countries?

Absolutely. MisuJob features jobs from companies worldwide, including remote positions. Filter by location or look for remote opportunities to find jobs that match your preferences.

Ready to Apply?

Join thousands of job seekers using MisuJob's AI to find and apply to their dream jobs automatically.

Register to Apply