At Wayve we're committed to creating a diverse, fair and respectful culture that is inclusive of everyone based on their unique skills and perspectives, and regardless of sex, race, religion or belief...
Jobs
Browse 250+ jobs updated daily
Latest Job Openings
About STACK STACK builds software that helps teams plan, build, and operate with clarity and speed. We’re investing in an in-house AI team to train and run models that meaningfully improve our produc...
NVIDIA is transforming healthcare with AI to power the next generation of innovation in Biology and Life Sciences. BioNeMo platform is rapidly growing and it is becoming the defacto platform for AI-dr...
Senior Software Engineer I, Inference
Coreweave
CoreWeave is The Essential Cloud for AI™. Built for pioneers by pioneers, CoreWeave delivers a platform of technology, tools, and teams that enables innovators to build and scale AI with confidence. T...
Inference Optimization Engineer
Bentoml
About BentoML BentoML is a leading inference platform provider that helps AI teams run large language models and other generative AI workloads at scale. With support from investors such as DCM, enter...
We invite you to join NinjaTech AI as an Applied Scientist specialized in AI inference and distributed systems to help optimize and scale our AI models for production environments. You will work at t...
Fullstack Engineer - Frontend Focus
Inference
Inference.net is hiring a Senior Full-Stack (Frontend-Focused) Engineer Help us build beautiful, performant web experiences that give users super-powers over our globally distributed LLM inference pl...
Machine Learning Researcher
Inference
Help us push the boundaries of what's possible in LLM post-training. If you love training models, exploring new architectures, running experiments, and turning research insights into products that shi...
Help us make inference blazingly fast. If you love squeezing every last drop of performance out of GPUs, diving deep into CUDA kernels, and turning optimization techniques into production systems, we'...
Applied Machine Learning Engineer
Inference
Help us build the systems that train specialized AI models for the fastest-growing companies in the world. If you love taking cutting-edge ML techniques and turning them into products that ship, we'd ...
Filmmaker / Storyteller
Inference
Filmmaker / Storyteller Inference.net is seeking a Filmmaker / Storyteller to join our team and help define the narrative of building the world's largest distributed GPU cluster. This role combines c...
Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every co...
Airbnb was born in 2007 when two hosts welcomed three guests to their San Francisco home, and has since grown to over 5 million hosts who have welcomed over 2 billion guest arrivals in almost every co...
About us Most AI is frozen in place - it doesn't adapt to the world. We think that's backwards. Our mandate is to build efficient intelligence that evolves in real-time. Our vision is AI systems that...
Location: San Francisco, CA (Onsite | Remote) About Virtue AI Virtue AI sets the standard for advanced AI security platforms. Built on decades of foundational and award-winning research in AI securi...
About Virtue AI Virtue AI sets the standard for advanced AI security platforms. Built on decades of foundational and award-winning research in AI security, its AI-native architecture unifies automate...
Machine Learning Engineer — Inference Optimization
Featherlessai
About the Role We’re looking for a Machine Learning Engineer to own and push the limits of model inference performance at scale. You’ll work at the intersection of research and production—turning cut...
AI Researcher — Inference Optimization
Featherlessai
Role Overview We are seeking an AI Researcher with deep experience in inference optimization to design, evaluate, and deploy high-performance inference systems for large-scale machine learning models...
Every day, tens of millions of people come to Roblox to explore, create, play, learn, and connect with friends in 3D immersive digital experiences– all created by our global community of developers an...
Teamwork makes the stream work. Roku is changing how the world watches TV Roku is the #1 TV streaming platform in the U.S., Canada, and Mexico, and we've set our sights on powering every television ...
About Liquid AI Spun out of MIT CSAIL, we build general-purpose AI systems that run efficiently across deployment targets, from data center accelerators to on-device hardware, ensuring low latency, m...
About H: H exists to push the boundaries of superintelligence with agentic AI. By automating complex, multi-step tasks typically performed by humans, AI agents will help unlock full human potential. ...
At d-Matrix, we are focused on unleashing the potential of generative AI to power the transformation of technology. We are at the forefront of software and hardware innovation, pushing the boundaries ...
LLM Inference Engineer
Periodic Labs
About Periodic Labs We are an AI + physical sciences lab building state of the art models to make novel scientific discoveries. We are well funded and growing rapidly. Team members are owners who ide...
OutcomesAI is a healthcare technology company building an AI-enabled nursing platform designed to augment clinical teams, automate routine workflows, and safely scale nursing capacity. Our solution co...
Inference Engineer
Cartesia
About Cartesia Our mission is to build the next generation of AI: ubiquitous, interactive intelligence that runs wherever you are. Today, not even the best models can continuously process and reason ...
Software Engineer (Inference Engine)
Furiosa Ai
About the job Software Engineer (Inference Engine)는 FuriosaAI NPU에서 구동되는 대규모 언어모델 및 멀티모달 모델을 위한 고성능 추론 엔진을 개발하고 최적화합니다. 최신 추론 최적화 기술을 선도적으로 연구조사 하여 엔진에 적용하며, 컴파일러팀, 하드웨어팀과 긴밀한 협업을 통해 엔진의 성능을 고도화하는 역할...
Applied AI Inference Engineer
Baseten
ABOUT BASETEN Baseten powers mission-critical inference for the world's most dynamic AI companies, like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma and Writer. By uniting applied AI research, ...
Sr. Software Engineer, ML Edge Inference
Serverobotics
At Serve Robotics, we’re reimagining how things move in cities. Our personable sidewalk robot is our vision for the future. It’s designed to take deliveries away from congested streets, make deliverie...
Architecture Intern - Inference Location: San Jose, CA Team: Architecture About Etched Etched is building the world’s first AI inference system purpose-built for transformers - delivering over 10x h...
Head of Inference Kernels
Etched
About Etched Etched is building the world’s first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. With...
Inference Software Engineer
Etched
About Etched Etched is building the world’s first AI inference system purpose-built for transformers - delivering over 10x higher performance and dramatically lower cost and latency than a B200. With...
At Toyota Research Institute (TRI), we’re on a mission to improve the quality of human life. We’re developing new tools and capabilities to amplify the human experience. To lead this transformative sh...
Boson AI is an early-stage startup building large audio models for everyone to enjoy and use. Our founders (Alex Smola,Mu Li), and a team of Deep Learning, Optimization, NLP, and Statistics scientists...
Boson AI is an early-stage startup building large audio models for everyone to enjoy and use. Our founders (Alex Smola,Mu Li), and a team of Deep Learning, Optimization, NLP, and Statistics scientists...
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building ...
Waymo is an autonomous driving technology company with the mission to be the world's most trusted driver. Since its start as the Google Self-Driving Car Project in 2009, Waymo has focused on building ...
Crusoe's mission is to accelerate the abundance of energy and intelligence. We’re crafting the engine that powers a world where people can create ambitiously with AI — without sacrificing scale, speed...
About us: Paytm is India's payment Super App offering consumers and merchants most comprehensive payment services. Pioneer of mobile QR payments revolution in India, today, Paytm is India’s largest pa...
Senior AI Inference Engineer (llama.cpp specialist) - 100% Remote
Tether Operations Limited
Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...
Lead AI Inference Engineer
Tether Operations Limited
Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...
NVIDIA is at the forefront of the generative AI revolution! The Algorithmic Model Optimization Team specifically focuses on optimizing generative AI models such as large language models (LLM) and diff...
Software Engineer, ML (Training and Inference)
Isomorphiclabs
Isomorphic Labs is applying frontier AI to help unlock deeper scientific insights, faster breakthroughs, and life-changing medicines with an ambition to solve all disease. The future is coming. A fut...
Our work at NVIDIA is dedicated towards a computing model focused on visual and AI computing. For two decades, NVIDIA has pioneered visual computing, the art and science of computer graphics, with our...
Inference Engineering Manager
Perplexity
ABOUT THE ROLE We are looking for an Inference Engineering Manager to lead our AI Inference team. This is a unique opportunity to build and scale the infrastructure that powers Perplexity's products ...
Lead AI Inference Engineer
Confidential
Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...
Lead AI Inference Engineer
Confidential
Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...
NVIDIA is at the forefront of the generative AI revolution. We are looking for a Software Engineer, Performance Analysis, and Optimization for LLM Inference, to join our performance engineering team. ...
Lead AI Inference Engineer
Confidential
Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...
Lead AI Inference Engineer
Confidential
Join Tether and Shape the Future of Digital Finance At Tether, we’re not just building products, we’re pioneering a global financial revolution. Our cutting-edge solutions empower businesses—from exc...