ARCHIVED
This job listing has been archived and is no longer accepting applications.
Vision Language Model Engineer

EchoTwin AI

San Francisco, California, United States (permanent)

Posted: September 6, 2025

Quick Summary

EchoTwin AI is an AI-driven infrastructure intelligence company that transforms municipal fleets into mobile urban sensors, providing real-time insights into infrastructure, compliance, and safety. The company is headquartered in San Francisco, California. The role requires expertise in AI, spatial reasoning, and software development.

Job Description

Company Overview

EchoTwin AI is pioneering AI-driven infrastructure intelligence, redefining how cities are managed. Powered by a proprietary visual intelligence engine with full spatial reasoning, EchoTwin transforms municipal fleets into mobile urban sensors—creating living digital twins that provide real-time insights into infrastructure, compliance, and safety. By enabling municipalities to proactively monitor, predict, and resolve issues, EchoTwin helps build resilient, self-healing, and sustainable urban ecosystems. More than “smart cities,” EchoTwin is advancing the era of cognizant cities—urban environments with the awareness to see, think, and act on challenges in real time.

What You’ll Do

As a Vision Language Model Engineer, you will design, develop, and optimize advanced vision-language models that integrate visual and textual data to enable intelligent systems. You will work closely with cross-functional teams to build models that power applications such as image captioning, visual question answering, and multimodal AI at the edge.

Key Responsibilities

• Design and implement state-of-the-art vision-language models using deep learning frameworks.

• Develop and fine-tune models that combine computer vision and natural language processing for tasks like image captioning, visual question answering, and text-to-image generation.

• Collaborate with data scientists and software engineers to integrate models into production systems.

• Optimize model performance for accuracy, latency, and scalability in real-world applications.

• Conduct experiments to evaluate model performance and iterate on architectures and training pipelines.

• Stay up to date with the latest research in vision-language models and incorporate advancements into projects.

• Contribute to data preprocessing, augmentation, and annotation pipelines for multimodal datasets.

• Document model development processes and present findings to technical and non-technical stakeholders.

Qualifications

• Bachelor’s, Master’s, or Ph.D. in Computer Science, Machine Learning, Artificial Intelligence, or a related field (or equivalent experience).

• 3+ years of experience in machine learning, with a focus on vision-language models or multimodal AI.

• Hands-on experience with deep learning frameworks such as PyTorch or TensorFlow.

• Proven track record of building and deploying computer vision and/or NLP models.

• Proficiency in Python and relevant ML libraries (e.g., Hugging Face, OpenCV, Transformers).

• Experience with large-scale model training and optimization (e.g., distributed training, quantization).

• Strong understanding of neural network architectures (e.g., CNNs, Transformers, CLIP, or similar).

• Experience with multimodal datasets and preprocessing techniques for images and text.

• Familiarity with cloud platforms (e.g., AWS, GCP, Azure) and model deployment workflows.

• Strong problem-solving skills and ability to work in a fast-paced, collaborative environment.

• Excellent communication skills to explain complex technical concepts to diverse audiences.

Benefits and Perks

You will find learning and development opportunities among a highly diverse and talented peer group, with experts in fields such as Computer Vision, GenAI, Digital Twin, Government Contracting, Systems and Device Engineering, Operations, and Communications.

• Options for medical, dental, and vision coverage for employees and dependents (for US employees)

• Flexible Spending Account (FSA) and Dependent Care Flexible Spending Account (DCFSA)

• 401(k) with 3% company matching

• Unlimited PTO

• Profit sharing

Please do not forward resumes to our jobs alias, EchoTwin AI employees, or any other company location. EchoTwin AI is not responsible for any fees related to unsolicited resumes.

Life at EchoTwin AI

If you want to empower the world’s most important cities—and the institutions that run them—you belong here. At EchoTwin AI, we value excellence regardless of background and are committed to building a team that reflects the communities we serve.

EchoTwin AI is an Equal Opportunity Employer. We consider all qualified applicants without regard to race, color, religion, sex (including pregnancy), sexual orientation, gender identity or expression, national origin, age, disability, veteran status, genetic information, or any other status protected by applicable law.
