Senior Software Engineer, Vision Language Models
Remote
Mission Summary
At Motional, data play a critical role in fueling our ML-centered autonomous driving vehicle. Our robo-taxi fleet collects petabytes of data on the road every day – the Data Mining team is mining & filtering the massive influx of fleet data by developing billion-scale data workflows and state-of-the-art mining algorithms. Through our mining and learning frameworks we continuously improve the on-road performance of ML products for perception, prediction & planning with every mile driven.
We mine for model errors, anomalies, rare objects & long-tail driving scenarios across millions of driving hours – these are used for laser-focused ML model training and continuous edge case validation. We are looking for an engineer to spearhead new mining strategies & workflows and help deliver high-quality data that improve our core ML products.
What you'll be doing:
- Develop data products utilizing foundation models such as multi-modal encoding models (CLIP-style models), vision language models (VLMs) and large language models (LLMs) and adapt them to the autonomous driving domain via pre-training, fine-tuning and prompt optimization.
- Own large-scale mining workflows that surface rare objects, model errors & long-tail events.
- Build high-quality datasets to improve ML products through training & edge case validation.
- Contribute to data processing pipelines that fuel our in-house billion-scale image search engine.
- Provide statistical depth on model performance & generalization through rigorous error analysis across complex driving scenarios.
What we’re looking for:
- BS in computer science, similar discipline or equivalent experience.
- 3+ years of experience architecting and shipping high-performance & large-scale distributed systems.
- Experience with deploying vision language models (VLMs) or large-scale vision encoders (e.g. CLIP) in production settings for image/video understanding, object detection or searching.
- Experience with core cloud services (e.g. AWS’s S3, Athena, RDS or similar) and modern vector databases (OpenSearch, Weaviate, Pinecone etc.).
- Solid software engineering principles – such as software design patterns, configuration management, source control, build processes, code reviews, testing methodologies, app containerization, continuous integration etc.
- Fluency in Python and experience on production-quality software development.
Bonus points (not required):
- MS/PhD in computer science, machine learning, statistics or computer vision.
- Experience with at least one of the following ML techniques/models: Few-shot Learning, Metric Learning, Information Retrieval, Recommender Systems, Contrastive Learning, Semi-supervised Learning, Object Detection / Segmentation / Prediction.
- Experience with PyTorch or other deep …
This job isn't fresh anymore!
Search Fresh JobsJob Profile
Benefits program Bonus Company equity Dental Equity Health saving accounts Life Insurance Medical Pet Insurance Vision
Tasks- Build datasets
- Code reviews
- Develop data products
A/B Testing Algorithms Autonomous Vehicles AWS Cloud Services Computer Vision Data Mining Data processing Data processing pipelines Deep Learning Design Patterns Distributed Systems Large Language Models Machine Learning Model training Object detection Opensearch Pinecone Python PyTorch S3 Software design Software Development Software Engineering Statistical analysis TensorFlow Testing Vision Language Models Weaviate
Experience3 years
EducationB.S. in Computer Science Computer Science Computer Vision Engineering Machine Learning MS MS in Computer Science Ph.D. in Computer Science Statistics
TimezonesAmerica/Anchorage America/Chicago America/Denver America/Los_Angeles America/New_York Pacific/Honolulu UTC-10 UTC-5 UTC-6 UTC-7 UTC-8 UTC-9