Senior ML Engineer, Embodied AI

Remote

Serve Robotics aims to develop dependable and proficient sidewalk autonomy software. Our Perception & Prediction team is looking for talent in research areas such as artificial intelligence, robotics, and embodied AI, in particular representation learning for embodied AI tasks, end-to-end learning, vision-language-action models, and contextually aware AI.

Key Responsibilities

  • Pioneer the development of embodied intelligence for robotics, enabling unprecedented generalization across diverse environments and embodiments.

  • Invent or improve data-driven paradigms for embodied intelligence, leveraging a variety of modalities (images, video, Lidar point clouds, text, audio, etc.).

  • Create solutions with business impact that improve human-robot interaction through embodied intelligence.

  • Work with ML infrastructure engineers to assess and monitor model performance, design active learning frameworks for long-tail problems, and develop data-aware solutions that improve generalization.

  • Collaborate with various teams to understand real-world problems and define tasks, incorporating insights into ML products.

Qualifications

  • MS with 5 years of industry experience, or PhD with a minimum of 3 years of industry experience, with a focus on embodied intelligence, AR/VR, or robotics use cases in an end-to-end learning framework.

  • Demonstrated leadership in scientific efforts, with contributions to top-rated conferences and journals.

  • Experience with deep learning frameworks (such as PyTorch and TensorFlow) and computational data science libraries.

  • Experience working with large-scale datasets and sensor data, and with various transformer architectures and diffusion models.

  • Up to date with the latest technology trends in AI, robotics, and foundation models. Deep understanding of the fundamentals of computer vision, machine learning, and deep learning.

  • Comfortable working with SQL queries and ETL logic for data ingress.

  • Strong programming skills, with the ability to write efficient and robust code.

  • Experience with multiple sensors such as Lidar, Mono/Stereo cameras, IMU, etc.

  • Strong communication skills.

What makes you stand out

  • Hands-on experience with fine-tuning LLMs or vision-language models and their performance challenges and opportunities.

  • Experience with MLOps or ML infrastructure, and the ability to streamline machine learning workflows.

  • Familiarity with edge-device perception stack deployment, and experience with NVIDIA software libraries such as CUDA or TensorRT.

  • Open source project contributor.

  • Experience with GCP or AWS, Kubernetes and Docker.

Hiring locations

Currently, we are only hiring US and Canada residents, even for remote jobs.

More about us

Serve Robotics is the team that created the Postmates delivery robot and brought it to life in LA, completing tens of thousands of deliveries in LA’s busiest neighborhoods. Now we're an independent company growing rapidly in order to connect people with what they need via robots designed to serve people.

We are proud to be an equal employment opportunity and affirmative action employer. Qualified applicants are considered without regard to race, color, religion, national origin, age, sex, marital status, ancestry, physical or mental disability, veteran status, gender identity, or sexual orientation.

See Serve in action

Serve at TED 2020 main stage

Job Profile

Restrictions

Only hiring US and Canada residents

Tasks

  • Collaborate on real-world problems
  • Create solutions for human-robot interaction
  • Develop embodied intelligence
  • Invent data-driven paradigms
  • Monitor model performance

Skills

Artificial Intelligence, AWS, Cameras, Communication, Computational data science, Computer Vision, Contextually aware AI, CUDA, Deep Learning, Diffusion Models, Docker, Embodied AI, End-to-end learning, ETL, GCP, IMU, Kubernetes, Lidar, Machine Learning, ML infrastructure, MLOps, Mono cameras, PyTorch, Representation learning, Robotics, SQL, Stereo cameras, TensorFlow, TensorRT, Transformer architectures, Vision-language-action models

Experience

5 years

Education

MS, Ph.D.