Zoox Logo

Zoox

Machine Learning Engineer - Perception

Sorry, this job was removed at 08:08 p.m. (EST) on Wednesday, Jul 23, 2025
Hybrid
Boston, MA, USA
172K-235K Annually
Hybrid
Boston, MA, USA
172K-235K Annually

Similar Jobs

Yesterday
Hybrid
Boston, MA, USA
229K-317K Annually
Senior level
229K-317K Annually
Senior level
Artificial Intelligence • Machine Learning • Robotics • Software • Transportation • Design • Manufacturing
The role involves developing and optimizing large language models for robotaxis, enhancing their understanding of urban environments, and integrating models into decision-making processes.
Top Skills: NumpyPythonPyTorch
2 Days Ago
Hybrid
Charlestown, MA, USA
140K-175K Annually
Senior level
140K-175K Annually
Senior level
Robotics • Transportation
The Senior Machine Learning Engineer will design multi-modal vision systems, optimize and deploy models, mentor junior engineers, and improve data strategies for robotic perception tasks.
Top Skills: AWSC++CudaDockerGCPOnnxPythonPyTorchTensorrt
2 Days Ago
Remote or Hybrid
U.S.
199K-267K Annually
Senior level
199K-267K Annually
Senior level
Artificial Intelligence • Automotive • Machine Learning • Transportation
Lead the development of advanced perception algorithms for self-driving vehicles. Collaborate and mentor engineers, optimize algorithms, and implement sensor fusion for robust perception systems.
Top Skills: C++CudaDeep LearningMachine LearningPythonPyTorchTensorFlow
The Perception team at Zoox is fundamental to our autonomous vehicle technology, creating the understanding of the world for our self-driving robots. We enable safe and efficient navigation in complex environments through sophisticated detection, classification, and tracking systems.


As a member of the Detection, you'll be responsible for helping build sensor fusion models for detecting agents around the robot. You will work on building the vision backbone of the detector by designing sensor fusion models for Zoox's sensor data. You will work on incorporating SOTA language alignment strategies into the per-sensor modality backbones and help Zoox advance faster into new geofences by making the models more generalizable and adaptable to new circumstances.


In this role, you will:

  • Train and upgrade current image backbone by performing various studies into incorporating SOTA approaches to be multi-frame, even higher resolution and using large foundational, language-aligned feature encoder
  • Deploy temporal image backbone into on-vehicle inference architecture and optimize runtime compute resource;
  • Incorporate temporal information into the the existing Zoox’s multi-sensor (Vision, Lidar and Radar) early fusion system to predict object velocity and scene flow
  • Develop effective metrics and evaluating algorithm performance with these metrics to demonstrate improvement.
  • Collaborate with teams such as ML Optimization, Data Labeling, V&V/Metrics and Planner/Prediction to ensure project is successful for Zoox releases

Qualifications

  • Master's degree or PhD in computer science or related field
  • Experience with Vision Language Model or multi-modal 3D foundation model.
  • Experience with Python and/or C++
  • Experience with model deployment with TensorRT
  • Proficient in PyTorch model design, implementation, training and evaluation.

Bonus Qualifications

  • GPU/CUDA programming experience
  • Experience with customized TRT plugin development 
  • Experience working with LiDAR, Camera and Radar data

About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.

Follow us on LinkedIn

Accommodations
If you need an accommodation to participate in the application or interview process please reach out to [email protected] or your assigned recruiter.

A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

Zoox Boston, Massachusetts, USA Office

100 Summer Street, Boston, MA, United States, 02110

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account