The Machine Learning Engineer Intern at Zoox will work on perception systems, developing algorithms for autonomous driving, collaborating with teams, and leveraging advanced machine learning models and datasets.
Zoox’s internship program provides hands-on experience with state-of-the-art technology, mentorship from some of the industry's brightest minds, and the opportunity to play a part in our success. Internships at Zoox are reserved for those who demonstrate outstanding academic performance, activities outside their coursework, aptitude, curiosity, and a passion for Zoox's mission.
Perception at Zoox is the "Retina of Zoox" — the system responsible for understanding the world around the autonomous vehicle.
As an MLE intern working on Perception, you may be assigned to one of the following teams:
On the Offline Driving Intelligence team, you will develop advanced multimodal large language models that enhance scenario understanding and driving. You'll develop and fine-tune models with driving data, ensuring they can efficiently identify hazards, interpret driving restrictions, drive, and answer questions about the scenario. Working alongside world-class engineers and researchers, you'll leverage premium sensor data and cutting-edge infrastructure to validate your algorithms in real-world conditions, directly impacting the productivity, safety, and capability of Zoox's autonomous system.
On the Perception Attributes team, you will collect and generate datasets for specialized vehicle classification and semantic enrichment, design and frame machine learning problems for real-world autonomous driving scenarios, and train and evaluate state-of-the-art machine learning models with a focus on computer vision. You will also collaborate with engineers to deploy models for real-time inference on our vehicles and contribute to improving our vehicle's ability to recognize and respond to emergency vehicles, school buses, construction vehicles, and other specialized road actors.
On the Perception Scene Understanding team, you will develop advanced ML models that perceive our vehicle's surroundings to identify hazards and driving restrictions. You will use vision-language models to detect rare events and ensure safe driving in these situations. You'll work with state-of-the-art machine learning models that operate in real time on our robotaxi platform with minimal latency. Collaborating with world-class engineers and researchers across sensors, planning, and other teams, you'll have access to premium sensor data and cutting-edge infrastructure to validate your algorithms in real-world conditions.
On the Occupancy and Rare Events team, you will develop multimodal foundation models that serve as the common backbone for on-vehicle perception, enhancing the system's ability to detect long-tail events and generalize to new geofences. In this role, you will develop effective tokenization techniques for Vision, Lidar, and Radar modalities and leverage LLM techniques to align token embeddings across modalities into a common feature space supporting various 3D tasks (detection, segmentation, tracking, feature matching, dense depth). You'll collaborate with top-notch engineers across PCP, MLInfra, and Offboard Driving Intelligence teams, utilizing Zoox's large-scale dataset to train and evaluate models that directly impact the autonomous system's real-world performance.
On the Perception Optimization team, you will build optimized inference pipelines for on-bot algorithms. A major focus of optimization is ML models: techniques such as quantization, pruning, and advanced transformer optimizations (token pruning, token merging, and layer pruning) are used to deploy large models onto the bot and run them in real time. In this role, you will experiment with optimizing SOTA large ML models to make them fit into on-bot compute, using both post-training optimization (e.g. quantization) and architectural approaches (e.g. token merging).
Requirements:
- Currently working towards a B.S., M.S., Ph.D., or advanced degree in a relevant engineering program
- Must be returning to school to continue your education upon completing this internship
- Good academic standing
- Able to commit to a 12-week internship beginning in May or June of 2026
- At least one previous industry internship, co-op, or project completed in a relevant area
- Ability to relocate to the Bay Area, California or Boston for the duration of the internship
- Interns at Zoox may not use any proprietary information they are working on as part of their thesis or any published work with their university, or distribute it to anyone outside of Zoox
Qualifications (It’s helpful if you meet a majority of the following qualifications, but it isn’t a requirement):
- Advanced understanding of Python or C++ (C++ preferred)
- Experience with production ML pipelines: dataset creation, labeling, training, metrics
- Experience training/fine-tuning MLLMs or at least LLMs (SFT/RL)
- Experience with Vision-Language Models
- Experience with model deployment with TensorRT
- Experience with Neural Network design and implementation
- Experience working with LiDAR, Camera and Radar data
- Experience with building and processing large-scale datasets
- GPU/CUDA programming experience
Bonus Qualifications:
- Experience with multimodal foundation model optimization techniques
- Experience in algorithm development for Autonomous Driving software
Compensation:
The monthly salary range for this position is $5,500 to $9,500. Compensation will vary based on geographic location and level of education. Additional benefits may include medical insurance and a housing stipend (relocation assistance will be offered based on eligibility).
About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.
Accommodations
If you need an accommodation to participate in the application or interview process, please reach out to [email protected] or your assigned recruiter.
A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.
Top Skills
C++
CUDA
GPU
Python
TensorRT
Zoox Boston, Massachusetts, USA Office
100 Summer Street, Boston, MA, United States, 02110


