Zoox Logo

Zoox

Senior/Staff Software Engineer - Machine Learning & System Optimization

Posted Yesterday
Be an Early Applicant
Hybrid
Boston, MA, USA
226K-307K Annually
Senior level
Hybrid
Boston, MA, USA
226K-307K Annually
Senior level
Design and optimize on-vehicle perception inference: allocate CPU/GPU resources, compress and quantize large multimodal models, build TensorRT model conversion pipelines, and deliver production-quality low-latency C++/CUDA inference for power- and thermal-constrained vehicle SoCs.
The summary above was generated by AI

The Perception team is pioneering the development of a multi-modality foundation model to drive the next generation of autonomous system intelligence.

As a Machine Learning and System Optimization Engineer, you will orchestrate and allocate overall system capacity to various core perception models running on-bot, as well as drive large initiatives that allow for more efficient inference by sharing various parts of the perception stack with one another.

You will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack. We are looking for experts with hands-on experience compressing, accelerating, and deploying complex models, including LLMs, VLMs, or foundation models, for power- and thermal-constrained vehicle SoCs.

In addition, you will optimize ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.

In this role, you will:

  • Allocate and distribute system resources (CPU/GPU/interconnect) to various models and inference engines running on the robot.

  • Spearhead cross-cutting initiatives that allow for better compute utilization through sharing/fusing models and better scheduling strategies.

  • Optimize large-scale models (Multi-Modal Sensor Fusion models, LLMs, VLMs) using advanced quantization (PTQ, QAT), pruning, mixed-precision inference frameworks, and parameter-efficient fine-tuning (LoRA, QLoRA).

  • Architect and implement model conversion and compilation pipelines using TensorRT for edge deployment.

  • Write production-level, low-latency, and memory-safe C++ and CUDA code for real-time inference on vehicle systems.

Qualifications:

  • Deep experience in system and performance optimization in CPU/GPU systems designed for low latency or high throughput.

  • Deep expertise in working with real-time systems & required constraints such as processing latency, memory utilization, and memory bandwidth pressure.

  • Deep expertise in model quantization (PTQ, QAT) and mixed-precision inference frameworks (INT8, FP8, FP4, BF16/FP16).

  • Proficiency in low-level programming for AI accelerators, specifically developing and optimizing custom ML OPs and TensorRT Plugins with efficient CUDA kernel implementations.

  • Production-level C++ (14/17/20) and Python programming skills, with experience developing concurrent, memory-safe, real-time inference code for edge devices.

Bonus Qualifications:

  • Prior experience in high-performance robotics applications such as AV/drones/robots.

  • Familiarity with SOTA autonomous driving perception algorithms (temporal 3D object detection, BEV, 3D Occupancy Networks) and multi-modal sensor processing (Vision, LiDAR, Radar).

  • Experience with end-to-end autonomous driving paradigms (VLM/VLA models, Foundation models) and edge deployment technologies (e.g., TensorRT-LLM).

About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.

Follow us on LinkedIn

Accommodations
If you need an accommodation to participate in the application or interview process please reach out to [email protected] or your assigned recruiter.

A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.

Zoox Boston, Massachusetts, USA Office

100 Summer Street, Boston, MA, United States, 02110

Similar Jobs

49 Minutes Ago
Easy Apply
Remote or Hybrid
USA
Easy Apply
97K-138K Annually
Junior
97K-138K Annually
Junior
Cloud • Information Technology • Security • Software • Cybersecurity
Sell Zscaler cloud-native Zero Trust solutions to commercial/private equity accounts. Build C-suite relationships, create long-term account strategies, collaborate with internal teams, act as trusted advisor, and drive net-new logo acquisition and quota attainment through direct and channel selling.
Top Skills: Cloud-NativeZero Trust ExchangeZscaler
58 Minutes Ago
Easy Apply
Remote or Hybrid
United States
Easy Apply
Senior level
Senior level
Legal Tech • Software • Generative AI
Design and deliver custom AI-driven legal workflows, implement and scale post-sales solutions, define KPIs and success metrics, inform product strategy, partner with Customer Success to demonstrate ROI, and maintain objective, ethics-compliant processes and playbooks for plaintiff law firms.
Top Skills: AnthropicOpenai
An Hour Ago
Remote or Hybrid
235K-414K Annually
Expert/Leader
235K-414K Annually
Expert/Leader
Artificial Intelligence • Cloud • Machine Learning • Mobile • Software • Virtual Reality • App development
Design and build next-generation ads formats, backend infrastructure, and scalable distributed systems. Lead technical direction, collaborate across teams to define product requirements, experiment, analyze and optimize ad performance, and ensure availability, scalability, operational excellence, and cost management. Mentor engineers and apply AI tools and high-velocity workflows to deliver production-ready systems.

What you need to know about the Boston Tech Scene

Boston is a powerhouse for technology innovation thanks to world-class research universities like MIT and Harvard and a robust pipeline of venture capital investment. Host to the first telephone call and one of the first general-purpose computers ever put into use, Boston is now a hub for biotechnology, robotics and artificial intelligence — though it’s also home to several B2B software giants. So it’s no surprise that the city consistently ranks among the greatest startup ecosystems in the world.

Key Facts About Boston Tech

  • Number of Tech Workers: 269,000; 9.4% of overall workforce (2024 CompTIA survey)
  • Major Tech Employers: Thermo Fisher Scientific, Toast, Klaviyo, HubSpot, DraftKings
  • Key Industries: Artificial intelligence, biotechnology, robotics, software, aerospace
  • Funding Landscape: $15.7 billion in venture capital funding in 2024 (Pitchbook)
  • Notable Investors: Summit Partners, Volition Capital, Bain Capital Ventures, MassVentures, Highland Capital Partners
  • Research Centers and Universities: MIT, Harvard University, Boston College, Tufts University, Boston University, Northeastern University, Smithsonian Astrophysical Observatory, National Bureau of Economic Research, Broad Institute, Lowell Center for Space Science & Technology, National Emerging Infectious Diseases Laboratories

Sign up now Access later

Create Free Account

Please log in or sign up to report this job.

Create Free Account