Jobs / Zoo***

Senior/Staff Software Engineer - Machine Learning & System Optimization

Zoo*** · Seattle, WA, United States

Visa sponsorship details are locked. Unlock company name and apply link with .

Seattle, WA, United States226,000-307,000 USD/yearlyOnsite

Remuneration

226,000-307,000 USD/yearly

Location

Seattle, WA, United States

Visa sponsorship

Sponsors visa

Job summary

The Perception team is pioneering the development of a multi-modality foundation model to drive the next generation of autonomous system intelligence.

Benefits

Including paid time off (e.g.About ZooxWe’re looking for top talent that shares our passion and wants to be part of a fFollow us on LinkedInAccommodationsA Final Note:You do not need to match every listed expectation to apply for this position.

Qualifications

We are looking for experts with hands-on experience compressing, accelerating, and deploying complex models, including LLMs, VLMs, or foundation models, for power- and thermal-constrained vehicle SoCs.
Deep experience in system and performance optimization in CPU/GPU systems designed for low latency or high throughput.
Deep expertise in working with real-time systems & required constraints such as processing latency, memory utilization, and memory bandwidth pressure.
Deep expertise in model quantization (PTQ, QAT) and mixed-precision inference frameworks (INT8, FP8, FP4, BF16/FP16).
Proficiency in low-level programming for AI accelerators, specifically developing and optimizing custom ML OPs and TensorRT Plugins with efficient CUDA kernel implementations.
Production-level C++ (14/17/20) and Python programming
Prior experience in high-performance robotics applications such as AV/drones/robots.
Familiarity with SOTA autonomous driving perception algorithms (temporal 3D object detection, BEV, 3D Occupancy Networks) and multi-modal sensor processing (Vision, LiDAR, Radar).
Experience with end-to-end autonomous driving paradigms (VLM/VLA models, Foundation models) and edge deployment

Responsibilities

You will focus on bringing highly efficient, production-ready large-scale models to our on-vehicle stack.
In addition, you will optimize ML models, write custom CUDA kernels, and build highly concurrent inference code to ensure real-time, deterministic execution on edge devices.
IN THIS ROLE, YOU WILL:

Degrees

Associate

Industry

AutomotiveEnergyInsurance

Company size

Smb