Waymo’s EMMA: Teaching Cars to Think – Jyh Jing Hwang, Waymo
Waymo's EMMA: teaching cars to predict human behavior
In the rapidly evolving world of autonomous vehicles, understanding human behavior remains the ultimate challenge. Waymo, a leader in self-driving technology, has made significant strides with their EMMA (Embodied Multi-Modal Agent) model, which aims to bridge the gap between human unpredictability and machine learning. This advancement represents a crucial step toward fully autonomous driving systems that can operate safely alongside human drivers, pedestrians, and cyclists.
Key Points
-
EMMA uses a multi-modal approach that integrates various input types (visual, spatial, temporal) to create a more comprehensive understanding of the driving environment, allowing for better prediction of human behavior.
-
Waymo's model employs a neural network architecture that processes real-world driving scenarios to learn patterns and make predictions about what humans might do next, based on both physical constraints and social cues.
-
Rather than relying solely on rules-based programming, EMMA utilizes a foundation model approach that can generalize across different scenarios, making it more adaptable to novel situations that weren't explicitly covered in training data.
The Breakthrough in Human Behavior Prediction
The most insightful aspect of Waymo's EMMA model is its ability to capture the nuanced, social aspects of driving behavior. Traditional autonomous vehicle systems excel at following traffic rules and detecting obstacles, but they often struggle with predicting human intentions—especially in ambiguous situations like four-way stops or when pedestrians give subtle indications of their next moves.
This matters enormously because autonomous vehicles need to navigate a world designed for human drivers. The unpredictability of human behavior has been one of the biggest roadblocks to widespread adoption of self-driving technology. By creating systems that can interpret and predict human actions more accurately, Waymo is addressing what may be the final frontier in autonomous driving: social intelligence.
The significance extends beyond just technical achievement. As the autonomous vehicle industry faces increasing scrutiny over safety concerns, technologies like EMMA could help restore public confidence. The ability to understand and anticipate human behavior means fewer unexpected interactions and potentially fewer accidents, addressing one of the key concerns about self-driving technology.
Beyond the Video: Real-World Applications and Limitations
What the presentation doesn't fully explore is how this technology performs across different cultural contexts. Driving behaviors an
Recent Videos
Hermes Agent Master Class
https://www.youtube.com/watch?v=R3YOGfTBcQg Welcome to the Hermes Agent Master Class — an 11-episode series taking you from zero to fully leveraging every feature of Nous Research's open-source agent. In this first episode, we install Hermes from scratch on a brand new machine with no prior skills or memory, walk through full configuration with OpenRouter, tour the most important CLI and slash commands, and run our first real task: a competitor research report on a custom children's book AI business idea. Every future episode will build on this fresh install so you can see the compounding value of the agent in real time....
Apr 29, 2026Andrej Karpathy – Outsource your thinking, but you can’t outsource your understanding
https://www.youtube.com/watch?v=96jN2OCOfLs Here's what Andrej Karpathy just figured out that everyone else is still dancing around: we're not in an era of "better models." We're in a different era of computing altogether. And the difference between understanding that and not understanding it is the difference between being a vibe coder and being an agentic engineer. Last October, Karpathy had a realization. AI didn't stop being ChatGPT-adjacent. It fundamentally shifted. Agentic coherent workflows started to actually work. And he's spent the last three months living in side projects, VB coding, exploring what's actually possible. What he found is a framework that explains...
Mar 30, 2026Andrej Karpathy on the Decade of Agents, the Limits of RL, and Why Education Is His Next Mission
A summary of key takeaways from Andrej Karpathy's conversation with Dwarkesh Patel In a wide-ranging conversation with Dwarkesh Patel, Andrej Karpathy — former head of AI at Tesla, founding member of OpenAI, and creator of some of the most popular AI educational content on the internet — shared his views on where AI is headed, what's still broken, and why he's now pouring his energy into education. Here are the key takeaways. "It's the Decade of Agents, Not the Year of Agents" Karpathy's now-famous quote is a direct pushback on industry hype. Early agents like Claude Code and Codex are...