A fully convolutional, recurrent neural architecture designed to predict future occupancy and motion flow fields

aiMotive

Driving safer mobility

Published Jul 24, 2025

Optimized for Neural Processing Units (NPUs) with convolutional acceleration, CCLSTM delivers state-of-the-art (SOTA) performance in the 2024 Waymo Occupancy Flow Forecasting Challenge, while maintaining real-time efficiency and full end-to-end trainability from camera input to future motion prediction.

CCLSTM is the result of a patented innovation by Péter Lengyel, Research Engineer at aiMotive, offering a novel approach that combines convolutional and recurrent modeling to enhance motion forecasting accuracy and efficiency.

Why CCLSTM?

Predicting the future motion of dynamic agents is a cornerstone capability in autonomous driving. CCLSTM approaches this task using Occupancy Flow Fields—a rich, scalable representation that captures motion, spatial extent, and multi-modal futures in a unified framework. Unlike traditional detection-and-tracking pipelines or transformer-based approaches, CCLSTM is:

FULLY CONVOLUTIONAL – Built entirely from convolution operations, making it ideal for deployment on modern NPUs (e.g., aiWare).
RECURRENT AND AUTOREGRESSIVE – Recursively encodes history with theoretically unlimited lookback and forecasts arbitrary horizons autoregressively.
END-TO-END TRAINABLE – Integrates seamlessly with bird’s-eye view (BEV) encoders, requiring no intermediate heuristics or separate modules.
EXPLAINABLE AND CONTROLLABLE – Preserves semantic richness and enables dynamic behavior control, such as planning with different driving styles.
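
For readers unfamiliar with convolutional recurrence, the update below is the commonly used peephole-free convolutional LSTM formulation. It is shown only to illustrate what "fully convolutional" and "recurrent" mean together; this summary does not specify the exact cell used in CCLSTM, so treat the symbols as assumptions. Here $*$ denotes 2D convolution, $\odot$ the element-wise product, $x_t$ the encoded input grid, and $h_t$, $c_t$ the hidden and cell states.

```latex
\begin{aligned}
i_t &= \sigma\left(W_{xi} * x_t + W_{hi} * h_{t-1} + b_i\right) \\
f_t &= \sigma\left(W_{xf} * x_t + W_{hf} * h_{t-1} + b_f\right) \\
o_t &= \sigma\left(W_{xo} * x_t + W_{ho} * h_{t-1} + b_o\right) \\
g_t &= \tanh\left(W_{xg} * x_t + W_{hg} * h_{t-1} + b_g\right) \\
c_t &= f_t \odot c_{t-1} + i_t \odot g_t \\
h_t &= o_t \odot \tanh(c_t)
\end{aligned}
```

Every learned operator is a convolution, which is why such a cell maps well onto convolution-accelerated hardware, and the state $(h_t, c_t)$ carries history forward without a fixed window.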

An overview of CCLSTM. Rasterized input grids are concatenated along the channel dimension and encoded via a CNN. The encoded features are aggregated by the accumulator CLSTM, whose hidden and cell states are used to initialize the forecasting CLSTM. The forecasting CLSTM is then called autoregressively to predict encoded future states. The future hidden states are passed to a CNN decoder to produce occupancy and flow grids.
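
The data flow in the overview can be sketched in a few lines of plain Python. This is a toy illustration, not the aiMotive implementation: it uses a single channel, random 3x3 kernels, no biases, and omits the CNN encoder and the flow output. The names `ToyConvLSTM`, `forecast`, and `decode_kernel` are hypothetical, and feeding the previous hidden state back as the next input is an assumption, since the overview does not say what the forecasting CLSTM receives at each step.

```python
import math
import random


def conv3x3(x, k):
    """Zero-padded 3x3 cross-correlation of a 2D grid with a 3x3 kernel."""
    h, w = len(x), len(x[0])
    return [[sum(x[i + di][j + dj] * k[di + 1][dj + 1]
                 for di in (-1, 0, 1) for dj in (-1, 0, 1)
                 if 0 <= i + di < h and 0 <= j + dj < w)
             for j in range(w)] for i in range(h)]


def sigmoid(v):
    return 1.0 / (1.0 + math.exp(-v))


class ToyConvLSTM:
    """Single-channel convolutional LSTM cell (hypothetical, random weights)."""

    def __init__(self, rng):
        def kernel():
            return [[rng.uniform(-0.5, 0.5) for _ in range(3)] for _ in range(3)]
        # One input kernel and one hidden-state kernel per gate: i, f, o, g.
        self.kx = {g: kernel() for g in "ifog"}
        self.kh = {g: kernel() for g in "ifog"}

    def step(self, x, h, c):
        rows, cols = len(x), len(x[0])
        ax = {g: conv3x3(x, self.kx[g]) for g in "ifog"}
        ah = {g: conv3x3(h, self.kh[g]) for g in "ifog"}
        h_new = [[0.0] * cols for _ in range(rows)]
        c_new = [[0.0] * cols for _ in range(rows)]
        for r in range(rows):
            for q in range(cols):
                i, f, o = (sigmoid(ax[g][r][q] + ah[g][r][q]) for g in "ifo")
                cand = math.tanh(ax["g"][r][q] + ah["g"][r][q])
                c_new[r][q] = f * c[r][q] + i * cand
                h_new[r][q] = o * math.tanh(c_new[r][q])
        return h_new, c_new


def forecast(history, horizon, accumulator, forecaster, decode_kernel):
    """Accumulate encoded past frames, then roll the forecaster forward."""
    rows, cols = len(history[0]), len(history[0][0])
    h = [[0.0] * cols for _ in range(rows)]
    c = [[0.0] * cols for _ in range(rows)]
    # 1) Accumulator: fold each (already encoded) past frame into the state.
    for frame in history:
        h, c = accumulator.step(frame, h, c)
    # 2) Forecaster: starts from the accumulator's (h, c) and is called
    #    autoregressively. Assumption: its input is its previous hidden state.
    x, futures = h, []
    for _ in range(horizon):
        h, c = forecaster.step(x, h, c)
        x = h
        # 3) Decoder stand-in: one convolution plus sigmoid gives occupancy.
        logits = conv3x3(h, decode_kernel)
        futures.append([[sigmoid(v) for v in row] for row in logits])
    return futures


if __name__ == "__main__":
    rng = random.Random(0)
    # Three past 5x5 frames with one occupied cell moving right along row 2.
    past = [[[1.0 if (r, q) == (2, t) else 0.0 for q in range(5)]
             for r in range(5)] for t in range(3)]
    grids = forecast(past, 4, ToyConvLSTM(rng), ToyConvLSTM(rng),
                     [[0.1] * 3 for _ in range(3)])
    print(len(grids), "future occupancy grids of size",
          len(grids[0]), "x", len(grids[0][0]))
```

Because the history is folded into a fixed-size state, the loop in step 1 can consume any number of past frames, and `horizon` in step 2 can be set freely, which mirrors the "unlimited lookback" and "arbitrary horizons" properties listed above. With untrained random weights the outputs carry no meaning; only the shapes and the order of operations are illustrative.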

To read the whole article, learn more about the problem with the existing methods and the results, click here.
