MS-RAFT-3D: A Multi-Scale Architecture for Recurrent Image-Based Scene Flow

J Schmid, A Jahedi, NB Senn… - 2025 IEEE International …, 2025 - ieeexplore.ieee.org
2025 IEEE International Conference on Image Processing (ICIP), 2025ieeexplore.ieee.org
Although multi-scale concepts have recently proven useful for recurrent network
architectures in the field of optical flow and stereo, they have not been considered for image-
based scene flow so far. Hence, based on a single-scale recurrent scene flow backbone, we
develop a multi-scale approach that generalizes successful hierarchical ideas from optical
flow to image-based scene flow. By considering suitable concepts for the feature and the
context encoder, the overall coarse-to-fine framework and the training loss, we succeed to …
Although multi-scale concepts have recently proven useful for recurrent network architectures in the field of optical flow and stereo, they have not been considered for image-based scene flow so far. Hence, based on a single-scale recurrent scene flow backbone, we develop a multi-scale approach that generalizes successful hierarchical ideas from optical flow to image-based scene flow. By considering suitable concepts for the feature and the context encoder, the overall coarse-to-fine framework and the training loss, we succeed to design a scene flow approach that outperforms the current state of the art on KITTI and Spring by 8.7% (3.89 vs. 4.26) and 65.8% (9.13 vs. 26.71), respectively. Our code is available at https://github.com/cv-stuttgart/MS-RAFT-3D.
ieeexplore.ieee.org