TARS: Traffic-Aware Radar Scene Flow Estimation

Dominic Spata; Jialong Wu; Marco Braun; Matthias Rottmann

arxiv: 2503.10210 · v2 · submitted 2025-03-13 · 💻 cs.CV

TARS: Traffic-Aware Radar Scene Flow Estimation

Jialong Wu , Marco Braun , Dominic Spata , Matthias Rottmann This is my paper

Pith reviewed 2026-05-22 23:52 UTC · model grok-4.3

classification 💻 cs.CV

keywords radar scene flowtraffic vector fieldautonomous drivingobject detectionmotion estimationradar point cloudstraffic awarenessscene understanding

0 comments

The pith

Radar scene flow estimation improves by enforcing motion rigidity at the traffic level rather than per object.

A machine-rendered reading of the paper's core claim, the machinery that carries it, and where it could break.

The paper introduces TARS to estimate scene flow from sparse radar point clouds by jointly running object detection and flow prediction. It builds a Traffic Vector Field in feature space from the detector output so that flow respects consistent rigid motion across all traffic participants instead of assuming rigidity only inside single detected boxes. This traffic-level view supplies the scene flow branch with both local point-neighbor cues and global consistency constraints. The resulting estimates outperform prior methods on a private dataset and on View-of-Delft.

Core claim

By incorporating the feature map from an object detector trained with detection losses, the method constructs a Traffic Vector Field in feature space that supplies holistic traffic-level scene understanding; scene flow is then computed by combining point-level motion cues from neighbors with traffic-level consistency of rigid motion.

What carries the argument

The Traffic Vector Field (TVF) built in the detector feature space, which supplies traffic-level rigid-motion consistency to the scene flow estimator.

If this is right

Joint detection and flow training makes radar scene flow aware of both the static environment and moving road users.
Point-level neighbor cues plus traffic-level rigid-motion consistency together produce lower errors than either cue alone.
The same detector features improve both detection and downstream flow without requiring separate motion-specific training.
Performance gains appear on both proprietary and public View-of-Delft radar data.

Where Pith is reading between the lines

These are editorial extensions of the paper, not claims the author makes directly.

If traffic-level rigidity generalizes, the same feature-field construction could be tried on denser LiDAR or camera data.
The approach may reduce flow errors in dense urban traffic where individual objects are hard to segment.
Real-time versions would need to verify that the added vector-field computation stays within latency budgets.
The method could be extended to predict future traffic states by propagating the vector field forward in time.

Load-bearing premise

Motion rigidity holds usefully at the traffic level and detector features can be turned into a Traffic Vector Field that improves flow without adding new errors.

What would settle it

A controlled test on the same radar datasets in which removing the traffic-level consistency term raises scene flow error above the instance-level baseline.

Figures

Figures reproduced from arXiv: 2503.10210 by Dominic Spata, Jialong Wu, Marco Braun, Matthias Rottmann.

**Figure 2.** Figure 2: Overview of TARS. TARS employs a hierarchical architecture. At each level, it infers point-level motion cues using a double [PITH_FULL_IMAGE:figures/full_fig_p003_2.png] view at source ↗

**Figure 4.** Figure 4: TVF decoder & scene flow head: capture motion rigidity [PITH_FULL_IMAGE:figures/full_fig_p005_4.png] view at source ↗

**Figure 5.** Figure 5: (a) Temporal update module: leverages low-level point [PITH_FULL_IMAGE:figures/full_fig_p005_5.png] view at source ↗

**Figure 6.** Figure 6: Qualitative results on the proprietary dataset, compared with HALFlow [ [PITH_FULL_IMAGE:figures/full_fig_p008_6.png] view at source ↗

**Figure 7.** Figure 7: Qualitative results on the proprietary dataset, compared with HALFlow [ [PITH_FULL_IMAGE:figures/full_fig_p015_7.png] view at source ↗

read the original abstract

Scene flow provides crucial motion information for autonomous driving. Recent LiDAR scene flow models utilize the rigid-motion assumption at the instance level, assuming objects are rigid bodies. However, these instance-level methods are not suitable for sparse radar point clouds. In this work, we present a novel Traffic-Aware Radar Scene-Flow (TARS) estimation method, which utilizes motion rigidity at the traffic level. To address the challenges in radar scene flow, we perform object detection and scene flow jointly and boost the latter. We incorporate the feature map from the object detector, trained with detection losses, to make radar scene flow aware of the environment and road users. From this, we construct a Traffic Vector Field (TVF) in the feature space to achieve holistic traffic-level scene understanding in our scene flow branch. When estimating the scene flow, we consider both point-level motion cues from point neighbors and traffic-level consistency of rigid motion within the space. TARS outperforms the state of the art on a proprietary dataset and the View-of-Delft dataset, improving the benchmarks by 23% and 15%, respectively.

Editorial analysis

A structured set of objections, weighed in public.

Desk editor's note, referee report, simulated authors' rebuttal, and a circularity audit. Tearing a paper down is the easy half of reading it; the pith above is the substance, this is the friction.

Desk Editor's Note private letter to a colleague

TARS adds a traffic-level rigidity prior to radar scene flow by deriving a TVF from joint detector features, with reported gains that need full experimental details to evaluate.

read the letter

The main takeaway is that this paper adapts scene flow to sparse radar by moving rigidity from the instance level to the traffic level, using a Traffic Vector Field built from the feature map of a jointly trained detector. That combination is the actual new piece; prior LiDAR work stays per-object, and radar papers have mostly stayed with point-neighbor or ego-motion cues, so the TVF step is a concrete attempt to inject broader scene structure without needing dense points per object. The joint detection-flow training is a straightforward way to make the flow branch environment-aware, and the abstract frames the sparsity problem clearly enough that the motivation lands. The reported 23% and 15% improvements on the proprietary set and View-of-Delft are the headline numbers, but they sit on top of that architecture choice. The soft spots are mostly in the evaluation layer. The abstract supplies no metric definitions, no baseline names, no ablation numbers on the TVF component or the loss weights, and no error bars, so the size of the actual contribution is hard to judge from what is shown. The proprietary dataset also blocks direct checking. On the traffic-level rigidity premise, the concern about independent object motions is worth watching in the full text; if the TVF only adds neighbor consistency without an explicit cross-object term, it could end up diluting rather than helping the point-level signals. The paper is aimed at people working on radar perception stacks for driving, especially those already doing multi-task detection and flow. A reader who needs a practical radar baseline or wants to test traffic priors would get something usable if the experiments hold up. I would send it to peer review so the methods and numbers can be examined in detail rather than desk-rejecting on the abstract alone.

Referee Report

2 major / 2 minor

Summary. The paper proposes TARS, a joint object detection and scene flow method for sparse radar point clouds that constructs a Traffic Vector Field (TVF) in feature space from detector features to enforce motion rigidity at the traffic level (rather than instance level), incorporating both point-neighbor cues and traffic-level consistency; it reports 23% and 15% improvements over prior art on a proprietary dataset and the View-of-Delft dataset.

Significance. If the traffic-level rigidity premise holds and the TVF demonstrably improves flow estimation without propagating inconsistent velocities from independent agents, the approach could shift radar scene flow from instance-centric to holistic traffic-aware modeling, with potential value for autonomous driving where radar sparsity limits instance-level methods.

major comments (2)

[Method (TVF and scene flow branch)] Method section on TVF construction: the paper invokes traffic-level rigidity to justify the joint detection+flow pipeline and reported gains, yet provides no explicit equations or loss terms showing how the TVF enforces cross-object motion consistency beyond neighbor-based point cues; this leaves the central premise unverified precisely where it claims advantage over instance-level baselines.
[Experiments] Experiments section: the 23% and 15% benchmark improvements are stated without error bars, ablation isolating the TVF component, dataset split details for the proprietary set, or quantitative comparison of traffic-level vs. instance-level rigidity assumptions, so the evidence does not yet confirm that the gains stem from the claimed traffic-level mechanism rather than other factors.

minor comments (2)

[Abstract] Abstract and introduction: the claimed percentage improvements are given without naming the underlying metrics (e.g., EPE, accuracy@threshold) or baselines, which should be stated explicitly even in the abstract.
[Implementation details] Notation: the free parameters (loss weighting factors between detection and flow) are mentioned in the axiom ledger but their specific values or sensitivity analysis are not referenced in the text.

Simulated Author's Rebuttal

2 responses · 0 unresolved

We thank the referee for the constructive feedback on the TARS manuscript. We agree that additional mathematical detail on the TVF and stronger experimental validation are needed to substantiate the traffic-level rigidity claims. We address each major comment below and will incorporate the requested changes in the revised version.

read point-by-point responses

Referee: [Method (TVF and scene flow branch)] Method section on TVF construction: the paper invokes traffic-level rigidity to justify the joint detection+flow pipeline and reported gains, yet provides no explicit equations or loss terms showing how the TVF enforces cross-object motion consistency beyond neighbor-based point cues; this leaves the central premise unverified precisely where it claims advantage over instance-level baselines.

Authors: We acknowledge that the current description of TVF construction is high-level and does not include explicit equations or loss terms that isolate traffic-level cross-object consistency from point-neighbor cues. In the revision we will add the precise mathematical formulation of the TVF in feature space together with the loss terms that enforce rigid-motion consistency across multiple objects, thereby making the traffic-level mechanism verifiable and clearly differentiated from instance-level baselines. revision: yes
Referee: [Experiments] Experiments section: the 23% and 15% benchmark improvements are stated without error bars, ablation isolating the TVF component, dataset split details for the proprietary set, or quantitative comparison of traffic-level vs. instance-level rigidity assumptions, so the evidence does not yet confirm that the gains stem from the claimed traffic-level mechanism rather than other factors.

Authors: We agree that the experimental section requires these additions to attribute the reported gains specifically to the traffic-level mechanism. In the revised manuscript we will report error bars over multiple runs, present an ablation study that isolates the TVF component, provide the train/test split criteria and ratios for the proprietary dataset (subject to confidentiality constraints), and include a direct quantitative comparison of traffic-level versus instance-level rigidity assumptions. revision: yes

Circularity Check

0 steps flagged

No significant circularity detected

full rationale

The paper describes a joint detection and scene-flow pipeline that constructs a Traffic Vector Field from detector features to enforce traffic-level motion consistency. No equations, loss terms, or parameters are shown that reduce any claimed output or prediction to a quantity defined by the inputs themselves. The reported gains are presented as empirical results on external datasets rather than tautological re-statements of fitted values or self-citations. The derivation therefore remains self-contained.

Axiom & Free-Parameter Ledger

1 free parameters · 1 axioms · 1 invented entities

The central claim rests on a domain assumption of traffic-level rigid motion and introduces the Traffic Vector Field as a new construct without external falsifiable evidence.

free parameters (1)

loss weighting factors between detection and flow tasks
Typical hyperparameters required for joint training; not specified in abstract.

axioms (1)

domain assumption Traffic scenes exhibit consistent rigid motion at a holistic traffic level that can be captured in feature space
Invoked when the method constructs the Traffic Vector Field and enforces traffic-level consistency during scene flow estimation.

invented entities (1)

Traffic Vector Field (TVF) no independent evidence
purpose: To provide holistic traffic-level scene understanding for the scene flow branch
New entity constructed from detector feature maps; no independent evidence outside the model is provided.

pith-pipeline@v0.9.0 · 5725 in / 1449 out tokens · 42032 ms · 2026-05-22T23:52:33.853345+00:00 · methodology

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Weakly Supervised Cross-Modal Learning for 4D Radar Scene Flow Estimation
cs.CV 2026-05 conditional novelty 7.0

Weakly supervised iterative framework for radar scene flow estimation using back-projected 2D instance masks and odometry-based rigid static loss to outperform LiDAR-dependent and fully supervised baselines on the VoD...
Weakly Supervised Cross-Modal Learning for 4D Radar Scene Flow Estimation
cs.CV 2026-05 unverdicted novelty 7.0

A task-specific iterative framework for weakly supervised 4D radar scene flow estimation uses instance-aware self-supervised losses from 2D tracking/segmentation and a rigid static loss from odometry to outperform LiD...

Reference graph

Works this paper leans on

24 extracted references · 24 canonical work pages · cited by 1 Pith paper · 2 internal anchors

[1]

Delving Deeper into Convolutional Networks for Learning Video Representations

Nicolas Ballas, Li Yao, Chris Pal, and Aaron Courville. Delving deeper into convolutional networks for learning video representations. arXiv preprint arXiv:1511.06432 ,

work page internal anchor Pith review Pith/arXiv arXiv
[2]

Radardistill: Boosting radar-based ob- ject detection performance via knowledge distillation from lidar features

Geonho Bang, Kwangjin Choi, Jisong Kim, Dongsuk Kum, and Jun Won Choi. Radardistill: Boosting radar-based ob- ject detection performance via knowledge distillation from lidar features. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15491– 15500, 2024. 7

work page 2024
[3]

V oxelnext: Fully sparse voxelnet for 3d object de- tection and tracking

Yukang Chen, Jianhui Liu, Xiangyu Zhang, Xiaojuan Qi, and Jiaya Jia. V oxelnext: Fully sparse voxelnet for 3d object de- tection and tracking. In Proceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 21674–21683, 2023. 7

work page 2023
[4]

Bi-pointflownet: Bidi- rectional learning for point cloud based scene flow estima- tion

Wencan Cheng and Jong Hwan Ko. Bi-pointflownet: Bidi- rectional learning for point cloud based scene flow estima- tion. In European Conference on Computer Vision , pages 108–124. Springer, 2022. 2, 4

work page 2022
[5]

Robust 3d object detection from lidar-radar point clouds via cross-modal feature augmentation

Jianning Deng, Gabriel Chan, Hantao Zhong, and Chris Xi- aoxuan Lu. Robust 3d object detection from lidar-radar point clouds via cross-modal feature augmentation. In 2024 IEEE International Conference on Robotics and Automation (ICRA), pages 6585–6591. IEEE, 2024. 7

work page 2024
[6]

Self-supervised scene flow estima- tion with 4-d automotive radar

Fangqiang Ding, Zhijun Pan, Yimin Deng, Jianning Deng, and Chris Xiaoxuan Lu. Self-supervised scene flow estima- tion with 4-d automotive radar. IEEE Robotics and Automa- tion Letters, 7(3):8233–8240, 2022. 2, 6, 7, 1

work page 2022
[7]

Hidden gems: 4d radar scene flow learning using cross-modal supervision

Fangqiang Ding, Andras Palffy, Dariu M Gavrila, and Chris Xiaoxuan Lu. Hidden gems: 4d radar scene flow learning using cross-modal supervision. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9340–9349, 2023. 2, 5, 6, 7, 1, 3, 4

work page 2023
[8]

Pillarflownet: A real- time deep multitask network for lidar-based 3d object detec- tion and scene flow estimation

Fabian Duffhauss and Stefan A Baur. Pillarflownet: A real- time deep multitask network for lidar-based 3d object detec- tion and scene flow estimation. In 2020 IEEE/RSJ Interna- tional Conference on Intelligent Robots and Systems (IROS), pages 10734–10741. IEEE, 2020. 2, 3

work page 2020
[9]

Pointrnn: Point recurrent neural network for moving point cloud processing

Hehe Fan and Yi Yang. Pointrnn: Point recurrent neural network for moving point cloud processing. arXiv preprint arXiv:1910.08287, 2019. 5, 2

work page arXiv 1910
[10]

A solution for the best rotation to re- late two sets of vectors

Wolfgang Kabsch. A solution for the best rotation to re- late two sets of vectors. Acta Crystallographica Section A: Crystal Physics, Diffraction, Theoretical and General Crys- tallography, 32(5):922–923, 1976. 2, 1

work page 1976
[11]

I can’t believe it’s not scene flow! InEuropean Conference on Computer Vision , pages 242–257

Ishan Khatri, Kyle Vedder, Neehar Peri, Deva Ramanan, and James Hays. I can’t believe it’s not scene flow! InEuropean Conference on Computer Vision , pages 242–257. Springer,

work page
[12]

Flow4d: Leveraging 4d voxel network for lidar scene flow estimation

Jaeyeul Kim, Jungwan Woo, Ukcheol Shin, Jean Oh, and Sunghoon Im. Flow4d: Leveraging 4d voxel network for lidar scene flow estimation. IEEE Robotics and Automation Letters, 2025. 2, 7, 3

work page 2025
[13]

Flow- step3d: Model unrolling for self-supervised scene flow es- timation

Yair Kittenplon, Yonina C Eldar, and Dan Raviv. Flow- step3d: Model unrolling for self-supervised scene flow es- timation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages 4114– 4123, 2021. 2, 6, 7, 4

work page 2021
[14]

Pointpillars: Fast encoders for object detection from point clouds

Alex H Lang, Sourabh V ora, Holger Caesar, Lubing Zhou, Jiong Yang, and Oscar Beijbom. Pointpillars: Fast encoders for object detection from point clouds. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12697–12705, 2019. 3, 4

work page 2019
[15]

Flownet3d: Learning scene flow in 3d point clouds

Xingyu Liu, Charles R Qi, and Leonidas J Guibas. Flownet3d: Learning scene flow in 3d point clouds. In Pro- ceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 529–537, 2019. 1, 2, 4

work page 2019
[16]

Lion: Linear group rnn for 3d object detection in point clouds

Zhe Liu, Jinghua Hou, Xinyu Wang, Xiaoqing Ye, Jingdong Wang, Hengshuang Zhao, and Xiang Bai. Lion: Linear group rnn for 3d object detection in point clouds. Advances in Neural Information Processing Systems, 37:13601–13626,

work page
[17]

Ratrack: moving object detection and tracking with 4d radar point cloud

Zhijun Pan, Fangqiang Ding, Hantao Zhong, and Chris Xi- aoxuan Lu. Ratrack: moving object detection and tracking with 4d radar point cloud. In 2024 IEEE International Con- ference on Robotics and Automation (ICRA) , pages 4480–

work page 2024
[18]

Hierarchical attention learning of scene flow in 3d point clouds

Guangming Wang, Xinrui Wu, Zhe Liu, and Hesheng Wang. Hierarchical attention learning of scene flow in 3d point clouds. IEEE Transactions on Image Processing, 30:5168– 5181, 2021. 2, 3, 4, 7, 8, 5

work page 2021
[19]

Pv-raft: Point-voxel correlation fields for scene flow estima- tion of point clouds

Yi Wei, Ziyi Wang, Yongming Rao, Jiwen Lu, and Jie Zhou. Pv-raft: Point-voxel correlation fields for scene flow estima- tion of point clouds. In Proceedings of the IEEE/CVF con- ference on computer vision and pattern recognition , pages 6954–6963, 2021. 2, 4

work page 2021
[20]

Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting

Benjamin Wilson, William Qi, Tanmay Agarwal, John Lambert, Jagjeet Singh, Siddhesh Khandelwal, Bowen Pan, Ratnesh Kumar, Andrew Hartnett, Jhony Kaesemodel Pontes, et al. Argoverse 2: Next generation datasets for self-driving perception and forecasting. arXiv preprint arXiv:2301.00493, 2023. 3

work page internal anchor Pith review Pith/arXiv arXiv 2023
[21]

Cbam: Convolutional block attention module

Sanghyun Woo, Jongchan Park, Joon-Young Lee, and In So Kweon. Cbam: Convolutional block attention module. In Proceedings of the European conference on computer vision (ECCV), pages 3–19, 2018. 5, 1

work page 2018
[22]

Pointpwc-net: Cost volume on point clouds for (self- ) supervised scene flow estimation

Wenxuan Wu, Zhi Yuan Wang, Zhuwen Li, Wei Liu, and Li Fuxin. Pointpwc-net: Cost volume on point clouds for (self- ) supervised scene flow estimation. In Computer Vision– ECCV 2020: 16th European Conference, Glasgow, UK, Au- gust 23–28, 2020, Proceedings, Part V 16 , pages 88–107. Springer, 2020. 1, 2, 3, 4, 7

work page 2020
[23]

DeFlow: Decoder of scene flow network in autonomous driving

Qingwen Zhang, Yi Yang, Heng Fang, Ruoyu Geng, and Patric Jensfelt. DeFlow: Decoder of scene flow network in autonomous driving. In 2024 IEEE International Confer- ence on Robotics and Automation (ICRA), pages 2105–2111,

work page 2024
[24]

H3dnet: 3d object detection using hybrid geometric primi- tives

Zaiwei Zhang, Bo Sun, Haitao Yang, and Qixing Huang. H3dnet: 3d object detection using hybrid geometric primi- tives. In Computer Vision–ECCV 2020: 16th European Con- ference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XII 16, pages 311–329. Springer, 2020. 3

work page 2020

[1] [1]

Delving Deeper into Convolutional Networks for Learning Video Representations

Nicolas Ballas, Li Yao, Chris Pal, and Aaron Courville. Delving deeper into convolutional networks for learning video representations. arXiv preprint arXiv:1511.06432 ,

work page internal anchor Pith review Pith/arXiv arXiv

[2] [2]

Radardistill: Boosting radar-based ob- ject detection performance via knowledge distillation from lidar features

Geonho Bang, Kwangjin Choi, Jisong Kim, Dongsuk Kum, and Jun Won Choi. Radardistill: Boosting radar-based ob- ject detection performance via knowledge distillation from lidar features. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 15491– 15500, 2024. 7

work page 2024

[3] [3]

V oxelnext: Fully sparse voxelnet for 3d object de- tection and tracking

Yukang Chen, Jianhui Liu, Xiangyu Zhang, Xiaojuan Qi, and Jiaya Jia. V oxelnext: Fully sparse voxelnet for 3d object de- tection and tracking. In Proceedings of the IEEE/CVF Con- ference on Computer Vision and Pattern Recognition, pages 21674–21683, 2023. 7

work page 2023

[4] [4]

Bi-pointflownet: Bidi- rectional learning for point cloud based scene flow estima- tion

Wencan Cheng and Jong Hwan Ko. Bi-pointflownet: Bidi- rectional learning for point cloud based scene flow estima- tion. In European Conference on Computer Vision , pages 108–124. Springer, 2022. 2, 4

work page 2022

[5] [5]

Robust 3d object detection from lidar-radar point clouds via cross-modal feature augmentation

Jianning Deng, Gabriel Chan, Hantao Zhong, and Chris Xi- aoxuan Lu. Robust 3d object detection from lidar-radar point clouds via cross-modal feature augmentation. In 2024 IEEE International Conference on Robotics and Automation (ICRA), pages 6585–6591. IEEE, 2024. 7

work page 2024

[6] [6]

Self-supervised scene flow estima- tion with 4-d automotive radar

Fangqiang Ding, Zhijun Pan, Yimin Deng, Jianning Deng, and Chris Xiaoxuan Lu. Self-supervised scene flow estima- tion with 4-d automotive radar. IEEE Robotics and Automa- tion Letters, 7(3):8233–8240, 2022. 2, 6, 7, 1

work page 2022

[7] [7]

Hidden gems: 4d radar scene flow learning using cross-modal supervision

Fangqiang Ding, Andras Palffy, Dariu M Gavrila, and Chris Xiaoxuan Lu. Hidden gems: 4d radar scene flow learning using cross-modal supervision. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 9340–9349, 2023. 2, 5, 6, 7, 1, 3, 4

work page 2023

[8] [8]

Pillarflownet: A real- time deep multitask network for lidar-based 3d object detec- tion and scene flow estimation

Fabian Duffhauss and Stefan A Baur. Pillarflownet: A real- time deep multitask network for lidar-based 3d object detec- tion and scene flow estimation. In 2020 IEEE/RSJ Interna- tional Conference on Intelligent Robots and Systems (IROS), pages 10734–10741. IEEE, 2020. 2, 3

work page 2020

[9] [9]

Pointrnn: Point recurrent neural network for moving point cloud processing

Hehe Fan and Yi Yang. Pointrnn: Point recurrent neural network for moving point cloud processing. arXiv preprint arXiv:1910.08287, 2019. 5, 2

work page arXiv 1910

[10] [10]

A solution for the best rotation to re- late two sets of vectors

Wolfgang Kabsch. A solution for the best rotation to re- late two sets of vectors. Acta Crystallographica Section A: Crystal Physics, Diffraction, Theoretical and General Crys- tallography, 32(5):922–923, 1976. 2, 1

work page 1976

[11] [11]

I can’t believe it’s not scene flow! InEuropean Conference on Computer Vision , pages 242–257

Ishan Khatri, Kyle Vedder, Neehar Peri, Deva Ramanan, and James Hays. I can’t believe it’s not scene flow! InEuropean Conference on Computer Vision , pages 242–257. Springer,

work page

[12] [12]

Flow4d: Leveraging 4d voxel network for lidar scene flow estimation

Jaeyeul Kim, Jungwan Woo, Ukcheol Shin, Jean Oh, and Sunghoon Im. Flow4d: Leveraging 4d voxel network for lidar scene flow estimation. IEEE Robotics and Automation Letters, 2025. 2, 7, 3

work page 2025

[13] [13]

Flow- step3d: Model unrolling for self-supervised scene flow es- timation

Yair Kittenplon, Yonina C Eldar, and Dan Raviv. Flow- step3d: Model unrolling for self-supervised scene flow es- timation. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition , pages 4114– 4123, 2021. 2, 6, 7, 4

work page 2021

[14] [14]

Pointpillars: Fast encoders for object detection from point clouds

Alex H Lang, Sourabh V ora, Holger Caesar, Lubing Zhou, Jiong Yang, and Oscar Beijbom. Pointpillars: Fast encoders for object detection from point clouds. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 12697–12705, 2019. 3, 4

work page 2019

[15] [15]

Flownet3d: Learning scene flow in 3d point clouds

Xingyu Liu, Charles R Qi, and Leonidas J Guibas. Flownet3d: Learning scene flow in 3d point clouds. In Pro- ceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 529–537, 2019. 1, 2, 4

work page 2019

[16] [16]

Lion: Linear group rnn for 3d object detection in point clouds

Zhe Liu, Jinghua Hou, Xinyu Wang, Xiaoqing Ye, Jingdong Wang, Hengshuang Zhao, and Xiang Bai. Lion: Linear group rnn for 3d object detection in point clouds. Advances in Neural Information Processing Systems, 37:13601–13626,

work page

[17] [17]

Ratrack: moving object detection and tracking with 4d radar point cloud

Zhijun Pan, Fangqiang Ding, Hantao Zhong, and Chris Xi- aoxuan Lu. Ratrack: moving object detection and tracking with 4d radar point cloud. In 2024 IEEE International Con- ference on Robotics and Automation (ICRA) , pages 4480–

work page 2024

[18] [18]

Hierarchical attention learning of scene flow in 3d point clouds

Guangming Wang, Xinrui Wu, Zhe Liu, and Hesheng Wang. Hierarchical attention learning of scene flow in 3d point clouds. IEEE Transactions on Image Processing, 30:5168– 5181, 2021. 2, 3, 4, 7, 8, 5

work page 2021

[19] [19]

Pv-raft: Point-voxel correlation fields for scene flow estima- tion of point clouds

Yi Wei, Ziyi Wang, Yongming Rao, Jiwen Lu, and Jie Zhou. Pv-raft: Point-voxel correlation fields for scene flow estima- tion of point clouds. In Proceedings of the IEEE/CVF con- ference on computer vision and pattern recognition , pages 6954–6963, 2021. 2, 4

work page 2021

[20] [20]

Argoverse 2: Next Generation Datasets for Self-Driving Perception and Forecasting

Benjamin Wilson, William Qi, Tanmay Agarwal, John Lambert, Jagjeet Singh, Siddhesh Khandelwal, Bowen Pan, Ratnesh Kumar, Andrew Hartnett, Jhony Kaesemodel Pontes, et al. Argoverse 2: Next generation datasets for self-driving perception and forecasting. arXiv preprint arXiv:2301.00493, 2023. 3

work page internal anchor Pith review Pith/arXiv arXiv 2023

[21] [21]

Cbam: Convolutional block attention module

Sanghyun Woo, Jongchan Park, Joon-Young Lee, and In So Kweon. Cbam: Convolutional block attention module. In Proceedings of the European conference on computer vision (ECCV), pages 3–19, 2018. 5, 1

work page 2018

[22] [22]

Pointpwc-net: Cost volume on point clouds for (self- ) supervised scene flow estimation

Wenxuan Wu, Zhi Yuan Wang, Zhuwen Li, Wei Liu, and Li Fuxin. Pointpwc-net: Cost volume on point clouds for (self- ) supervised scene flow estimation. In Computer Vision– ECCV 2020: 16th European Conference, Glasgow, UK, Au- gust 23–28, 2020, Proceedings, Part V 16 , pages 88–107. Springer, 2020. 1, 2, 3, 4, 7

work page 2020

[23] [23]

DeFlow: Decoder of scene flow network in autonomous driving

Qingwen Zhang, Yi Yang, Heng Fang, Ruoyu Geng, and Patric Jensfelt. DeFlow: Decoder of scene flow network in autonomous driving. In 2024 IEEE International Confer- ence on Robotics and Automation (ICRA), pages 2105–2111,

work page 2024

[24] [24]

H3dnet: 3d object detection using hybrid geometric primi- tives

Zaiwei Zhang, Bo Sun, Haitao Yang, and Qixing Huang. H3dnet: 3d object detection using hybrid geometric primi- tives. In Computer Vision–ECCV 2020: 16th European Con- ference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XII 16, pages 311–329. Springer, 2020. 3

work page 2020