FAT decomposes structured prediction into specialist hypothesis generation and foundation-model proxy reasoning, yielding consistent gains over baselines on detection, trajectory, and segmentation tasks.
Vision language models in autonomous driving: A survey and outlook
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it