Contrast & Compress: Learning Lightweight Embeddings for Short Trajectories

Abhishek Vivekanandan; Christian Hubschneider; J. Marius Z\"ollner

arxiv: 2506.02571 · v1 · pith:3ORWK6A3new · submitted 2025-06-03 · 💻 cs.CV

Contrast & Compress: Learning Lightweight Embeddings for Short Trajectories

Abhishek Vivekanandan , Christian Hubschneider , J. Marius Z\"ollner This is my paper

classification 💻 cs.CV

keywords embeddingstrajectorieslearningmotionsimilaritycompactcontrastivecosine

0 comments

read the original abstract

The ability to retrieve semantically and directionally similar short-range trajectories with both accuracy and efficiency is foundational for downstream applications such as motion forecasting and autonomous navigation. However, prevailing approaches often depend on computationally intensive heuristics or latent anchor representations that lack interpretability and controllability. In this work, we propose a novel framework for learning fixed-dimensional embeddings for short trajectories by leveraging a Transformer encoder trained with a contrastive triplet loss that emphasize the importance of discriminative feature spaces for trajectory data. We analyze the influence of Cosine and FFT-based similarity metrics within the contrastive learning paradigm, with a focus on capturing the nuanced directional intent that characterizes short-term maneuvers. Our empirical evaluation on the Argoverse 2 dataset demonstrates that embeddings shaped by Cosine similarity objectives yield superior clustering of trajectories by both semantic and directional attributes, outperforming FFT-based baselines in retrieval tasks. Notably, we show that compact Transformer architectures, even with low-dimensional embeddings (e.g., 16 dimensions, but qualitatively down to 4), achieve a compelling balance between retrieval performance (minADE, minFDE) and computational overhead, aligning with the growing demand for scalable and interpretable motion priors in real-time systems. The resulting embeddings provide a compact, semantically meaningful, and efficient representation of trajectory data, offering a robust alternative to heuristic similarity measures and paving the way for more transparent and controllable motion forecasting pipelines.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Objective-Behavior Alignment: Diagnostics for MORL Policy Selection
cs.LG 2026-06 unverdicted novelty 5.0

Proposes an exploratory diagnostic workflow to highlight behavioral variation along MORL Pareto fronts not captured by objective values, with validation on grid and continuous control tasks.
Recall to Predict: Grounding Motion Forecasting in Interpretable Motion Bank
cs.CV 2026-05 unverdicted novelty 5.0

A differentiable motion forecasting model retrieves and refines interpretable trajectory anchors from a contrastively learned motion bank to improve transparency without sacrificing multi-modal accuracy.