Temporal Convolutional Networks: A Unified Approach to Action Segmentation

Austin Reiter; Colin Lea; Gregory D. Hager; Rene Vidal

arxiv: 1608.08242 · v1 · pith:D3OYRN2Inew · submitted 2016-08-29 · 💻 cs.CV

Temporal Convolutional Networks: A Unified Approach to Action Segmentation

Colin Lea , Rene Vidal , Austin Reiter , Gregory D. Hager This is my paper

classification 💻 cs.CV

keywords actionconvolutionalnetworkrelationshipssegmentationtemporalapproachcaptures

0 comments

read the original abstract

The dominant paradigm for video-based action segmentation is composed of two steps: first, for each frame, compute low-level features using Dense Trajectories or a Convolutional Neural Network that encode spatiotemporal information locally, and second, input these features into a classifier that captures high-level temporal relationships, such as a Recurrent Neural Network (RNN). While often effective, this decoupling requires specifying two separate models, each with their own complexities, and prevents capturing more nuanced long-range spatiotemporal relationships. We propose a unified approach, as demonstrated by our Temporal Convolutional Network (TCN), that hierarchically captures relationships at low-, intermediate-, and high-level time-scales. Our model achieves superior or competitive performance using video or sensor data on three public action segmentation datasets and can be trained in a fraction of the time it takes to train an RNN.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

MoCo-AIS: A Contrastive Learning Framework for Similarity Computation of Vessel Trajectories
cs.AI 2026-06 unverdicted novelty 6.0

MoCo-AIS is a MoCo-based contrastive learning framework that learns vessel trajectory embeddings and improves similarity computation over baselines on large-scale real-world AIS datasets while offering a benchmarking ...