pith. machine review for the scientific record.

arxiv: 1906.03363 · v1 · submitted 2019-06-08 · 💻 cs.CV

Recognition: unknown

TransNet: A deep network for fast detection of common shot transitions

classification 💻 cs.CV
keywords network · dataset · detection · selected · shot · transitions · transnet · above

Shot boundary detection (SBD) is an important first step in many video processing applications. This paper presents a simple modular convolutional neural network architecture that achieves state-of-the-art results on the RAI dataset with well above real-time inference speed even on a single mediocre GPU. The network employs dilated convolutions and operates just on small resized frames. The training process employed randomly generated transitions using selected shots from the TRECVID IACC.3 dataset. The code and a selected trained network will be available at https://github.com/soCzech/TransNet.
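The abstract says training used randomly generated transitions built from selected shots. A minimal sketch of that idea follows. It is not the authors' code. It assumes two transition types (a hard cut and a linear cross-dissolve), a 50/50 choice between them, and frames stored as flat lists of pixel intensities in [0, 1]. The names `make_transition`, `random_transition`, and `max_dissolve` are hypothetical.

```python
import random


def make_transition(shot_a, shot_b, length):
    """Join two shots with a hard cut (length == 0) or a linear cross-dissolve.

    Each shot is a list of frames; each frame is a flat list of floats.
    Returns (frames, (start, end)), where frames[start:end] are the blended
    frames. For a hard cut start == end, the index of shot_b's first frame.
    All of this is an illustrative assumption, not the paper's procedure.
    """
    if length < 0 or length > min(len(shot_a), len(shot_b)):
        raise ValueError("transition length must fit inside both shots")

    start = len(shot_a) - length
    blended = []
    for i in range(length):
        # alpha rises from 1/(length+1) to length/(length+1), so every
        # blended frame contains some of both shots.
        alpha = (i + 1) / (length + 1)
        frame_a = shot_a[start + i]
        frame_b = shot_b[i]
        blended.append(
            [(1 - alpha) * pa + alpha * pb for pa, pb in zip(frame_a, frame_b)]
        )

    frames = shot_a[:start] + blended + shot_b[length:]
    return frames, (start, start + length)


def random_transition(shots, rng, max_dissolve=30):
    """Pick two shots at random and join them with a random transition.

    The 50/50 split between cuts and dissolves and the max_dissolve bound
    are assumptions made for this sketch.
    """
    shot_a, shot_b = rng.sample(shots, 2)
    if rng.random() < 0.5:
        length = 0  # hard cut
    else:
        longest = min(max_dissolve, len(shot_a), len(shot_b))
        length = rng.randint(1, longest)
    return make_transition(shot_a, shot_b, length)


if __name__ == "__main__":
    # Two tiny synthetic "shots": four black frames and four white frames.
    black = [[0.0, 0.0] for _ in range(4)]
    white = [[1.0, 1.0] for _ in range(4)]
    frames, span = make_transition(black, white, 3)
    print(span, [frame[0] for frame in frames])
```

The returned span can serve as the training label, marking which frames of the generated sequence belong to the transition. Generating the examples this way makes the labels exact by construction, so no manual annotation of the source shots is needed.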

This paper has not been read by Pith yet.


Forward citations

Cited by 2 Pith papers

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. TransVLM: A Vision-Language Framework and Benchmark for Detecting Any Shot Transitions

    cs.CV · 2026-04 · unverdicted · novelty 7.0

    TransVLM formalizes Shot Transition Detection as identifying full temporal transition segments rather than single cut points and introduces a VLM that injects optical flow as a motion prior via simple feature fusion, ...

  2. OmniShotCut: Holistic Relational Shot Boundary Detection with Shot-Query Transformer

    cs.CV · 2026-04 · unverdicted · novelty 7.0

    OmniShotCut treats shot boundary detection as structured relational prediction via a shot-query Transformer, uses fully synthetic transitions for training data, and releases OmniShotCutBench for evaluation.