pith. sign in

arxiv: 1904.09062 · v1 · pith:4A2CQXWEnew · submitted 2019-04-19 · 📡 eess.IV · stat.ML

Semi-Supervised First-Person Activity Recognition in Body-Worn Video

classification 📡 eess.IV stat.ML
keywords footagebody-wornvideodatasetsreal-worldactivitiesactivitychallenges
0
0 comments X
read the original abstract

Body-worn cameras are now commonly used for logging daily life, sports, and law enforcement activities, creating a large volume of archived footage. This paper studies the problem of classifying frames of footage according to the activity of the camera-wearer with an emphasis on application to real-world police body-worn video. Real-world datasets pose a different set of challenges from existing egocentric vision datasets: the amount of footage of different activities is unbalanced, the data contains personally identifiable information, and in practice it is difficult to provide substantial training footage for a supervised approach. We address these challenges by extracting features based exclusively on motion information then segmenting the video footage using a semi-supervised classification algorithm. On publicly available datasets, our method achieves results comparable to, if not better than, supervised and/or deep learning methods using a fraction of the training data. It also shows promising results on real-world police body-worn video.

This paper has not been read by Pith yet.

discussion (0)

Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

  1. Visual Timelines of Police Encounters in Body-Worn Camera Footage: Operational Context and Activity Cataloging for Training and Analysis in OpenBWC

    cs.CV 2026-05 unverdicted novelty 5.0

    A pipeline that converts body-worn camera footage into labeled visual timelines by classifying 10-second windows along operational-context and motion-intensity axes using CLIP and optical-flow features.