Semi-Supervised First-Person Activity Recognition in Body-Worn Video

Adam Dhillon; Alexander Song; Andrea L. Bertozzi; Hao Li; Honglin Chen; Matt Haberland; Osman Akar; P. Jeffrey Brantingham; Tiankuang Zhou

arxiv: 1904.09062 · v1 · pith:4A2CQXWEnew · submitted 2019-04-19 · 📡 eess.IV · stat.ML

classification: eess.IV, stat.ML

keywords: footage, body-worn, video, datasets, real-world, activities, activity, challenges

Body-worn cameras are now commonly used for logging daily life, sports, and law enforcement activities, creating a large volume of archived footage. This paper studies the problem of classifying frames of footage according to the activity of the camera-wearer, with an emphasis on application to real-world police body-worn video. Real-world datasets pose a different set of challenges from existing egocentric vision datasets: the amount of footage of different activities is unbalanced, the data contains personally identifiable information, and in practice it is difficult to provide substantial training footage for a supervised approach. We address these challenges by extracting features based exclusively on motion information, then segmenting the video footage using a semi-supervised classification algorithm. On publicly available datasets, our method achieves results comparable to, if not better than, supervised and/or deep learning methods while using only a fraction of the training data. It also shows promising results on real-world police body-worn video.
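
The abstract does not say which features or which semi-supervised algorithm the authors use, so the following is only an illustrative sketch of the general idea: a few labeled frames pass their activity labels to many unlabeled frames through a similarity graph built on motion features. Generic label propagation stands in for the paper's method here. The function name `propagate_labels`, the one-dimensional "motion magnitude" values, and the parameters `sigma` and `n_iter` are all assumptions made for this example.

```python
import math


def propagate_labels(features, labels, n_classes, sigma=0.5, n_iter=50):
    """Generic graph-based label propagation (illustrative, not the paper's method).

    features: one feature vector (list of floats) per frame.
    labels:   class id per frame, or None where the frame is unlabeled.
    Returns a predicted class id for every frame.
    """
    n = len(features)

    # Gaussian similarity between every pair of distinct frames.
    weights = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            if i != j:
                d2 = sum((a - b) ** 2 for a, b in zip(features[i], features[j]))
                weights[i][j] = math.exp(-d2 / (2 * sigma ** 2))

    # One-hot scores for labeled frames, uniform scores for unlabeled ones.
    scores = []
    for y in labels:
        if y is None:
            scores.append([1.0 / n_classes] * n_classes)
        else:
            scores.append([1.0 if c == y else 0.0 for c in range(n_classes)])

    for _ in range(n_iter):
        updated = []
        for i in range(n):
            total = sum(weights[i])
            if labels[i] is not None or total == 0.0:
                # Labeled frames stay clamped to their known label.
                updated.append(scores[i])
                continue
            # Unlabeled frames take the similarity-weighted mean of neighbors.
            updated.append([
                sum(weights[i][j] * scores[j][c] for j in range(n)) / total
                for c in range(n_classes)
            ])
        scores = updated

    return [max(range(n_classes), key=lambda c: s[c]) for s in scores]


# Hypothetical motion magnitudes: low values ~ "standing", high ~ "running".
frame_features = [[0.1], [0.2], [0.15], [0.3], [2.0], [2.1], [1.9], [2.2]]
# Only one labeled frame per activity; the rest are unlabeled.
frame_labels = [0, None, None, None, 1, None, None, None]

predicted = propagate_labels(frame_features, frame_labels, n_classes=2)
print(predicted)
```

With only two labeled frames out of eight, the unlabeled frames in this toy data inherit the label of the cluster they sit in, which is the sense in which a semi-supervised approach can work with a fraction of the training data. Real footage would need actual motion features (for example, derived from optical flow) and a sparse graph to scale beyond a handful of frames.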

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

Visual Timelines of Police Encounters in Body-Worn Camera Footage: Operational Context and Activity Cataloging for Training and Analysis in OpenBWC
cs.CV · 2026-05 · unverdicted · novelty 5.0

A pipeline that converts body-worn camera footage into labeled visual timelines by classifying 10-second windows along operational-context and motion-intensity axes using CLIP and optical-flow features.