BAR: Bayesian Activity Recognition using variational inference

Mahesh Subedar; Omesh Tickoo; Ranganath Krishnan

arxiv: 1811.03305 · v2 · pith:P4CS3O63new · submitted 2018-11-08 · 💻 cs.NE · cs.CV· cs.LG· stat.ML

BAR: Bayesian Activity Recognition using variational inference

Ranganath Krishnan , Mahesh Subedar , Omesh Tickoo This is my paper

classification 💻 cs.NE cs.CVcs.LGstat.ML

keywords activitydnnsrecognitionbayesianuncertaintydeepinferencemodel

0 comments

read the original abstract

Uncertainty estimation in deep neural networks is essential for designing reliable and robust AI systems. Applications such as video surveillance for identifying suspicious activities are designed with deep neural networks (DNNs), but DNNs do not provide uncertainty estimates. Capturing reliable uncertainty estimates in safety and security critical applications will help to establish trust in the AI system. Our contribution is to apply Bayesian deep learning framework to visual activity recognition application and quantify model uncertainty along with principled confidence. We utilize the stochastic variational inference technique while training the Bayesian DNNs to infer the approximate posterior distribution around model parameters and perform Monte Carlo sampling on the posterior of model parameters to obtain the predictive distribution. We show that the Bayesian inference applied to DNNs provide reliable confidence measures for visual activity recognition task as compared to conventional DNNs. We also show that our method improves the visual activity recognition precision-recall AUC by 6.2% compared to non-Bayesian baseline. We evaluate our models on Moments-In-Time (MiT) activity recognition dataset by selecting a subset of in- and out-of-distribution video samples.

This paper has not been read by Pith yet.

discussion (0)

Forward citations

Cited by 1 Pith paper

Reviewed papers in the Pith corpus that reference this work. Sorted by Pith novelty score.

InternVideo: General Video Foundation Models via Generative and Discriminative Learning
cs.CV 2022-12 unverdicted novelty 5.0

InternVideo combines masked video modeling and video-language contrastive learning into a single foundation model that reaches state-of-the-art results on 39 video datasets including 91.1% top-1 on Kinetics-400.