MSU-AVIS dataset: Fusing face and voice modalities for biometric recognition in indoor surveillance videos
1 Pith paper cites this work. Polarity classification is still indexing.
fields: cs.CV (1)
years: 2019 (1)
verdicts: UNVERDICTED (1)
representative citing papers
Audio-Visual Kinship Verification
Introduces the TALKIN dataset and a deep Siamese fusion network, showing that combining audio and visual modalities outperforms uni-modal baselines for kinship verification.