Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH) , pages=

· 2021 · DOI 10.21437/interspeech.2021-698

6 Pith papers cite this work. Polarity classification is still indexing.

6 Pith papers citing it

open at publisher browse 6 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

[Emerging Ideas] Artificial Tripartite Intelligence: A Bio-Inspired, Sensor-First Architecture for Physical AI

cs.AI · 2026-04-15 · unverdicted · novelty 7.0

ATI is a tripartite bio-inspired architecture for physical AI that co-designs sensing and inference, shown in a camera prototype to raise accuracy from 53.8% to 88% and cut remote invocations by 43.3%.

Parameter-efficient Dual-encoder Architecture with Differentiable Choquet Integral Fusion for Underwater Acoustic Classification

cs.SD · 2026-06-01 · unverdicted · novelty 6.0

A parameter-efficient dual-encoder model with differentiable Choquet integral fusion improves underwater acoustic classification accuracy over single-encoder baselines on DeepShip and ShipsEar datasets.

Giving Sensors a Voice: Multimodal JEPA for Semantic Time-Series Embeddings

cs.LG · 2026-05-29 · unverdicted · novelty 6.0

CHARM learns semantic time-series embeddings via channel-aware JEPA training in an order-equivariant Transformer, achieving strong linear-probe performance on anomaly detection, classification, and forecasting.

Audio Deepfake Detection with Half-Truth Localisation Using Cross-Attentive Feature Fusion

cs.SD · 2026-05-28 · unverdicted · novelty 5.0

CAFNet performs joint ternary classification and temporal boundary regression for half-truth audio deepfakes via cross-attentive fusion of MFCC, LFCC, and Chroma-STFT features, reporting 92.71% accuracy and 0.075s MAE on MLADDC T2+T3.

CoarseSoundNet: Building a reliable model for ecological soundscape analysis

cs.SD · 2026-05-20 · unverdicted · novelty 4.0 · 2 refs

The paper introduces CoarseSoundNet, a deep learning model for classifying biophony, geophony, and anthropophony in passive acoustic monitoring recordings, reporting performance gains from additional similar data, a silence class, and decision thresholds, plus a case study on acoustic index trends.

Looking Beyond the Obvious: A Survey on Abstract Concept Recognition for Video Understanding

cs.CV · 2025-08-28 · unverdicted · novelty 3.0

A literature survey on abstract concept recognition in videos that catalogs prior tasks and datasets while advocating for foundation models and reuse of decades of community experience.

citing papers explorer

Showing 1 of 1 citing paper after filters.

Looking Beyond the Obvious: A Survey on Abstract Concept Recognition for Video Understanding cs.CV · 2025-08-28 · unverdicted · none · ref 154
A literature survey on abstract concept recognition in videos that catalogs prior tasks and datasets while advocating for foundation models and reuse of decades of community experience.

Proceedings of the Annual Conference of the International Speech Communication Association (INTERSPEECH) , pages=

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer