Small-footprint keyword spotting using deep neural networks

Guoguo Chen, Carolina Parada, Georg Heigold · 2014 · DOI 10.1109/icassp

8 Pith papers cite this work. Polarity classification is still indexing.

8 Pith papers citing it

open at publisher browse 8 citing papers

citation-role summary

background 1

citation-polarity summary

background 1

representative citing papers

PolySpeech-100: A Large-Scale Benchmark for Speech Understanding Across 100+ Languages and Dialects

cs.CL · 2026-05-31 · unverdicted · novelty 7.0

PolySpeech-100 is a new benchmark for native-level speech comprehension across 110 linguistic variants that evaluates 22 models and reports E2E advantages on dialects, robustness gaps on low-resource languages, and degradation from Chain-of-Thought prompting.

Cross-Modal Knowledge Distillation without Paired Data: Theoretical Foundation and Algorithm

cs.AI · 2026-06-09 · unverdicted · novelty 6.0

A distribution-alignment framework for unpaired cross-modal knowledge distillation with theoretical guarantees on feature and label alignment.

Boosting Multimodal Federated Learning via Chained Modality Optimization

cs.DC · 2026-06-01 · unverdicted · novelty 6.0

FedMChain improves multimodal federated learning by chaining modality-wise optimization phases with error-compensated regularization and sparse sign-guided aggregation to mitigate modality competition and cut communication overhead.

APEX: Audio Prototype EXplanations for Classification Tasks

cs.SD · 2026-05-11 · unverdicted · novelty 6.0

APEX generates four types of prototype-based explanations for pre-trained audio classifiers that preserve output invariance and target acoustic properties better than gradient methods applied to spectrograms.

Interpreting Multi-Branch Anti-Spoofing Architectures: Correlating Internal Strategy with Empirical Performance

cs.SD · 2026-02-14 · unverdicted · novelty 6.0

A framework using covariance-based spectral signatures and TreeSHAP attributions on AASIST3 branches identifies four operational archetypes and a flawed specialization mode that explains high error rates on specific spoofing attacks.

GS-NFS: Bandwidth-adaptive Streaming of Dynamic Gaussian Splats and Point Clouds

cs.MM · 2026-06-04 · unverdicted · novelty 5.0

GS-NFS accelerates dynamic 3DGS encoding and decoding by 1-2 orders of magnitude on GPU while maintaining competitive compression ratios and rendering quality.

Hardware-Software Co-Design of Scalable, Energy-Efficient Analog Recurrent Computations

cs.AR · 2026-05-12 · unverdicted · novelty 5.0 · 2 refs

BMRUs enable analog recurrent neural network hardware via discrete outputs that suppress noise 20-fold, with one-to-one parameter-to-circuit mapping and linear power scaling for recurrence.

Balalaika: Data-Centric, Prosody-Aware Annotation Pipeline for Russian Speech

cs.CL · 2025-07-17

citing papers explorer

Showing 1 of 1 citing paper after filters.

Balalaika: Data-Centric, Prosody-Aware Annotation Pipeline for Russian Speech cs.CL · 2025-07-17 · unreviewed · ref 17

Small-footprint keyword spotting using deep neural networks

citation-role summary

citation-polarity summary

fields

years

verdicts

roles

polarities

representative citing papers

citing papers explorer