The AMI Meeting Corpus

2005

3 Pith papers cite this work. Polarity classification is still indexing.

citation-role summary

method: 1

citation-polarity summary

use method: 1

representative citing papers

FastTurn: Unifying Acoustic and Streaming Semantic Cues for Low-Latency and Robust Turn Detection

cs.SD · 2026-04-02 · unverdicted · novelty 6.0

FastTurn unifies acoustic features and streaming CTC decoding for low-latency, robust turn detection in full-duplex dialogue systems and releases a realistic human-dialogue test set.

A Knowledge-Driven Approach to Target Speech Extraction in the Presence of Background Sound Effects for Cinematic Audio Source Separation (CASS)

eess.AS · 2026-04-30 · unverdicted · novelty 4.0

Detecting manners of articulation and adding them as knowledge features improves target speech extraction in cinematic audio with background sounds.

Exploring Speech Foundation Models for Speaker Diarization Across Lifespan

eess.AS · 2026-04-06 · unverdicted · novelty 4.0 · 2 refs

Cross-lifespan evaluation shows that adult-trained speech foundation models degrade on child and older-adult data; joint multi-age training and targeted adaptation improve robustness, especially with the Whisper encoder.

citing papers explorer

Showing 3 of 3 citing papers.

FastTurn: Unifying Acoustic and Streaming Semantic Cues for Low-Latency and Robust Turn Detection
cs.SD · 2026-04-02 · unverdicted · none · ref 19
FastTurn unifies acoustic features and streaming CTC decoding for low-latency, robust turn detection in full-duplex dialogue systems and releases a realistic human-dialogue test set.

A Knowledge-Driven Approach to Target Speech Extraction in the Presence of Background Sound Effects for Cinematic Audio Source Separation (CASS)
eess.AS · 2026-04-30 · unverdicted · none · ref 16
Detecting manners of articulation and adding them as knowledge features improves target speech extraction in cinematic audio with background sounds.

Exploring Speech Foundation Models for Speaker Diarization Across Lifespan
eess.AS · 2026-04-06 · unverdicted · none · ref 30 · 2 links
Cross-lifespan evaluation shows that adult-trained speech foundation models degrade on child and older-adult data; joint multi-age training and targeted adaptation improve robustness, especially with the Whisper encoder.