CleanCodec reframes audio tokenization as a selective information bottleneck to encode only perceptually important features at 12.5 tokens per second, outperforming prior codecs in efficiency, speaker similarity, and intelligibility.
Pyroomacoustics: A python package for audio room simulation and array processing algorithms,
2 Pith papers cite this work. Polarity classification is still indexing.
2
Pith papers citing it
years
2026 2verdicts
UNVERDICTED 2representative citing papers
Permutation-equivariant training via matched random channel shuffling improves SDR and reduces microphone bleed in multi-channel music source separation under unseen conditions.
citing papers explorer
-
CleanCodec: Efficient and Robust Speech Tokenization via Perceptually Guided Encoding
CleanCodec reframes audio tokenization as a selective information bottleneck to encode only perceptually important features at 12.5 tokens per second, outperforming prior codecs in efficiency, speaker similarity, and intelligibility.
-
Learning Input-Channel Permutation Equivariance for Multi-Channel Source Separation: Reducing Bleeding in Small Music Ensembles
Permutation-equivariant training via matched random channel shuffling improves SDR and reduces microphone bleed in multi-channel music source separation under unseen conditions.