arXiv preprint arXiv:2305.14032

Patch-mix contrastive learning with audio spectrogram transformer on respiratory sound classification · 2023 · arXiv 2305.14032

2 Pith papers cite this work. Polarity classification is still indexing.

representative citing papers

RespiraMFM: A Multimodal Foundation Model with Contrastive Audio-Language Alignment for Respiratory Disease Identification

cs.SD · 2026-06-08 · unverdicted · novelty 5.0

RespiraMFM reports 9.15% AUROC gain in supervised fine-tuning and 20.98% in zero-shot settings over baselines by aligning respiratory audio with clinical text across seven real-world datasets for five diseases.

C2GA: A Class-Controllable Generative Augmentation Framework for Respiratory Sound Classification

cs.SD · 2026-06-01 · unverdicted · novelty 4.0

C2GA uses conditional VQ-VAE with decoupled local tokens and global class prototypes plus a Transformer prior to generate high-fidelity label-consistent Mel-spectrograms for respiratory sound data augmentation.

citing papers explorer

RespiraMFM: A Multimodal Foundation Model with Contrastive Audio-Language Alignment for Respiratory Disease Identification · cs.SD · 2026-06-08 · unverdicted · none · ref 18
C2GA: A Class-Controllable Generative Augmentation Framework for Respiratory Sound Classification · cs.SD · 2026-06-01 · unverdicted · none · ref 30
