Coswara -- A Database of Breathing, Cough, and Voice Sounds for COVID-19 Diagnosis
read the original abstract
The COVID-19 pandemic presents global challenges transcending boundaries of country, race, religion, and economy. The current gold standard method for COVID-19 detection is the reverse transcription polymerase chain reaction (RT-PCR) testing. However, this method is expensive, time-consuming, and violates social distancing. Also, as the pandemic is expected to stay for a while, there is a need for an alternate diagnosis tool which overcomes these limitations, and is deployable at a large scale. The prominent symptoms of COVID-19 include cough and breathing difficulties. We foresee that respiratory sounds, when analyzed using machine learning techniques, can provide useful insights, enabling the design of a diagnostic tool. Towards this, the paper presents an early effort in creating (and analyzing) a database, called Coswara, of respiratory sounds, namely, cough, breath, and voice. The sound samples are collected via worldwide crowdsourcing using a website application. The curated dataset is released as open access. As the pandemic is evolving, the data collection and analysis is a work in progress. We believe that insights from analysis of Coswara can be effective in enabling sound based technology solutions for point-of-care diagnosis of respiratory infection, and in the near future this can help to diagnose COVID-19.
This paper has not been read by Pith yet.
Forward citations
Cited by 3 Pith papers
-
COVID-19 Diagnosis from Cough Acoustics using ConvNets and Data Augmentation
ConvNet with MFCC features and data augmentation classifies cough sounds for COVID-19 with 87.07 AUC on the DiCOVA 2021 blind test, outperforming the baseline by 23%.
-
Optimising MFCC parameters for the automatic detection of respiratory diseases
Empirical tuning of MFCC parameters (roughly 30 coefficients, shorter hops, dataset-dependent frame lengths) improves SVM accuracy for respiratory disease detection by 14.9-19.6% on COVID-19 and voice-disorder datasets.
-
Developing a Multi-variate Prediction Model For COVID-19 From Crowd-sourced Respiratory Voice Data
HuBERT reaches 86% accuracy and 0.93 AUC detecting COVID-19 from 893 voice samples in the Cambridge COVID-19 Sound database.
discussion (0)
Sign in with ORCID, Apple, or X to comment. Anyone can read and Pith papers without signing in.