| COUGHVID | Cough audio | English | COVID-19 detection | 20,000 | 2020 | CC-BY 4.0 |
| Coswara | Cough, breath, speech | English | COVID-19 detection | 5,000 | 2022 | CC-BY 4.0 |
| UK COVID-19 Vocal Audio Dataset | Cough, breath, speech | English | COVID-19 detection | 70,000 | 2023 | OGL v3.0 |
| Respiratory Sound Database | Lung auscultation sounds | English | Respiratory disease classification | 920 | 2017 | CC-BY 4.0 |
| smarty4covid | Cough, breath, voice | English | COVID-19 detection | 4,600 | 2023 | CC-BY 4.0 |
| Bridge2AI-Voice | Voice recordings | English | Voice biomarker research | Not specified | 2025 | Apache-2.0 |
| VOICED | Voice recordings | English | Pathological voice analysis | 208 | 2018 | ODC-BY 1.0 |
| Perceptual Voice Qualities Dataset | Voice recordings | English | Perpetual voice quality | 360+ | 2020 | CC-BY 4.0 |
| COVID-19 Voice Dataset | Voice recordings | English | COVID-19 detection | Not specified | 2023 | CC-BY 4.0 |
| ALS IAC Speech Corpus | Speech | English | ALS | Not specified | 2024 | CC-BY 4.0 |
| PMC COVID-19 Voice Dataset | Voice recordings | English | COVID-19 detection | Not specified | 2022 | OGL v3.0 |