figshare
Browse

Multimodal Speech Dataset Capturing Air and Bone Conduction

Download (6.86 GB)
dataset
posted on 2025-04-08, 02:38 authored by Ammar AmjadAmmar Amjad

The corpus comprises 47,182 synchronized air-conducted (AC) and bone-conducted (BC) utterances collected from 100 speakers, totaling approximately 42 hours of speech. Each speaker contributed around 25 minutes of recorded material, with individual utterances ranging in duration from 1 to 5 seconds.

Data collection was conducted in an anechoic chamber compliant with ISO 3745 standards, ensuring a controlled acoustic environment. The chamber measures 11.8 meters in length, 4.2 meters in width, and 3.8 meters in height. During the recording sessions, speakers wore a headset and read from prepared text transcriptions.

History

Usage metrics

    Licence

    Exports

    RefWorks
    BibTeX
    Ref. manager
    Endnote
    DataCite
    NLM
    DC