Hugging Face Dataset

Voice Note Audio

Voice Notes

A dataset of voice notes collected by Daniel Rosehill, mostly in and around Jerusalem, in a variety of acoustic environments and formats that reflect typical daily use of speech-to-text transcription apps. This dataset is a subset of a larger voice note training dataset that I'm curating for speech-to-text (STT) fine-tuning and entity recognition.

Annotation

The dataset includes rich annotations collected using Label Studio: Corrected transcripts (manually…

See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/Voice-Note-Audio
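As a minimal sketch of how the dataset might be pulled down for inspection, the snippet below wraps `load_dataset` from the Hugging Face `datasets` library, which the listing names as a supported library. The repository ID is taken from the dataset page URL; the helper name `load_voice_notes` is hypothetical, and the split and column names are not stated here, so the sketch only loads and prints whatever the repository exposes.

```python
# Repository ID taken from the dataset page URL.
REPO_ID = "danielrosehill/Voice-Note-Audio"


def load_voice_notes(repo_id: str = REPO_ID):
    """Load the dataset from the Hugging Face Hub (hypothetical helper).

    The import is done lazily so this module can be imported even when
    the `datasets` package is not installed.
    """
    from datasets import load_dataset

    # Returns whatever splits the repository defines; none are assumed here.
    return load_dataset(repo_id)


# Usage (requires network access and `pip install datasets`):
#   ds = load_voice_notes()
#   print(ds)  # shows the available splits, columns, and row counts
```

Printing the returned object first is a cheap way to discover the actual split and column names before writing any code that depends on them.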

Project Information

Tags

task_categories:automatic-speech-recognition
language:en
license:mit
size_categories:n<1K
format:audiofolder
modality:audio
modality:text
library:datasets
library:mlcroissant
doi:10.57967/hf/6316
region:us
speech-to-text
noise-robustness
evaluation
whisper
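The tags above mix `key:value` metadata (such as `license:mit`) with free-form keywords (such as `whisper`). The following is a small sketch of how they could be grouped for filtering or display; the function name `group_tags` is hypothetical, and splitting on the first colon is an assumption about the tag format rather than a documented rule.

```python
from collections import defaultdict

# Tags as listed for this dataset.
TAGS = [
    "task_categories:automatic-speech-recognition",
    "language:en",
    "license:mit",
    "size_categories:n<1K",
    "format:audiofolder",
    "modality:audio",
    "modality:text",
    "library:datasets",
    "library:mlcroissant",
    "doi:10.57967/hf/6316",
    "region:us",
    "speech-to-text",
    "noise-robustness",
    "evaluation",
    "whisper",
]


def group_tags(tags):
    """Split tags into a {key: [values]} mapping and a list of free-form tags."""
    grouped = defaultdict(list)
    free_form = []
    for tag in tags:
        # Split on the first colon only, so values may contain other punctuation.
        key, sep, value = tag.partition(":")
        if sep:
            grouped[key].append(value)
        else:
            free_form.append(tag)
    return dict(grouped), free_form


grouped, free_form = group_tags(TAGS)
print(grouped["modality"])  # ['audio', 'text']
print(free_form)            # ['speech-to-text', 'noise-robustness', 'evaluation', 'whisper']
```

Keeping repeated keys as lists preserves cases like the two `modality` entries, which a plain dictionary would silently overwrite.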