Featured Collections
CuratedWhisper ACFT Fine-Tunes
Adapter-based Whisper fine-tunes using ACFT (Adapter-based Cross-lingual Fine-Tuning) for efficient speech recognition improvements.
Whisper Fine-Tunes (Full Model)
Full model Whisper fine-tunes for specialized speech recognition tasks, including domain-specific and multilingual variants.
Key ASR Datasets
Training DataTech Sentences For ASR Training
A dataset of technical sentences designed for training and evaluating speech recognition systems on technical vocabulary and terminology.
Hebrew-English Code-Switching Sentences
Sentences containing mixed Hebrew and English for training STT systems that handle code-switching common among English speakers in Israel.
Speech Models
9 ModelsWhisper fine-tune for speech recognition
Whisper fine-tune for speech recognition
Whisper fine-tune for speech recognition
Whisper fine-tune for speech recognition
Whisper fine-tune for speech recognition
Whisper fine-tune for speech recognition
Whisper fine-tune for speech recognition
Whisper fine-tune for speech recognition
Whisper Hebrish
ModelWhisper fine-tune for speech recognition
ASR Datasets
11 DatasetsSample Voice Context Data
DatasetSample Voice Context Data A small synthetic dataset containing LLM-generated context information simulating a job seeker narrating their ca...
Whisper WPM Test
DatasetWhisper WPM Test Dataset A dataset of audio recordings in various speaking styles and content types, designed for evaluating speech-to-text...
Tech Sentences For ASR Training
DatasetTechVoice Dataset Work in Progress – This dataset is actively being expanded with new recordings. Dataset Statistics Met...
Whisper Fine Tune One Shot Eval
DatasetWhisper Fine-Tuning Evaluation: Local vs Commercial ASR A "back of the envelope" evaluation comparing fine-tuned Whisper models running lo...
English Hebrew Mixed Sentences
DatasetEnglish-Hebrew Mixed Sentences Dataset A dataset of English sentences with Hebrew words and phrases interspersed, designed for speech-to-te...
Multimodal Ai Taxonomy
DatasetMultimodal AI Taxonomy A comprehensive, structured taxonomy for mapping multimodal AI model capabilities across input and output modalities...
Ai Generated Podcast Episodes
DatasetDataset for speech recognition training
Long Prompt Experiment
DatasetI conducted this experiment to investigate the impact of prompt structure and optimization on LLM performance, specifically testing whether quality an...
Voice Note Audio
DatasetVoice Notes Dataset Dataset Description This dataset contains real-world voice recordings with transcripts and comprehensive ann...
STT Voice Notes Evals
DatasetSTT Voice Note Evaluation Author: Daniel RosehillDate Created: August 11, 2025Purpose: Comparative evaluation of Speech-to-Text (STT) servi...
Speech To Text System Prompts 2
DatasetSpeech To Text System Prompt Library This repository provides a collection of system prompts designed to transform and refine text capture...
Voice Demos & Spaces
4 SpacesWhisper Hebrish
SpaceInteractive voice/speech demo
Whisper Fine Tune Eval
SpaceInteractive voice/speech demo
Single Podcast ASR Eval
SpaceInteractive voice/speech demo
Interactive voice/speech demo
More Voice Projects
Explore additional resources and collections on Hugging Face and GitHub