Hugging Face Dataset

English-Hebrew-Mixed-Sentences

English-Hebrew Mixed Sentences Dataset A dataset of English sentences with Hebrew words and phrases interspersed, designed for speech-to-text training and evaluation for English speakers in Israel. Overview This dataset addresses a common challenge for English-speaking immigrants in Israel: standard speech-to-text (STT) systems struggle to accurately transcribe code-switched speech where Hebrew words are mixed into primarily English sentences. Example: "I need to pick up… See the full description on the dataset page: https://huggingface.co/datasets/danielrosehill/English-Hebrew-Mixed-Sentences.

Project Information

Categories

Tags

language:enlanguage:helicense:mitsize_categories:n<1kregion:us
View on Hugging Face Dataset