Audio Multimodal AI Resources

A compilation of resources (model profiles, benchmarks, docs) for multimodal AI models with audio understanding (esp. focused on ASR and transcription use-cases)

View on GitHub

Project Details

Tags

asraudio-multimodalaudio-text-to-textaudio-understandingmultimodal-aistt

Explore More Projects