Speech Transcription Datasets
Speech transcription, also called automatic speech recognition, is an audio task that converts spoken language into text. It underlies applications such as voice assistants, captioning, and dictation.
This is a public-domain speech dataset consisting of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books. A transcription is provided for each clip. Clips range from 1 to 10 seconds in length and total approximately 24 hours.
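As a quick sanity check on the figures quoted above, the average clip length can be worked out from the clip count and the total duration. The sketch below is only illustrative: the helper name `average_clip_seconds` is hypothetical, and the 24-hour total is approximate, so the result is an estimate.

```python
def average_clip_seconds(total_hours: float, clip_count: int) -> float:
    """Return the mean clip duration in seconds."""
    if clip_count <= 0:
        raise ValueError("clip_count must be positive")
    return total_hours * 3600 / clip_count


# Figures taken from the dataset description; the total duration is approximate.
avg = average_clip_seconds(total_hours=24, clip_count=13_100)
print(f"{avg:.1f} s per clip on average")
```

This gives roughly 6.6 seconds per clip, which sits inside the stated 1 to 10 second range.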
3.8 GB