audio speech-transcription datasets

Speech Transcription

Speech transcription is an audio task that converts speech into text. This is an important task in audio that is used in many applications, such as speech recognition, voice assistants, and audio event detection.

LJ-Speech-Dataset

This is a public domain speech dataset consisting of 13,100 short audio clips of a single speaker reading passages from 7 non-fiction books. A transcription is provided for each clip. Clips vary in length from 1 to 10 seconds and have a total length of approximately 24 hours.

Audio Speech Transcription

3.8 gb

13K53

Updated: 3 years ago