A wide range of speech databases have been collected. These
databases are primarily for the development of speech
synthesis/recognition and for linguistic research.
Some databases are free but most are not. The databases normally
require lots of storage space (100's of MBytes is not unusual). Do not
expect to be able to ftp large amounts of speech data.
In addition to the descriptions of speech databases and speech
database providers below, information can be obtained from
- Provides a very wide range of speech and text data to research and
commercial users: see below.
COCOSDA Home Page:
The International Committee for the Co-ordination and
Standardisation of Speech Databases and Assesment Techniques for Speech
Shikano's WWW site on Speech and Acoustics
- European resource initiative:
The following speech data resources are described in the FAQ.
Bavarian Archive for Speech Signals
BUPT Spoken Digit Database (Chinese)
Center for Spoken Language Understanding (CSLU)
Examples of IPA Symbols
Linguistic Data Consortium (LDC)
Oxford Acoustic Phonetic Database
University of Victoria Phonetic Database
Section 1 of the
comp.speech FAQ Home Page.
Submit Information :
Last Revision: 16:48 14-May-1997