Bavarian Archive for Speech Signals
- Description: The Bavarian Archive for Speech Signals
(BAS) was founded in January 1995 as an initiative of the Institute of
Phonetics at the University of Munich, Germany. The BAS will develop,
validate, administer and disseminate corpora of spoken German to the
speech community as well as to the speech engineering industry. Presently
the following German speech corpora are available on ISO 9660 CDROM:
- Siemens 1000 - SI1000
  - 5 CDROMs, newspaper corpus, read speech, 10 speakers x 1000
    utterances
- Siemens 100 - SI100
  - 7 CDROMs, read speech, 101 speakers x 100 sentences
- PhonDat 1 - PD1
  - 6 CDROMs, new edition in preparation, read speech, 201
    speakers x 450+ sentences
- PhonDat 2 - PD2
  - 1 CDROM, read speech, 2nd edition, 16 speakers x 200
    sentences, various labelled information
- Verbmobil
  - Spontaneous speech recorded in a dialog task (appointment
    scheduling). More information on the VERBMOBIL project:
    http://www.dfki.uni-sb.de/verbmobil/
Corpora in Preparation
- PhonDat 1 - PD1: 2nd extended edition (Jul 1995)
- Strange Corpora - SC
  - Reference corpora that reflect certain well-known problems in
    speech processing, such as accents, repairs, breaks, hesitations,
    repetitions, extreme F0, background noise, pathological speech, and
    speaker adaptation. The first SC corpus (SC1 Accents) will be edited
    in Jul 1995.
- BAS Edition of Verbmobil Corpora - VM: 2nd extended edition
- Articulatory data - AD: EMA data of speakers of SI1000 corpus
- ERBA: 10000 utterances from a train inquiry task
- Misc: BAS is currently developing tools for the
automatic annotation and segmentation of very large speech corpora. This
includes the automatic detection of pronunciation variants, a
statistically based alignment, and a rule-based refinement of the outcome.
The BAS seeks to cooperate with public institutions as well as with
industrial partners to develop new German speech databases. BAS can also
serve as a platform for redistributing existing German speech databases.
- Contact and More Information: The BAS is located at the
University of Munich, Germany.
BAS c/o Institut fuer Phonetik
Schellingstr. 3/II
80799 Muenchen, Germany
Ph: +49-89-21802758, Fax: +49-89-2800362
Email: bas@sun1.phonetik.uni-muenchen.de
WWW: http://www.phonetik.uni-muenchen.de/BASSeng.html
Last Revision: 03:00 01-Apr-1996