Misquotations



next up previous
Next: Mispronunciations Up: Orthography Previous: Hyphenated words

Misquotations

Where an utterance differs from its prompt text yet remains fluent and linguistically valid, the DOT transcription represents the actual spoken utterance. This utterance text may consist of deletions, transpositions, substitutions and (theoretically unlikely) insertions of words from the original prompt text. This approach increases the quantity of testing and training data at the expense of losing consistency with the newspaper text.




Tue Jan 17 18:52:43 GMT 1995