The transcriptions for all utterances in a session are concatenated into a single file of the form, <SSS><T><EE>00.dot and include the utterance-ID codes. The format for a single utterance transcription entry in this table is as follows: <TRANSCRIPTION-TEXT> (<UTTERANCE-ID>) <NEW-LINE>
An example sentence illustrating this format is given below:
The December contract rose one point oh seven cents a pound to sixty eight point six two cents at the Chicago Mercantile Exchange (c13c020l)
There is one .dot file for each speaker-session. It should be noted that the conventions used during transcription were slightly different from those used in WSJ0 and can be found in the section on `WSJCAM0 Detailed Orthographic Transcription (.dot) Specification'.