2.1.4 Speech Files:
NIST's SPHERE Format Audio data distributed by the Linguistic Data Consortium (LDC) is typicaly distributed on CDROM using the NIST SPHERE format. A SPHERE file consists of a single fixed size header followed by binary audio data. The header is organized as name value pairs in a 1024-byte blocked, ASCII structure placed at the beginning of the file. The binary data can be in either big-endian or little-endian byte order. Click here for a more detailed description of the SPHERE format. An example of a SPHERE header is shown below. Click here or use the "Save Link As" feature in your browser to download this example.
|