Several versions of the TIMIT corpus are available from
the Linguistic Data Consortium (LDC); a pointer to the catalog
entry for the original and basic one is
http://www.ldc.upenn.edu/ldc/catalog/html/speech_html/arpa.html
Their latest price is $100 for nonmembers.
It's also known as NIST Speech Disc CD1-1.1, and it's also
available from the National Technical Information Service (NTIS).
A set of hardcopy documentation for TIMIT is also available
from NTIS, as NTIS# PB91-100354. I don't know any more about
how to get it from them or what they charge, but their URL is
I don't know of any site that's made the text available;
it's a part of what we sell via LDC and NTIS. I'd be surprised
if there are hundereds of spelling errors, since it's been
scrubbed for years now.
- Bill Fisher / NIST