It should be similar to the MUC-4 Latin American Terrorist Event
corpus. The problem with MUC-4 is that (from what I have seen) the texts
are all in upper case. I need a corpus in which capital letters appear
only in the correct places - beginning of sentences, proper nouns, etc.
Thanks in advance for any replies. I will send a summary around if I
receive a significant number of responses.
Rob Collier
NLP group
Department of Computer Science
University of Sheffield
England
http://www.dcs.shef.ac.uk/~robin/index.html