Dear members of the corpora list,
We would like to announce the release of CETEMPúblico, a large corpus
(approx. 180 million words) of Portuguese newspaper language from the
Portuguese daily newspaper Público, created by our project as another
initiative to foster R&D in the processing of the Portuguese language.
Please see the corpus page for further details on distribution and
availability:
http://cgi.portugues.mct.pt/cetempublico/
Diana Santos & Paulo Rocha
Computational processing of Portuguese
http://www.portugues.mct.pt/
SINTEF Telecom and Informatics
Box 124 Blindern, N-0314 Oslo, Norway
projecto@informatics.sintef.no
This archive was generated by hypermail 2b29 : Tue Sep 05 2000 - 12:46:25 MET DST