Corpora: Announcing a large Portuguese corpus

From: Diana Maria de Sousa Marques Pinto dos Santos (Diana.Santos@informatics.sintef.no)
Date: Tue Sep 05 2000 - 12:48:27 MET DST

Next message: Charles Meyer: "Corpora: Third North American Symposium on Corpus Linguistics and Language Teaching"

Previous message: Fernando Martínez Santiago: "Corpora: spanish / english comparable corpora"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

Dear members of the corpora list,

We would like to announce the release of CETEMPúblico, a large corpus
(approx. 180 million words) of Portuguese newspaper language from the
Portuguese daily newspaper Público, created by our project as another
initiative to foster R&D in the processing of the Portuguese language.

Please see the corpus page for further details on distribution and
availability:
http://cgi.portugues.mct.pt/cetempublico/

Diana Santos & Paulo Rocha

Computational processing of Portuguese
http://www.portugues.mct.pt/
SINTEF Telecom and Informatics
Box 124 Blindern, N-0314 Oslo, Norway
projecto@informatics.sintef.no

Next message: Charles Meyer: "Corpora: Third North American Symposium on Corpus Linguistics and Language Teaching"
Previous message: Fernando Martínez Santiago: "Corpora: spanish / english comparable corpora"
Messages sorted by: [ date ] [ thread ] [ subject ] [ author ]

This archive was generated by hypermail 2b29 : Tue Sep 05 2000 - 12:46:25 MET DST