Corpora: Announcing a large Portuguese corpus

From: Diana Maria de Sousa Marques Pinto dos Santos (Diana.Santos@informatics.sintef.no)
Date: Tue Sep 05 2000 - 12:48:27 MET DST

    Dear members of the corpora list,

    We would like to announce the release of CETEMPúblico, a large corpus
    (approx. 180 million words) of newspaper text from the Portuguese
    daily Público, created by our project as a further initiative to
    foster R&D in the processing of the Portuguese language.

    Please see the corpus page for further details on distribution and
    availability:
    http://cgi.portugues.mct.pt/cetempublico/

    Diana Santos & Paulo Rocha

    Computational processing of Portuguese
    http://www.portugues.mct.pt/
    SINTEF Telecom and Informatics
    Box 124 Blindern, N-0314 Oslo, Norway
    projecto@informatics.sintef.no


