Corpora: New Corpus from LDC

From: LDC Office (ldc@unagi.cis.upenn.edu)
Date: Thu Jul 06 2000 - 17:23:21 MET DST

    The Linguistic Data Consortium is pleased to announce
    the availability of the Korean Newswire Text Corpus.
    The collection contains 143,137 articles collected
    from the Korean Press Agency during the period of 2
    June 1994 through 20 March 2000. The articles are
    encoded in the KSC-5601 Korean character encoding,
    and SGML tagging has been added.

    Institutions that have membership in the LDC during
    the 2000 Membership Year will be able to receive this
    corpus free of charge. Nonmembers may purchase this
    collection for $1000.

    If you would like to order a copy of this corpus,
    please email your request to <ldc@ldc.upenn.edu>. If
    you need additional information before placing your
    order, or would like to inquire about membership in
    the LDC, please send email or call (215) 573-1275.


