Re: Corpora: Knowledge about TreeTagger and MXPOST???

From: Steven Bird (sb@unagi.cis.upenn.edu)
Date: Thu Sep 14 2000 - 01:54:56 MET DST

  • Next message: hijean kim: "Re: Corpora: early modern English"

    Rachel Aires writes:
    > I want to know wich is the key to understand the information
    > that is in the file created by TreeTagger and wich is the
    > relation among the 10 files created by MXPOST.
    > Someones has ever asked himself that?

    This situation is an instance of a more general problem with the babel
    of formats, which motivated a forthcoming special issue of Speech
    Communication:

      Speech Annotation and Corpus Tools
      http://www.ldc.upenn.edu/annotation/specom.html
      (Steven Bird & Jonathan Harrington, eds)

    Various projects are addressing this need for standard formats, and
    a good starting point is the Linguistic Annotation page.

      Linguistic Annotation
      http://www.ldc.upenn.edu/annotation/
      (Steven Bird & Mark Liberman, eds)

    Updates welcome...

    Steven Bird

    --
    Steven.Bird@ldc.upenn.edu  http://www.ldc.upenn.edu/sb
    Assoc Director, LDC; Adj Assoc Prof, CIS & Linguistics
    Linguistic Data Consortium, University of Pennsylvania
    3615 Market St, Suite 200, Philadelphia, PA 19104-2608
    



    This archive was generated by hypermail 2b29 : Thu Sep 14 2000 - 12:28:11 MET DST