Corpora: Freely accessible Corpus of American English

From: Nancy M. Ide (ide@cs.vassar.edu)
Date: Tue Nov 14 2000 - 18:15:32 MET

    Ruth Möhlig writes:
    > I have two questions:
    >
    > 1) is there any freely accessible, fairly recent corpus of American English
    > (preferably both written and spoken and preferably searchable with
    > Wordsmith)?
    >

    An American National Corpus that will fit your description is under
    development (see http://www.cs.vassar.edu/~ide/anc), but it will not
    be available, even in a primitive (i.e., unannotated or unvalidated)
    form, until some time next year.

    Nancy Ide

    =======================================================

    Nancy Ide

    Professor and Chair
    Department of Computer Science, Vassar College
    Poughkeepsie, NY 12604-0520 USA
    Tel: +1 845 437-5988 Fax: +1 845 437-7498
    ide@cs.vassar.edu

    Chercheur Invite
    Equipe Langue et Dialogue, LORIA/CNRS
    Campus Scientifique - BP 239
    54506 Vandoeuvre-les-Nancy FRANCE
    Tel: +33 (0)3 83 59 20 47 Fax: +33 (0)3 83 41 30 79
    ide@loria.fr

    =======================================================
