> Judie Chien wrote:
>
> Hi,Corpus Liguistics:
>
> Would any one of you know where I can get a standard English
> frequencly list (either of a corpus or form any language institute)?
> Criteria of the frequency list:
> 1. A list in which a number of lemmata has been included.
> 2. Inflection forms of a lemma should be treated as the same word
> derived form the list.(walk includes walks, walking, walked)
> 3. Word with different derivational forms should be put in different
> places in the list. (ex: affirm, affirmation, affirmative appear
> with different number in frequency list).
> It would do our research group a great help if you can offer some
> related information.
Dear Judie,
I have compiled a number of wordlists based on an original one-million
words Business Letter Corpus (BLC) as part of my MA Thesis, "A Corpus-
based Study of Lexical and Grammatical Features of Written Business
English."
Although they not exactly what you are looking for, you may still find
them interesting. At the moment I have the following wordlists uploaded
onto my Web page at the following URL:
http://www2.gol.com/users/ysomeya/
General BLC Wordlists
D1 COMPREHENSIVE BLC WORDLIST (Lemmatized List)
D2 ALPHABETICAL INDEX OF THE COMPREHENSIVE BLC WORDLIST
D3 BLC KEYWORDS LIST
Categorical BLC Wordlists
E1 BLC VERB LIST 1 (Lemmatized Usage Rank List)
E2 BLC VERB LIST 2 (Graphic-word based Frequency Comparison Table)
E4 BLC ADVERB LIST 1 (Usage Rank List)
E5 BLC ADVERB LIST 1 (Frequency Comparison Table)
E6 BLC ADJECTIVE LIST 1 (Usage Rank List)
E7 BLC ADJECTIVE LIST 2 (Frequency Comparison Table)
E8 BLC NOUN LIST 1 (Lemmatized Frequency Comparison Table)
All the technical (computational and statistical) details of the
wordlists are described in my MA Thesis, which can also be downloaded
from the same Web site.
Regards,
Yasumasa Someya <ysomeya@gol.com>
Graduate Department of Language and Information Sciences
University of Tokyo
This archive was generated by hypermail 2b29 : Sun Feb 27 2000 - 20:11:22 MET