ICAME CORPUS COLLECTION - INFORMATION

 

The Helsinki Corpus of English Texts: Diachronic Part


This corpus was compiled at the University of Helsinki, under the direction of Matti Rissanen. It consists of a selection of texts covering the Old, Middle, and Early Modern English periods, in all 242 text files (total 1.5 million words.) See the description in ICAME Journal 13 (1989), pp. 12-15.


Example: 
<B COPREFCP>                                                                    
<Q O2 XX PREF PRCP>                                                             
<N PREF CP>                                                                     
<A ALFRED>                                                                      
<C O2>                                                                          
<O 850-950>                                                                     
<M 850-950>                                                                     
<K CONTEMP>                                                                     
<D WS>                                                                          
<V PROSE>                                                                       
<T PREFACE/EPIL>                                                                
<G X>                                                                           
<F X>                                                                           
<W X>                                                                           
<X X>                                                                           
<Y X>                                                                           
<H X>                                                                           
<U X>                                                                           
<E X>                                                                           
<J X>                                                                           
<I X>                                                                           
<Z X>                                                                           
<S SAMPLE X>                                                                    
 
                                                 

[^  TEXT:  ALFRED'S PREFACE TO CURA PASTORALIS.                                 
KING ALFRED'S WEST-SAXON VERSION                                                
OF GREGORY'S PASTORAL CARE, PART I.                                             
EARLY ENGLISH TEXT SOCIETY, O.S. 45.                                            
ED. H. SWEET.                                                                   
LONDON, 1958 (1871).                                                            
PP. 3.1 - 9.7  ^]                                                               

                                                                            
[^B9.1.1^]                                                                      
<P 3>                                                                           
<R 1>                                                                           
[}+DEOS BOC SCEAL TO WIOGORA CEASTRE.}]                                         
+alfred kyning hate+d gretan W+arfer+d biscep his wordum                        
luflice & freondlice;                                                           
<R 2>                                                                           
& +de cy+dan hate +d+at me com swi+de oft on gemynd, hwelce                     
wiotan iu w+aron giond Angelcynn, +ag+der ge godcundra                          
hada ge woruldcundra;                                                           
<R 4>

The Helsinki Corpus of English Texts: Diachronic Part, WordCruncher version


This in an indexed version of the Helsinki Corpus. It can only be used with WordCruncher.

Most of the material has been described in greater detail in previous issues of ICAME Journal. Prices and technical specifications are given on separate order forms. Note that tagged versions of the Brown Corpus cannot be obtained through ICAME. The same applies to audio tapes for the London-Lund Corpus, the Lancaster/IBM Spoken English Corpus, and the Polytechnic of Wales Corpus.

There are available printed manuals for the LOB Corpus (the original manual and a supplementary manual for the tagged version). Printed manuals for the Brown Corpus cannot be obtained from Bergen. Some information on the London-Lund Corpus is distributed together with copies of the text and the KWIC concordance for the corpus. Users of the London-Lund material are also recommended to consult J. Svartvik (ed.). The London-Lund Corpus: Description and Research, Lund University Press, 1990.

A manual for the Kolhapur Corpus can be ordered from: S.V. Shastri, Department of English, Shivaji University, Vidyanagar, Kolhapur-416006, India. The price of this manual is US $15 (including airmail charges). Payment should be sent along with the order by cheque or international postal order drawn in favour of The Registrar, Shivaji University, Kolhapur.

 


Conditions on the use of ICAME corpus material

The primary purposes of the International Computer Archive of Modern English (ICAME) are:

  1. collecting and distributing information on (i) English language material available for computer processing; and (ii) linguistic research completed or in progress on this material;
  2. compiling an archive of corpora to be located at the University of Bergen, from where copies of the material can be obtained at cost.

The following conditions govern the use of corpus material distributed through ICAME:

  1. No copies of corpora, or parts of corpora, are to be distributed under any circumstances without the written permission of ICAME.
  2. Print-outs of corpora, or parts thereof, are to be used for bona fide research of a non-profit nature. Holders of copies of corpora may not reproduce any texts, or parts of texts, for any purpose other than scholarly research without getting the written permission of the individual copyright holders, as listed in the manual or record sheet accompanying the corpus in question. (For material where there is no known copyright holder, the person(s) who originally prepared the material in computerized form will be regarded as the copyright holder(s).)
  3. Commercial publishers and other non-academic organizations wishing to make use of part or all of a corpus or a print-out thereof must obtain permission from all the individual copyright holders involved.
  4. The person(s) who originally prepared the material in computerized form must be acknowledged in every subsequent use of it.

Use of ICAME texts within an institution
Though ICAME texts cannot be used and distributed outside the institution making the order, they can be freely used within the institution (department, faculty, university) for the purposes of research and teaching. To prevent any use of the material for commercial and profit-making purposes, it is advisable to limit access to registered computer users within the institution. The way this is done may vary depending upon the institution making the order.