A few weeks ago, during the discussion on the "legal aspects of compiling corpora", a few
people mentioned the cached pages that are available from Google, which at times reproduce
large portions of copyrighted material. Our question was how this could be legal, and it
now looks like it may not be. The following article gives more details, including
information from Google on how they plan to address the problem:
http://news.com.com/2100-1038_3-1024234.html?tag=fd_lede2_hed
Mark Davies
ISU/BYU
=================================================
Mark Davies
Assoc. Prof., Spanish Linguistics
Illinois State University
http://mdavies.for.ilstu.edu/
(phone) 309-438-7975 / (fax) 309-438-8038
(As of August 1, 2003)
Assoc. Prof., Corpus and Computational Linguistics
Brigham Young University, Provo, UT
** Corpus design and use // Web-database scripting **
** Historical and dialectal syntax // Functional-typological grammar **
=================================================
This archive was generated by hypermail 2b29 : Sat Jul 12 2003 - 23:31:21 MET DST