Andrew McCallum's Bag Of Words library has several components that sound
like they would be of use to you.
http://www.cs.cmu.edu/~mccallum/bow <http://www.cs.cmu.edu/~mccallum/bow>
Regards
Paul.
---
Paul Holmes-Higgin
Documentum UK
Tel: +44 (0)20 8867 3179
Email: paulhh@documentum.com
-----Original Message-----
From: Patrick Ruch [mailto:ruch@dim.hcuge.ch]
Sent: Monday, October 16, 2000 3:32 PM
To: ln@cines.fr; Elsnet; IRList; webir@egroups.com; CORPORA@HD.UIB.NO
Subject: Corpora: IR info and tools
Hi,
I am looking for a toolset -free or commercial- for calculating
vector distances (cosinus, euclid...). The target is an NLP-based
IR engine, and it must be efficient. Related stategies for choosing
the indexing terms are welcome.
Thanks in advance,
Patrick Ruch
__________________________________
Patrick Ruch
HUG - Medical Informatics Division
CH-1211 Geneva 14
tel.: (+41 22) 372 61 64
fax: (+41 22) 372 48 55
email: Patrick.Ruch@dim.hcuge.ch <mailto:Patrick.Ruch@dim.hcuge.ch>
This archive was generated by hypermail 2b29 : Mon Oct 16 2000 - 17:05:14 MET DST