Hi Constantin,
If what you need is a string edit distance (the measure underlying
diff-like tools), a good introduction, covering both theory and
practice in a didactic way, can be found here:
http://www-igm.univ-mlv.fr/~lecroq/seqcomp/index.html
That page also gives a couple of further links.
Some C code (for Unix-like systems) can be downloaded here:
http://odur.let.rug.nl/~kleiweg/levenshtein/
Such generic metrics can be applied to any kind of sequence
(tokens, part-of-speech tags, word senses).
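
To illustrate the point about generic material, here is a minimal sketch
of a Levenshtein-style edit distance in Python. It is not the code from
either link above; the function name edit_distance and the unit costs for
insertion, deletion and substitution are my own assumptions. It compares
elements only with ==, so it should work the same way on characters,
tokens or tags.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences, assuming unit costs.

    a and b may be strings or lists of any items comparable with ==
    (tokens, part-of-speech tags, word senses, ...).
    """
    # prev[j] holds the distance between the first i-1 items of a
    # and the first j items of b; start with the empty prefix of a.
    prev = list(range(len(b) + 1))
    for i, x in enumerate(a, start=1):
        curr = [i]  # distance from the first i items of a to an empty b
        for j, y in enumerate(b, start=1):
            cost = 0 if x == y else 1
            curr.append(min(prev[j] + 1,          # delete x
                            curr[j - 1] + 1,      # insert y
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


# Character level
print(edit_distance("kitten", "sitting"))  # 3
# Token level: one substitution (cat -> dog) and one insertion (down)
print(edit_distance("the cat sat".split(), "the dog sat down".split()))  # 2
```

Only two rows of the table are kept, so memory grows with the length of
the second sequence rather than with the product of both lengths.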
Regards,
-Patrick
__________________________________
Patrick Ruch
HUG - Medical Informatics Division
CH-1211 Geneva 14
tel.: (+41 22) 372 61 64
fax: (+41 22) 372 48 55
email: Patrick.Ruch@dim.hcuge.ch
This archive was generated by hypermail 2b29 : Mon Nov 13 2000 - 19:43:18 MET