Actually I think there is quite a lot of work on this kind of thing. I
think
that "Estimation-Maximization" or "Baum-Walsh Renormalization" of
"Hidden
Markov Models" of symbol strings are all based on these kinds of ideas.
According to my understanding these are all ways of identifying the best
set of
labels for relationships between symbols in a string. Someone who is
more familiar with those techniques might like to comment on that,
though.
Rob Freeman
rjfreeman@usa.net
Meunier Fanny wrote:
> Dear all,
>
> I was wondering whether (or not) studies have been published on the
> comparison of the success rates of POS taggers with a restricted tagset vs
> POS taggers with a refined tagset. Any interesting references would be most
> welcome!
> Thank you very much,