Supervised Disambiguation:An Information-Theoretic Approach
(Brown et al., 1991)’s Idea: to find a single contextual feature that reliably indicates which sense of the ambiguous word is being used.
The Flip-Flop algorithm is used to disambiguate between the different senses of a word using the mutual information as a measure.
I(X;Y)=?x?X?y?Yp(x,y) log p(x,y)/(p(x)p(y))
The algorithm works by searching for a partition of senses that maximizes the mutual information. The algorithm stops when the increase becomes insignificant.