Thesaurus-Based Disambiguation
Idea: the semantic categories of the words in a context determine the semantic category of the context as a whole. This category, in turn, determines which word senses are used.
(Walker, 87): each word is assigned one or more subject codes which corresponds to its different meanings. For each subject code, we count the number of words (from the context) having the same subject code. We select the subject code corresponding to the highest count.
(Yarowski, 92): adapted the algorithm for words that do not occur in the thesaurus but that are very . Informative. E.g., Navratilova --> Sports