One sense per discourse, one sense per collocation
(Yarowsky, 1995)’s Idea: there are constraints between different occurrences of an ambiguous word within a corpus that can be exploited for disambiguation:
- One sense per discourse: The sense of a target word is highly consistent within any given document.
- One sense per collocation: nearby words provide strong and consistent clues to the sense of a target word, conditional on relative distance, order and syntactic relationship.