Supervised Disambiguation
Training set: exemplars where each occurrence of the ambiguous word w is annotated with a semantic label ==> Classification problem.
Approaches:
- Bayesian Classification: the context of occurrence is treated as a bag of words without structure, but it integrates information from many words.
- Information Theory: only looks at informative features in the context. These features may be sensitive to text structure.
- There are many more approaches (see Chapter 16 or the Machine Learning course).