Table of Contents
Statistical NLP: Lecture 6
Corpus-Based Work
Looking at Text I: Low-Level Formatting Issues
Looking at Text II: Tokenization --What is a Word?
Looking at Text III: Tokenization --What is a Word (Cont’d)?
Morphology
Sentences: What is a sentence?”
Marked-Up Data I: Mark-up Schemes
Marked-Up Data II: Grammatical Coding
|