Pearson’s Chi-Square test II: Applications
One of the early uses of the chi-square test in statistical NLP was identifying translation pairs in aligned corpora (Church & Gale, 1991).
A more recent application uses chi-square as a measure of corpus similarity (Kilgarriff & Rose, 1998).
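Nevertheless, the chi-square test should not be used on small corpora: the statistic only approximately follows the chi-square distribution, and the approximation becomes unreliable when expected cell counts are small (a common rule of thumb is fewer than 5 per cell).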
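
As an illustration of the translation-pair use and of the small-count caveat, here is a minimal Python sketch. It assumes a 2x2 contingency table of aligned sentence pairs: a = both words occur, b = only the source word occurs, c = only the target word occurs, d = neither occurs. The function names and the counts are hypothetical, chosen only for illustration; they are not taken from Church & Gale.

```python
def chi_square_2x2(a, b, c, d):
    """Pearson's chi-square statistic for a 2x2 contingency table.

    Uses the closed form N(ad - bc)^2 / ((a+b)(c+d)(a+c)(b+d)),
    which is equivalent to summing (observed - expected)^2 / expected
    over the four cells.
    """
    n = a + b + c + d
    denom = (a + b) * (c + d) * (a + c) * (b + d)
    if denom == 0:
        raise ValueError("a row or column total is zero")
    return n * (a * d - b * c) ** 2 / denom


def min_expected_count(a, b, c, d):
    """Smallest expected cell count under the independence hypothesis."""
    n = a + b + c + d
    row_totals = (a + b, c + d)
    col_totals = (a + c, b + d)
    return min(r * col / n for r in row_totals for col in col_totals)


# Hypothetical counts from a small aligned corpus of 100 sentence pairs.
a, b, c, d = 10, 10, 10, 70

print(chi_square_2x2(a, b, c, d))      # 14.0625
print(min_expected_count(a, b, c, d))  # 4.0
```

With one degree of freedom, the critical value at the 0.05 level is about 3.84, so 14.0625 would normally count as strong evidence of association. Here, however, the smallest expected count is 4.0, below the rule-of-thumb threshold of 5, so the result should be treated with caution.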