Mean and Variance (Smadja, 1993)
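Frequency-based search works well for fixed phrases. However, many collocations consist of two words in a more flexible relationship, with a varying number of words between them (e.g., "knocked on his door" vs. "knocked at the door").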
The method computes the mean and variance of the offset (signed distance) between the two words in the corpus.
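If the offsets are randomly distributed (i.e., no collocation), then the variance (and hence the sample standard deviation) will be high. Conversely, a low variance means the two words usually occur at about the same distance from each other, which suggests a collocation.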
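The computation can be illustrated with a minimal Python sketch. It is not the original implementation: the function names `offsets` and `offset_stats`, the window of 5 tokens, and the toy sentences are all assumptions chosen for illustration. It collects the signed offset of the second word relative to the first within each sentence, then reports the sample mean and the sample standard deviation (n - 1 in the denominator).

```python
from statistics import mean, stdev

def offsets(sentences, w1, w2, window=5):
    """Collect signed distances (position of w2 minus position of w1).

    `sentences` is a list of token lists; only pairs at most `window`
    tokens apart in the same sentence are counted (window is an assumption).
    """
    result = []
    for tokens in sentences:
        for i, tok in enumerate(tokens):
            if tok != w1:
                continue
            lo = max(0, i - window)
            hi = min(len(tokens), i + window + 1)
            for j in range(lo, hi):
                if j != i and tokens[j] == w2:
                    result.append(j - i)
    return result

def offset_stats(sentences, w1, w2, window=5):
    """Return (mean, sample standard deviation) of the offsets, or None
    when fewer than two offsets were observed."""
    d = offsets(sentences, w1, w2, window)
    if len(d) < 2:
        return None
    return mean(d), stdev(d)

# Toy corpus (hypothetical data, whitespace-tokenized).
corpus = [
    "she knocked on his door".split(),
    "they knocked at the door".split(),
    "he knocked on the metal front door".split(),
]

print(offsets(corpus, "knocked", "door"))
print(offset_stats(corpus, "knocked", "door"))
```

A small standard deviation relative to the window size indicates that the second word tends to sit at a consistent distance from the first. On a real corpus the statistics would be computed over many more occurrences than in this toy example.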