Offset Alignment by Signal Processing Techniques III: Fung & McKeown, 1994
Fung and McKeown’s algorithm works:
- Without having found sentence boundaries.
- On text that is only roughly parallel (with certain sections missing in one language).
- With unrelated language pairs.
The technique is to infer a small bilingual dictionary that will give points of alignment.
For each word, a signal is produced: an arrival vector of integers giving the number of words between successive occurrences of that word.
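The arrival vector can be illustrated with a minimal sketch. It assumes the text is already tokenized into a list of words, and it takes "the number of words between occurrences" to mean the difference between successive token positions; the exact counting convention and the function name `arrival_vectors` are assumptions made for illustration, not details taken from the paper.

```python
from collections import defaultdict


def arrival_vectors(tokens):
    """Map each word to the gaps between its successive occurrences.

    Assumption: a gap is the difference between consecutive token
    positions, so a word seen at positions 0, 2, 4 yields [2, 2].
    A word that occurs only once yields an empty vector.
    """
    # Record every position at which each word occurs.
    positions = defaultdict(list)
    for index, token in enumerate(tokens):
        positions[token].append(index)

    # Turn each position list into differences between neighbours.
    return {
        word: [later - earlier for earlier, later in zip(pos, pos[1:])]
        for word, pos in positions.items()
    }
```

For the token list `["a", "b", "a", "c", "a", "b"]`, the word "a" occurs at positions 0, 2 and 4, so its arrival vector is `[2, 2]`. Because the vector stores gaps rather than absolute positions, a word and its translation can give similar signals even when the two texts are offset from each other.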