Title: Indexed Fast Sequential Searches
Abstract:
A data structure and algorithm are presented for organizing data into sequences of tokens that are easy to access and identify. The method is used to index plain text for rapid searches of token sequences and for clustering common patterns. The work stems from research into how best to organize data for unsupervised grammar learning from plain text.