Package ca.uottawa.balie

Class Summary
AbbreviationLookup Wraps around a hashtable that contains a list of abbreviation.
Balie This is the main entry point for training Balie.
Canonizer Offers static function to convert a word into its canon form.
CharacterNGram Methods to collect and handle character n-gram.
DebugInfo Class that group routines for debugging.
FileHandler Simple file loader
HashCodeUtil Collected methods which allow easy implementation of hashCode.
LanguageIdentification Methods for training, testing and using language identification.
LanguageIdentificationGuess A guess is compose of a language (e.g.: French) along with its probability (ex.: 0.9).
LanguageSpecific Squeleton of language specific routines.
LanguageSpecificEnglish Routines specific to English language.
LanguageSpecificFrench Routines specific to French language.
LanguageSpecificGerman Routines specific to German language.
LanguageSpecificRomanian Routines specific to Romanian language.
LanguageSpecificSpanish Routines specific to Spanish language.
LexiconOnDisk  
LigatureLookup Wraps around a hashtable that contains a list of ligatures.
NamedEntityAlias  
NamedEntityRecognition Named Entity Recognition (NER) NER operates on the Balie TokenList.
POSLookup Lookup for part-of-speeches.
PunctLookup Wraps around a hashtable that contains a list of punctuation.
SBRCorpusHandler SAX parser for the Sentence Boundary Recognition corpus.
SentenceBoundariesRecognition Methods for training, testing and using sentence boundary recognition.
Token Tokens are the unit element of Balie.
TokenConsts Static class containing enumeration of token type value.
Tokenizer The tokenizer takes a text in input and extract a tokenlist.
TokenList List of Tokens to represent a text.
TokenListIterator  
UnbreakableLookup Wraps around a hashtable that contains a list of Unbreakable tokens.
WekaAttribute Wrapper around the Weka attributes.
WekaAttributeSelection Methods to select the top attributes from a given classification problem.
WekaLearner Methods to create, train and test a classification algorithm.
WekaPersistance Service class to load and save Weka models.
XmlUtil Utility class for reading and writing XML files