Package Util

Lots of classes that are used throughout the engine, and could possibly be used in another project.

See:
          Description

Interface Summary
DocIdentifiable Created this class on 10/3/05 because SVMTestable and SVMTrainable need to be able to uniquely identify their documents.
 

Class Summary
BoostChecker Check if a word should be boosted for this category
EscapeChars Taken from: http://www.javapractices.com/Topic96.cjp
Convenience methods for altering special characters related to URLs, regular expressions, and HTML tags.
FileListReader Opens a file describing wormbase papers
KnownDoc Repsents documents that are from the training set
MembershipMap  
Pair<K extends Comparable,V> Used multiple times when sorting elements by a specific property/function of those elements
PairSorter Provides some methods that utilize Pair to quickly return sorted values
StopWordsChecker Loads files as needed and provides methods to check if words are verbs, stopwords, useless, or names
Stopwords.txt contains stopwords, these are one word per line and are not stemmed when checking for stopwords.
TermCounter Counts the number of times a Phrase has been used in a document.
TestDoc Represents documents that we wish to classify or cluster.
TitleFetcher  
VectorManager Class that keeps track of how often a word occurs in a document or globally.
WindowedDoc Implements methods needed by a Textpresso document to prepare it for clustering.
XMLDoc Reads the data from an XML file or from the flatfile if available.
 

Package Util Description

Lots of classes that are used throughout the engine, and could possibly be used in another project.