|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||
See:
Description
| Interface Summary | |
|---|---|
| DocIdentifiable | Created this class on 10/3/05 because SVMTestable and SVMTrainable need to be able to uniquely identify their documents. |
| Class Summary | |
|---|---|
| BoostChecker | Check if a word should be boosted for this category |
| EscapeChars | Taken from: http://www.javapractices.com/Topic96.cjp Convenience methods for altering special characters related to URLs, regular expressions, and HTML tags. |
| FileListReader | Opens a file describing wormbase papers |
| KnownDoc | Repsents documents that are from the training set |
| MembershipMap | |
| Pair<K extends Comparable,V> | Used multiple times when sorting elements by a specific property/function of those elements |
| PairSorter | Provides some methods that utilize Pair to quickly return sorted values |
| StopWordsChecker | Loads files as needed and provides methods
to check if words are verbs, stopwords, useless, or names Stopwords.txt contains stopwords, these are one word per line and are not stemmed when checking for stopwords. |
| TermCounter | Counts the number of times a Phrase has been used in a document. |
| TestDoc | Represents documents that we wish to classify or cluster. |
| TitleFetcher | |
| VectorManager | Class that keeps track of how often a word occurs in a document or globally. |
| WindowedDoc | Implements methods needed by a Textpresso document to prepare it for clustering. |
| XMLDoc | Reads the data from an XML file or from the flatfile if available. |
Lots of classes that are used throughout the engine, and could possibly be used in another project.
|
||||||||||
| PREV PACKAGE NEXT PACKAGE | FRAMES NO FRAMES | |||||||||