|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
public interface ClusterDoc
Represents a document that I can conduct unsupervised clustering on.
In fact, pretty much all documents can undergo this process if they
implement these methods.
These documents also need to be able to compare themselves (for sorting purposes)
| Method Summary | |
|---|---|
void |
addTermSetCount(Phrase termSet,
int n)
This document should record how frequently this termSet occured |
void |
checkSourceExists()
Throw an exception if this file won't be cluster-able |
void |
destroyLocalDoc()
After finding how often all the phrases are in this doc, this method should allow the supporting document to be released to free up memory. |
java.lang.String[][] |
getFixedWordSentences()
Each String should be fixed by VectorManager before being returned |
int[][] |
getIdxSentences(VectorManager vm)
Each entry represents the integer. |
int |
getNumInstancesOfTermSet(Phrase s)
Each document should get a unique id |
java.lang.String[][] |
getSentences()
|
double |
getTermSetsSupported()
|
boolean |
isJunkPhrase(java.lang.String w)
Added to allow differentiation between phrases of scientific articles and general search results |
void |
loadWindowedDoc()
Initially, the idea was to support proximity windows (eg. |
| Methods inherited from interface java.lang.Comparable |
|---|
compareTo |
| Method Detail |
|---|
double getTermSetsSupported()
int getNumInstancesOfTermSet(Phrase s)
void addTermSetCount(Phrase termSet,
int n)
termSet - n - boolean isJunkPhrase(java.lang.String w)
w - Space-separated words
void checkSourceExists()
throws java.io.FileNotFoundException
java.io.FileNotFoundException
java.lang.String[][] getFixedWordSentences()
throws java.io.FileNotFoundException,
java.io.IOException
java.io.FileNotFoundException
java.io.IOExceptionint[][] getIdxSentences(VectorManager vm)
vm -
java.lang.String[][] getSentences()
throws java.io.FileNotFoundException,
java.io.IOException
java.io.FileNotFoundException
java.io.IOExceptionvoid loadWindowedDoc()
java.io.FileNotFoundException
java.io.IOExceptionvoid destroyLocalDoc()
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||