|
||||||||||
| PREV NEXT | FRAMES NO FRAMES | |||||||||
| Packages that use Phrase | |
|---|---|
| benchmark | Classes that benchmark the system. |
| cluster | Classes that deal with the unsupervised clustering. |
| Util | Lots of classes that are used throughout the engine, and could possibly be used in another project. |
| websearch | Classes for searching general web results. |
| Uses of Phrase in benchmark |
|---|
| Methods in benchmark with parameters of type Phrase | |
|---|---|
void |
ClusterableReuters.addTermSetCount(Phrase termSet,
int n)
|
int |
ClusterableReuters.getNumInstancesOfTermSet(Phrase s)
|
| Uses of Phrase in cluster |
|---|
| Methods in cluster that return Phrase | |
|---|---|
static Phrase |
Phrase.CombinedFactory(Phrase a,
Phrase b)
|
Phrase |
Phrase.getParent()
|
| Methods in cluster that return types with arguments of type Phrase | |
|---|---|
static java.util.List<Phrase> |
PhraseSupporter.checkSets(java.util.List<? extends ClusterDoc> docs,
java.util.List<Phrase> candidates,
int sufficientDocs)
Records in TestDoc the number of terms supported Records in TermSet the documents that cover each term |
static java.util.List<Phrase> |
TreeHelper.createClusterTemplate(java.util.List<? extends ClusterDoc> files,
java.util.List<Phrase> terms)
|
static java.util.List<Phrase> |
TreeHelper.createClusterTemplateWithCatBoost(java.util.List<? extends ClusterDoc> files,
java.util.List<Phrase> terms,
int cat)
This makes the hierarchy and defines the terms for each node If using snippets, abstracts, it may be possible to just use this |
static java.util.List<Phrase> |
TreeHelper.getFlatView(java.util.List<Phrase> clusters)
Uses a queue to traverse the tree and return everything in one list The parent-child relationships are preserved in the returned list |
java.util.List<Phrase> |
Phrase.getTermSetChildren()
|
static java.util.Map<Phrase,java.lang.Integer> |
HTMLDumper.makeMap(java.util.List<Phrase> clusters)
|
| Methods in cluster with parameters of type Phrase | |
|---|---|
void |
Phrase.addChild(Phrase t)
|
void |
ClusterDoc.addTermSetCount(Phrase termSet,
int n)
This document should record how frequently this termSet occured |
static double |
PhraseSupporter.calculateRelevance(java.util.Set<ClusterDoc> docs,
Phrase originalSet,
Phrase combined)
The interpretation is that if P(B|A) -> 1, then phrase b is seen every time phrase A occurs. |
static double |
PhraseSupporter.calculateRelevanceRelaxed(java.util.Set<ClusterDoc> docs,
Phrase originalSet,
Phrase combined)
An experiment when calculating how two clusters are related. |
static boolean |
PhraseSupporter.checkSet(java.util.Set<ClusterDoc> docs,
Phrase termSet,
double cutoff)
|
static Phrase |
Phrase.CombinedFactory(Phrase a,
Phrase b)
|
static double |
PhraseSupporter.findRelevanceRelaxed(ClusterDoc d,
Phrase set,
Phrase required,
int slackNum)
An experiment while calculating the relationship between two phrases Abandoned in favor of embedding alternative phrases within Phrase |
java.util.List<Term> |
Phrase.getCombinedOverlap(Phrase set2)
Assuming this set is in front |
java.util.Set<ClusterDoc> |
Phrase.getCoverUnion(Phrase s)
|
static int |
PhraseSupporter.getNumInstances(java.util.Set<ClusterDoc> docs,
Phrase set)
|
static int |
PhraseSupporter.getNumInstancesOfCombinedSet(ClusterDoc d,
Phrase setA,
Phrase setB)
|
static int |
PhraseSupporter.getNumInstancesOfCombinedSet(java.util.Set<ClusterDoc> docs,
Phrase thisI,
Phrase thisJ)
|
static int |
PhraseSupporter.getNumInstancesOfSet(ClusterDoc d,
Phrase set)
Returns how many windows (ie. |
static int |
PhraseSupporter.getNumInstancesOfSetRelaxed(ClusterDoc d,
Phrase set,
Phrase required,
int slackNum)
Set can occur in a sentence if at most slackNum words of the combined phrase are missing, and none of these missing words are in required |
static int |
PhraseSupporter.getNumInstancesOfSetSingle(ClusterDoc d,
Phrase set)
A faster implementation when there is only one set |
int |
ClusterDoc.getNumInstancesOfTermSet(Phrase s)
Each document should get a unique id |
int |
Phrase.getNumTermsShared(Phrase t)
|
static int |
PhraseSupporter.numDocsWithSet(java.util.Collection<ClusterDoc> docs,
Phrase p)
|
boolean |
Phrase.overLapTerms(Phrase set)
|
void |
Phrase.removeChild(Phrase toMove)
toMove's parent is set to null, so assign a new parent after calling this function |
void |
Phrase.setParent(Phrase parent)
|
| Method parameters in cluster with type arguments of type Phrase | |
|---|---|
static void |
TreeHelper.calculateMemberships(java.util.List<Phrase> clusters,
java.util.List<? extends ClusterDoc> files)
Assigns all documents into the hierarchy, following certain limitations and goals The number of clusters per doc is at most 4 Try to place documents in smaller clusters |
static java.util.List<Phrase> |
PhraseSupporter.checkSets(java.util.List<? extends ClusterDoc> docs,
java.util.List<Phrase> candidates,
int sufficientDocs)
Records in TestDoc the number of terms supported Records in TermSet the documents that cover each term |
static java.util.List<Phrase> |
TreeHelper.createClusterTemplate(java.util.List<? extends ClusterDoc> files,
java.util.List<Phrase> terms)
|
static java.util.List<Phrase> |
TreeHelper.createClusterTemplateWithCatBoost(java.util.List<? extends ClusterDoc> files,
java.util.List<Phrase> terms,
int cat)
This makes the hierarchy and defines the terms for each node If using snippets, abstracts, it may be possible to just use this |
static void |
TreeHelper.createHierarchyByCrossSimilarity(java.util.List<Phrase> allClusters)
allClusters will be reduced to only the root clusters, but the tree is traversable by looking at their child nodes |
static void |
HTMLDumper.createHTMLFiles(java.util.List<Phrase> clusters,
java.util.List<TestDoc> files,
java.io.File outDir,
java.lang.String catHeader)
|
static java.lang.String |
HTMLDumper.dumpGlobalClusters(java.util.List<Phrase> clusters,
int depth,
java.util.Map<Phrase,java.lang.Integer> map,
java.lang.String catHeader)
Dumps all the clusters |
static java.lang.String |
HTMLDumper.dumpGlobalClusters(java.util.List<Phrase> clusters,
int depth,
java.util.Map<Phrase,java.lang.Integer> map,
java.lang.String catHeader)
Dumps all the clusters |
static void |
PhraseFinder.findAndAddPhrases(boolean verbose,
java.util.List<? extends ClusterDoc> docs,
int vocabCutoff,
java.util.List<Phrase> set,
int phraseSize,
VectorManager vm)
Finds phrases and adds them to set |
static void |
OntologyBasedPhraseFinder.findPhrasesOfSize(java.util.List<Phrase> set,
java.util.List<TestDoc> docs,
VectorManager vm,
int[] phraseSizes,
int cat,
int cutoff)
|
static int |
TreeHelper.getCoverSize(java.util.List<Phrase> clusters)
Returns the total number of unique documents that are present somewhere in the tree |
static java.util.List<Phrase> |
TreeHelper.getFlatView(java.util.List<Phrase> clusters)
Uses a queue to traverse the tree and return everything in one list The parent-child relationships are preserved in the returned list |
static java.util.Map<Phrase,java.lang.Integer> |
HTMLDumper.makeMap(java.util.List<Phrase> clusters)
|
static void |
TreeHelper.mergeClusters(java.util.List<Phrase> rootClusters)
Merges the clusters together via different methods |
static void |
TreeHelper.printClusters(java.util.List<Phrase> clusters,
int depth)
|
static void |
TreeHelper.printTotalClusterSize(java.util.List<Phrase> clusters)
|
static void |
TreeHelper.pruneTree(java.util.List<Phrase> rootClusters,
java.util.List<? extends ClusterDoc> files)
|
static void |
TreeHelper.removeNonDescriptiveClusters(java.util.List<Phrase> clusters)
|
static void |
TreeHelper.removeSingleRoots(java.util.List<Phrase> rootClusters,
int necessaryCover,
int sufficientCover)
Remove roots that either have no children or whose cover is less than necessaryCover. |
static void |
TreeHelper.removeUselessLeaves(java.util.List<Phrase> clusters)
:C elegans using(341) :against C elegans(25) - remove this because against is stopword and C elegans was found in parent |
static void |
TreeHelper.restrictDepth(java.util.List<Phrase> rootLayer,
int maxd)
This method will reduce the depth of the tree by bringing up children as needed |
static void |
TreeHelper.showPhrases(java.util.List<Phrase> clusters)
Prints the tree out to screen |
static void |
TreeHelper.sortByNumChildren(java.util.List<Phrase> clusters)
|
| Constructors in cluster with parameters of type Phrase | |
|---|---|
Phrase(Phrase t)
Copy constructor |
|
| Uses of Phrase in Util |
|---|
| Methods in Util with parameters of type Phrase | |
|---|---|
void |
TestDoc.addTermSetCount(Phrase termSet,
int n)
|
void |
TermCounter.addTermSetCount(Phrase termSet,
int n)
|
int |
TestDoc.getNumInstancesOfTermSet(Phrase s)
|
int |
TermCounter.getNumInstancesOfTermSet(Phrase s)
|
| Uses of Phrase in websearch |
|---|
| Methods in websearch with parameters of type Phrase | |
|---|---|
void |
SearchResultDoc.addTermSetCount(Phrase termSet,
int n)
|
int |
SearchResultDoc.getNumInstancesOfTermSet(Phrase s)
|
|
||||||||||
| PREV NEXT | FRAMES NO FRAMES | |||||||||