Uses of Class
cluster.Phrase

Packages that use Phrase
benchmark Classes that benchmark the system. 
cluster Classes that deal with the unsupervised clustering. 
Util Lots of classes that are used throughout the engine, and could possibly be used in another project. 
websearch Classes for searching general web results. 
 

Uses of Phrase in benchmark
 

Methods in benchmark with parameters of type Phrase
 void ClusterableReuters.addTermSetCount(Phrase termSet, int n)
           
 int ClusterableReuters.getNumInstancesOfTermSet(Phrase s)
           
 

Uses of Phrase in cluster
 

Methods in cluster that return Phrase
static Phrase Phrase.CombinedFactory(Phrase a, Phrase b)
           
 Phrase Phrase.getParent()
           
 

Methods in cluster that return types with arguments of type Phrase
static java.util.List<Phrase> PhraseSupporter.checkSets(java.util.List<? extends ClusterDoc> docs, java.util.List<Phrase> candidates, int sufficientDocs)
          Records in TestDoc the number of terms supported
Records in TermSet the documents that cover each term
static java.util.List<Phrase> TreeHelper.createClusterTemplate(java.util.List<? extends ClusterDoc> files, java.util.List<Phrase> terms)
           
static java.util.List<Phrase> TreeHelper.createClusterTemplateWithCatBoost(java.util.List<? extends ClusterDoc> files, java.util.List<Phrase> terms, int cat)
          This makes the hierarchy and defines the terms for each node
If using snippets, abstracts, it may be possible to just use this
static java.util.List<Phrase> TreeHelper.getFlatView(java.util.List<Phrase> clusters)
          Uses a queue to traverse the tree and return everything in one list The parent-child relationships are preserved in the returned list
 java.util.List<Phrase> Phrase.getTermSetChildren()
           
static java.util.Map<Phrase,java.lang.Integer> HTMLDumper.makeMap(java.util.List<Phrase> clusters)
           
 

Methods in cluster with parameters of type Phrase
 void Phrase.addChild(Phrase t)
           
 void ClusterDoc.addTermSetCount(Phrase termSet, int n)
          This document should record how frequently this termSet occured
static double PhraseSupporter.calculateRelevance(java.util.Set<ClusterDoc> docs, Phrase originalSet, Phrase combined)
          The interpretation is that if P(B|A) -> 1, then phrase b is seen every time phrase A occurs.
static double PhraseSupporter.calculateRelevanceRelaxed(java.util.Set<ClusterDoc> docs, Phrase originalSet, Phrase combined)
          An experiment when calculating how two clusters are related.
static boolean PhraseSupporter.checkSet(java.util.Set<ClusterDoc> docs, Phrase termSet, double cutoff)
           
static Phrase Phrase.CombinedFactory(Phrase a, Phrase b)
           
static double PhraseSupporter.findRelevanceRelaxed(ClusterDoc d, Phrase set, Phrase required, int slackNum)
          An experiment while calculating the relationship between two phrases
Abandoned in favor of embedding alternative phrases within Phrase
 java.util.List<Term> Phrase.getCombinedOverlap(Phrase set2)
          Assuming this set is in front
 java.util.Set<ClusterDoc> Phrase.getCoverUnion(Phrase s)
           
static int PhraseSupporter.getNumInstances(java.util.Set<ClusterDoc> docs, Phrase set)
           
static int PhraseSupporter.getNumInstancesOfCombinedSet(ClusterDoc d, Phrase setA, Phrase setB)
           
static int PhraseSupporter.getNumInstancesOfCombinedSet(java.util.Set<ClusterDoc> docs, Phrase thisI, Phrase thisJ)
           
static int PhraseSupporter.getNumInstancesOfSet(ClusterDoc d, Phrase set)
          Returns how many windows (ie.
static int PhraseSupporter.getNumInstancesOfSetRelaxed(ClusterDoc d, Phrase set, Phrase required, int slackNum)
          Set can occur in a sentence if at most slackNum words of the combined phrase are missing, and none of these missing words are in required
static int PhraseSupporter.getNumInstancesOfSetSingle(ClusterDoc d, Phrase set)
          A faster implementation when there is only one set
 int ClusterDoc.getNumInstancesOfTermSet(Phrase s)
          Each document should get a unique id
 int Phrase.getNumTermsShared(Phrase t)
           
static int PhraseSupporter.numDocsWithSet(java.util.Collection<ClusterDoc> docs, Phrase p)
           
 boolean Phrase.overLapTerms(Phrase set)
           
 void Phrase.removeChild(Phrase toMove)
          toMove's parent is set to null, so assign a new parent after calling this function
 void Phrase.setParent(Phrase parent)
           
 

Method parameters in cluster with type arguments of type Phrase
static void TreeHelper.calculateMemberships(java.util.List<Phrase> clusters, java.util.List<? extends ClusterDoc> files)
          Assigns all documents into the hierarchy, following certain limitations and goals
The number of clusters per doc is at most 4
Try to place documents in smaller clusters
static java.util.List<Phrase> PhraseSupporter.checkSets(java.util.List<? extends ClusterDoc> docs, java.util.List<Phrase> candidates, int sufficientDocs)
          Records in TestDoc the number of terms supported
Records in TermSet the documents that cover each term
static java.util.List<Phrase> TreeHelper.createClusterTemplate(java.util.List<? extends ClusterDoc> files, java.util.List<Phrase> terms)
           
static java.util.List<Phrase> TreeHelper.createClusterTemplateWithCatBoost(java.util.List<? extends ClusterDoc> files, java.util.List<Phrase> terms, int cat)
          This makes the hierarchy and defines the terms for each node
If using snippets, abstracts, it may be possible to just use this
static void TreeHelper.createHierarchyByCrossSimilarity(java.util.List<Phrase> allClusters)
          allClusters will be reduced to only the root clusters, but the tree is traversable by looking at their child nodes
static void HTMLDumper.createHTMLFiles(java.util.List<Phrase> clusters, java.util.List<TestDoc> files, java.io.File outDir, java.lang.String catHeader)
           
static java.lang.String HTMLDumper.dumpGlobalClusters(java.util.List<Phrase> clusters, int depth, java.util.Map<Phrase,java.lang.Integer> map, java.lang.String catHeader)
          Dumps all the clusters
static java.lang.String HTMLDumper.dumpGlobalClusters(java.util.List<Phrase> clusters, int depth, java.util.Map<Phrase,java.lang.Integer> map, java.lang.String catHeader)
          Dumps all the clusters
static void PhraseFinder.findAndAddPhrases(boolean verbose, java.util.List<? extends ClusterDoc> docs, int vocabCutoff, java.util.List<Phrase> set, int phraseSize, VectorManager vm)
          Finds phrases and adds them to set
static void OntologyBasedPhraseFinder.findPhrasesOfSize(java.util.List<Phrase> set, java.util.List<TestDoc> docs, VectorManager vm, int[] phraseSizes, int cat, int cutoff)
           
static int TreeHelper.getCoverSize(java.util.List<Phrase> clusters)
          Returns the total number of unique documents that are present somewhere in the tree
static java.util.List<Phrase> TreeHelper.getFlatView(java.util.List<Phrase> clusters)
          Uses a queue to traverse the tree and return everything in one list The parent-child relationships are preserved in the returned list
static java.util.Map<Phrase,java.lang.Integer> HTMLDumper.makeMap(java.util.List<Phrase> clusters)
           
static void TreeHelper.mergeClusters(java.util.List<Phrase> rootClusters)
          Merges the clusters together via different methods
static void TreeHelper.printClusters(java.util.List<Phrase> clusters, int depth)
           
static void TreeHelper.printTotalClusterSize(java.util.List<Phrase> clusters)
           
static void TreeHelper.pruneTree(java.util.List<Phrase> rootClusters, java.util.List<? extends ClusterDoc> files)
           
static void TreeHelper.removeNonDescriptiveClusters(java.util.List<Phrase> clusters)
           
static void TreeHelper.removeSingleRoots(java.util.List<Phrase> rootClusters, int necessaryCover, int sufficientCover)
          Remove roots that either have no children or whose cover is less than necessaryCover.
static void TreeHelper.removeUselessLeaves(java.util.List<Phrase> clusters)
          :C elegans using(341) :against C elegans(25) - remove this because against is stopword and C elegans was found in parent
static void TreeHelper.restrictDepth(java.util.List<Phrase> rootLayer, int maxd)
          This method will reduce the depth of the tree by bringing up children as needed
static void TreeHelper.showPhrases(java.util.List<Phrase> clusters)
          Prints the tree out to screen
static void TreeHelper.sortByNumChildren(java.util.List<Phrase> clusters)
           
 

Constructors in cluster with parameters of type Phrase
Phrase(Phrase t)
          Copy constructor
 

Uses of Phrase in Util
 

Methods in Util with parameters of type Phrase
 void TestDoc.addTermSetCount(Phrase termSet, int n)
           
 void TermCounter.addTermSetCount(Phrase termSet, int n)
           
 int TestDoc.getNumInstancesOfTermSet(Phrase s)
           
 int TermCounter.getNumInstancesOfTermSet(Phrase s)
           
 

Uses of Phrase in websearch
 

Methods in websearch with parameters of type Phrase
 void SearchResultDoc.addTermSetCount(Phrase termSet, int n)
           
 int SearchResultDoc.getNumInstancesOfTermSet(Phrase s)