Modifier and Type | Method and Description |
---|---|
void |
Cluster.clutoClusterSimilarityGraph(DISCO disco,
int n,
float minSim,
java.lang.String outputDir)
Creates a sparse graph file that can be clustered with CLUTO's
scluster program.Important note: This method only works with word spaces of type DISCO.WordspaceType.SIM ! |
static ReturnDataBN |
Cluster.filterOutliers(DISCO disco,
java.lang.String word,
int n)
This method takes the list of the n most similar words of the
input word and filters out all words that do not appear in the
similarity list of at least one of the other similar
words of the input word.
The resulting list of similar words will have size <= n. Important note: This method only works with word spaces of type DISCO.WordspaceType.SIM . |
static java.lang.String[] |
Cluster.growSet(DISCO disco,
java.lang.String[] inputSet)
Retrieves the similar words for all the words in the input set
and extends the input set by all words that appear in the
similarity lists of all the input words.
|
java.util.ArrayList<Rank.WordAndRank> |
Rank.highestRanking(DISCO disco,
java.util.Set<java.lang.String> words,
DISCO.WordspaceType type)
Finds the words in the index in whose similarity or collocation lists the
words rank highest. |
int |
Rank.rankSim(DISCO disco,
java.lang.String w1,
java.lang.String w2)
Computes the rank of w2 in the similarity list of w1.
|
float |
DISCO.secondOrderSimilarity(java.lang.String w1,
java.lang.String w2)
Computes the second order semantic similarity between the input words
based on the sets of their distributionally similar words.
Important note: This method only works with word spaces of type DISCO.WordspaceType.SIM . |
ReturnDataBN |
DISCO.similarWords(java.lang.String word)
Looks up the input word in the index and returns its semantically
similar words ordered by decreasing similarity together
with their similarity values.
If the search word isn't found in the word space, the return value is null .The similarity values in the result can differ from the values you get with DISCO.semanticSimilarity for the same word pair. |