pitt.search.semanticvectors
Class BuildIndex

java.lang.Object
  extended by pitt.search.semanticvectors.BuildIndex

public class BuildIndex
extends java.lang.Object

Command line utility for creating semantic vector indexes.


Constructor Summary
BuildIndex()
           
 
Method Summary
static void main(java.lang.String[] args)
          Builds term vector and document vector stores from a Lucene index.
static void usage()
          Prints the following usage message:
BuildIndex class in package pitt.search.semanticvectors
Usage: java pitt.search.semanticvectors.BuildIndex PATH_TO_LUCENE_INDEX
BuildIndex creates termvectors and docvectors files in local directory.
 
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
 

Constructor Detail

BuildIndex

public BuildIndex()
Method Detail

usage

public static void usage()
Prints the following usage message:
BuildIndex class in package pitt.search.semanticvectors
Usage: java pitt.search.semanticvectors.BuildIndex PATH_TO_LUCENE_INDEX
BuildIndex creates termvectors and docvectors files in local directory.
Other parameters that can be changed include vector length,
(number of dimensions), seed length (number of non-zero
entries in basic vectors), minimum term frequency,
and number of iterative training cycles.
To change these use the following command line arguments:
-d [number of dimensions]
-s [seed length]
-m [minimum term frequency]
-n [number non-alphabet characters (-1 for any number)]
-tc [training cycles]
-docs [incremental|inmemory] Switch between building doc vectors incrementally"
(requires positional index) or all in memory (default case).


main

public static void main(java.lang.String[] args)
                 throws java.lang.IllegalArgumentException
Builds term vector and document vector stores from a Lucene index.

Parameters:
args -
Throws:
java.lang.IllegalArgumentException
See Also:
usage()