|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||
java.lang.Objectorg.apache.mahout.text.wikipedia.WikipediaDatasetCreatorDriver
public final class WikipediaDatasetCreatorDriver
Create and run the Wikipedia Dataset Creator.
| Method Summary | |
|---|---|
static void |
main(String[] args)
Takes in two arguments: The input Path where the input documents live
The output Path where to write the classifier as a
SequenceFile
|
static void |
runJob(String input,
String output,
String catFile,
boolean exactMatchOnly,
Class<? extends org.apache.lucene.analysis.Analyzer> analyzerClass)
Run the job |
| Methods inherited from class java.lang.Object |
|---|
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
| Method Detail |
|---|
public static void main(String[] args)
throws IOException,
InterruptedException
Path where the input documents livePath where to write the classifier as a
SequenceFile
IOException
InterruptedException
public static void runJob(String input,
String output,
String catFile,
boolean exactMatchOnly,
Class<? extends org.apache.lucene.analysis.Analyzer> analyzerClass)
throws IOException,
InterruptedException,
ClassNotFoundException
input - the input pathname Stringoutput - the output pathname StringcatFile - the file containing the Wikipedia categoriesexactMatchOnly - if true, then the Wikipedia category must match exactly instead of simply containing the
category string
IOException
InterruptedException
ClassNotFoundException
|
||||||||||
| PREV CLASS NEXT CLASS | FRAMES NO FRAMES | |||||||||
| SUMMARY: NESTED | FIELD | CONSTR | METHOD | DETAIL: FIELD | CONSTR | METHOD | |||||||||