java.lang.Object
  org.apache.mahout.text.LuceneStorageConfiguration
public class LuceneStorageConfiguration
extends Object
implements org.apache.hadoop.io.Writable
Holds all the configuration for SequenceFilesFromLuceneStorage, which generates a sequence file
with id as the key and a content field as value.
| Constructor Summary | |
|---|---|
| LuceneStorageConfiguration() | |
| LuceneStorageConfiguration(org.apache.hadoop.conf.Configuration conf) | Deserializes a LuceneStorageConfiguration from a Configuration. |
| LuceneStorageConfiguration(org.apache.hadoop.conf.Configuration configuration, List<org.apache.hadoop.fs.Path> indexPaths, org.apache.hadoop.fs.Path sequenceFilesOutputPath, String idField, List<String> fields) | Create a configuration bean with all mandatory parameters. |

| Method Summary | | |
|---|---|---|
| boolean | equals(Object o) | |
| org.apache.hadoop.conf.Configuration | getConfiguration() | |
| List<String> | getFields() | |
| String | getIdField() | |
| List<org.apache.hadoop.fs.Path> | getIndexPaths() | |
| int | getMaxHits() | |
| org.apache.lucene.search.Query | getQuery() | |
| Iterator<Pair<org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>> | getSequenceFileIterator() | Returns an Iterator which returns (Text, Text) Pairs of the produced sequence files. |
| org.apache.hadoop.fs.Path | getSequenceFilesOutputPath() | |
| org.apache.lucene.document.DocumentStoredFieldVisitor | getStoredFieldVisitor() | |
| int | hashCode() | |
| void | readFields(DataInput in) | |
| org.apache.hadoop.conf.Configuration | serialize() | Serializes this object in a Hadoop Configuration. |
| void | setMaxHits(int maxHits) | |
| void | setQuery(org.apache.lucene.search.Query query) | |
| void | write(DataOutput out) | |

| Methods inherited from class java.lang.Object |
|---|
| clone, finalize, getClass, notify, notifyAll, toString, wait, wait, wait |

| Constructor Detail |
|---|
public LuceneStorageConfiguration(org.apache.hadoop.conf.Configuration configuration,
List<org.apache.hadoop.fs.Path> indexPaths,
org.apache.hadoop.fs.Path sequenceFilesOutputPath,
String idField,
List<String> fields)
Parameters:
configuration - Hadoop configuration for writing sequence files
indexPaths - paths to the index
sequenceFilesOutputPath - path to output the sequence file
idField - field used for the key of the sequence file
fields - field(s) used for the value of the sequence file

public LuceneStorageConfiguration()

public LuceneStorageConfiguration(org.apache.hadoop.conf.Configuration conf)
throws IOException
Deserializes a LuceneStorageConfiguration from a Configuration.
Parameters:
conf - the Configuration object with a serialized LuceneStorageConfiguration
Throws:
IOException - if deserialization fails

| Method Detail |
|---|
public org.apache.hadoop.conf.Configuration serialize()
throws IOException
Serializes this object in a Hadoop Configuration.
Returns:
Configuration object with a String serialization
Throws:
IOException - if serialization fails

public Iterator<Pair<org.apache.hadoop.io.Text,org.apache.hadoop.io.Text>> getSequenceFileIterator()
Returns an Iterator which returns (Text, Text) Pairs of the produced sequence files.

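The "String serialization" that serialize() stores and the Configuration constructor reads back can be pictured with a small, library-free sketch. Everything below is an assumption made for illustration: a plain Map stands in for a Hadoop Configuration, the property name in KEY is hypothetical, and Base64 is only one plausible way to turn the bytes into a String. It is not the encoding this class is documented to use.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInputStream;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.Base64;
import java.util.HashMap;
import java.util.Map;

/** Hypothetical illustration only; a Map stands in for a Hadoop Configuration. */
final class StringSerializationSketch {
  // Hypothetical property name, chosen for this example.
  static final String KEY = "example.lucene.storage.configuration";

  /** Hypothetical subset of the settings such a bean might carry. */
  static final class Settings {
    final String idField;
    final int maxHits;

    Settings(String idField, int maxHits) {
      this.idField = idField;
      this.maxHits = maxHits;
    }
  }

  /** Packs the settings into bytes and stores them as one Base64 String property. */
  static Map<String, String> serialize(Settings settings) {
    ByteArrayOutputStream bytes = new ByteArrayOutputStream();
    try (DataOutputStream out = new DataOutputStream(bytes)) {
      out.writeUTF(settings.idField);
      out.writeInt(settings.maxHits);
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
    Map<String, String> conf = new HashMap<>();
    conf.put(KEY, Base64.getEncoder().encodeToString(bytes.toByteArray()));
    return conf;
  }

  /** Reads the property back in the same field order; throws if the property is missing. */
  static Settings deserialize(Map<String, String> conf) {
    String encoded = conf.get(KEY);
    if (encoded == null) {
      throw new IllegalStateException("No serialized settings under " + KEY);
    }
    byte[] raw = Base64.getDecoder().decode(encoded);
    try (DataInputStream in = new DataInputStream(new ByteArrayInputStream(raw))) {
      String idField = in.readUTF();
      int maxHits = in.readInt();
      return new Settings(idField, maxHits);
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }

  public static void main(String[] args) {
    Map<String, String> conf = serialize(new Settings("id", 100));
    Settings restored = deserialize(conf);
    System.out.println(restored.idField + " " + restored.maxHits);
  }
}
```

Storing the whole bean under a single String property keeps the carrier object generic: any component that receives the carrier can rebuild the bean without knowing its individual fields.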
public org.apache.hadoop.conf.Configuration getConfiguration()
public org.apache.hadoop.fs.Path getSequenceFilesOutputPath()
public List<org.apache.hadoop.fs.Path> getIndexPaths()
public String getIdField()
public List<String> getFields()
public void setQuery(org.apache.lucene.search.Query query)
public org.apache.lucene.search.Query getQuery()
public void setMaxHits(int maxHits)
public int getMaxHits()
public org.apache.lucene.document.DocumentStoredFieldVisitor getStoredFieldVisitor()
public void write(DataOutput out)
throws IOException
Specified by:
write in interface org.apache.hadoop.io.Writable
Throws:
IOException

public void readFields(DataInput in)
throws IOException
Specified by:
readFields in interface org.apache.hadoop.io.Writable
Throws:
IOException

public boolean equals(Object o)
Overrides:
equals in class Object

public int hashCode()
Overrides:
hashCode in class Object
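
The write/readFields pair together with equals and hashCode implies a round-trip contract: an instance written to a DataOutput and read back from a DataInput should compare equal to the original. The sketch below shows that pattern with plain java.io streams. StorageConfigSketch is a hypothetical stand-in, not LuceneStorageConfiguration, and its field layout and default maxHits value are assumptions made for the example.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.DataInput;
import java.io.DataInputStream;
import java.io.DataOutput;
import java.io.DataOutputStream;
import java.io.IOException;
import java.io.UncheckedIOException;
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.Objects;

/** Hypothetical stand-in for a Writable-style configuration bean; not the Mahout class. */
final class StorageConfigSketch {
  private String idField = "";
  private List<String> fields = new ArrayList<>();
  private int maxHits = Integer.MAX_VALUE; // assumed default for this sketch

  // No-arg constructor: an empty instance must exist before readFields can fill it.
  StorageConfigSketch() {}

  StorageConfigSketch(String idField, List<String> fields) {
    this.idField = idField;
    this.fields = new ArrayList<>(fields);
  }

  void setMaxHits(int maxHits) { this.maxHits = maxHits; }

  int getMaxHits() { return maxHits; }

  // Same shape as Writable.write: emit every field in a fixed order.
  void write(DataOutput out) throws IOException {
    out.writeUTF(idField);
    out.writeInt(fields.size());
    for (String field : fields) {
      out.writeUTF(field);
    }
    out.writeInt(maxHits);
  }

  // Same shape as Writable.readFields: consume the fields in the order write emitted them.
  void readFields(DataInput in) throws IOException {
    idField = in.readUTF();
    int count = in.readInt();
    fields = new ArrayList<>(count);
    for (int i = 0; i < count; i++) {
      fields.add(in.readUTF());
    }
    maxHits = in.readInt();
  }

  @Override
  public boolean equals(Object o) {
    if (this == o) return true;
    if (!(o instanceof StorageConfigSketch)) return false;
    StorageConfigSketch other = (StorageConfigSketch) o;
    return maxHits == other.maxHits
        && idField.equals(other.idField)
        && fields.equals(other.fields);
  }

  @Override
  public int hashCode() {
    return Objects.hash(idField, fields, maxHits);
  }

  /** Writes the bean to an in-memory buffer and reads it into a fresh instance. */
  static StorageConfigSketch roundTrip(StorageConfigSketch original) {
    try {
      ByteArrayOutputStream bytes = new ByteArrayOutputStream();
      try (DataOutputStream out = new DataOutputStream(bytes)) {
        original.write(out);
      }
      StorageConfigSketch copy = new StorageConfigSketch();
      try (DataInputStream in =
          new DataInputStream(new ByteArrayInputStream(bytes.toByteArray()))) {
        copy.readFields(in);
      }
      return copy;
    } catch (IOException e) {
      throw new UncheckedIOException(e);
    }
  }

  public static void main(String[] args) {
    StorageConfigSketch original = new StorageConfigSketch("id", Arrays.asList("title", "body"));
    original.setMaxHits(100);
    System.out.println(roundTrip(original).equals(original));
  }
}
```

Because readFields must consume exactly what write produced, adding a field to one method without the other breaks the round trip, which is why an equality check on a round-tripped copy makes a useful regression test.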