org.apache.mahout.classifier.df.mapreduce.partial
Class Step1Mapper
java.lang.Object
org.apache.hadoop.mapreduce.Mapper<KEYIN,VALUEIN,KEYOUT,VALUEOUT>
org.apache.mahout.classifier.df.mapreduce.MapredMapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,TreeID,MapredOutput>
org.apache.mahout.classifier.df.mapreduce.partial.Step1Mapper
public class Step1Mapper
- extends MapredMapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,TreeID,MapredOutput>
First step of the Partial Data Builder. Builds the trees using the data available in the InputSplit.
Predict the oob classes for each tree in its growing partition (input split).
| Nested classes/interfaces inherited from class org.apache.hadoop.mapreduce.Mapper |
org.apache.hadoop.mapreduce.Mapper.Context |
|
Method Summary |
protected void |
cleanup(org.apache.hadoop.mapreduce.Mapper.Context context)
|
protected void |
configure(Long seed,
int partition,
int numMapTasks,
int numTrees)
Useful when testing |
int |
getFirstTreeId()
|
protected void |
map(org.apache.hadoop.io.LongWritable key,
org.apache.hadoop.io.Text value,
org.apache.hadoop.mapreduce.Mapper.Context context)
|
static int |
nbTrees(int numMaps,
int numTrees,
int partition)
Compute the number of trees for a given partition. |
protected void |
setup(org.apache.hadoop.mapreduce.Mapper.Context context)
|
| Methods inherited from class org.apache.hadoop.mapreduce.Mapper |
run |
| Methods inherited from class java.lang.Object |
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait |
Step1Mapper
public Step1Mapper()
getFirstTreeId
public int getFirstTreeId()
setup
protected void setup(org.apache.hadoop.mapreduce.Mapper.Context context)
throws IOException,
InterruptedException
- Overrides:
setup in class MapredMapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,TreeID,MapredOutput>
- Throws:
IOException
InterruptedException
configure
protected void configure(Long seed,
int partition,
int numMapTasks,
int numTrees)
- Useful when testing
- Parameters:
partition - current mapper inputSplit partitionnumMapTasks - number of running map tasksnumTrees - total number of trees in the forest
nbTrees
public static int nbTrees(int numMaps,
int numTrees,
int partition)
- Compute the number of trees for a given partition. The first partition (0) may be longer than the rest of
partition because of the remainder.
- Parameters:
numMaps - total number of maps (partitions)numTrees - total number of trees to buildpartition - partition to compute the number of trees for
map
protected void map(org.apache.hadoop.io.LongWritable key,
org.apache.hadoop.io.Text value,
org.apache.hadoop.mapreduce.Mapper.Context context)
throws IOException,
InterruptedException
- Overrides:
map in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,TreeID,MapredOutput>
- Throws:
IOException
InterruptedException
cleanup
protected void cleanup(org.apache.hadoop.mapreduce.Mapper.Context context)
throws IOException,
InterruptedException
- Overrides:
cleanup in class org.apache.hadoop.mapreduce.Mapper<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text,TreeID,MapredOutput>
- Throws:
IOException
InterruptedException
Copyright © 2008–2014 The Apache Software Foundation. All rights reserved.