public class DeprecatedLzoTextInputFormat
extends org.apache.hadoop.mapred.TextInputFormat
Note: to use this input format with hadoop-streaming, also set the property
stream.map.input.ignoreKey=true. This replicates the behavior of the
default TextInputFormat by stripping the byte-offset keys from the input lines
that are piped to the mapper process.
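
The effect of stream.map.input.ignoreKey can be pictured with a small, self-contained simulation. This is not Hadoop code: the class `IgnoreKeyDemo` and the method `toMapperInput` are hypothetical names, and the sketch assumes the default tab separator between key and value.

```java
public class IgnoreKeyDemo {

    /**
     * Renders one (byteOffset, line) record roughly the way it would reach a
     * streaming mapper on stdin. Hypothetical helper, for illustration only.
     */
    static String toMapperInput(long byteOffset, String line, boolean ignoreKey) {
        // With ignoreKey=true only the line text is passed on; otherwise the
        // byte-offset key is prepended, separated by a tab (assumed default).
        return ignoreKey ? line : byteOffset + "\t" + line;
    }

    public static void main(String[] args) {
        // Key kept: the mapper sees the offset, a tab, then the line.
        System.out.println(toMapperInput(11L, "second line", false));
        // Key ignored: the mapper sees only the line.
        System.out.println(toMapperInput(11L, "second line", true));
    }
}
```

A mapper script that expects plain text lines would need the second form, which is why the property is recommended above.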
See LzoInputFormatCommon for a description of the boolean property
lzo.text.input.format.ignore.nonlzo and how it affects the
behavior of this input format.
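
As a rough orientation before consulting LzoInputFormatCommon, the sketch below simulates how such a flag might filter input paths. It rests on assumptions that should be checked against that class: that a true value causes files without the `.lzo` suffix to be skipped, that a false value lets them through as plain text, and that `.lzo.index` files are never treated as input. The class `NonLzoFilterDemo` and the method `selectInputs` are hypothetical and use only the Java standard library.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

public class NonLzoFilterDemo {
    // Suffixes assumed for this illustration.
    static final String LZO_SUFFIX = ".lzo";
    static final String LZO_INDEX_SUFFIX = ".lzo.index";

    /** Returns the paths that would be kept as job input under the assumed rules. */
    static List<String> selectInputs(List<String> paths, boolean ignoreNonLzo) {
        List<String> kept = new ArrayList<>();
        for (String path : paths) {
            boolean isLzo = path.endsWith(LZO_SUFFIX);
            boolean isIndex = path.endsWith(LZO_INDEX_SUFFIX);
            // LZO files are always kept; other files are kept only when the
            // flag is false, and index files are dropped in either case.
            if (isLzo || (!ignoreNonLzo && !isIndex)) {
                kept.add(path);
            }
        }
        return kept;
    }

    public static void main(String[] args) {
        List<String> paths = Arrays.asList("part-0.lzo", "part-0.lzo.index", "notes.txt");
        System.out.println(selectInputs(paths, true));
        System.out.println(selectInputs(paths, false));
    }
}
```

In a real job the flag would be set on the JobConf under the property name given above; the filtering itself happens inside the input format.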
| Constructor and Description |
|---|
| DeprecatedLzoTextInputFormat() |
| Modifier and Type | Method and Description |
|---|---|
| org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text> | getRecordReader(org.apache.hadoop.mapred.InputSplit split, org.apache.hadoop.mapred.JobConf conf, org.apache.hadoop.mapred.Reporter reporter) |
| org.apache.hadoop.mapred.InputSplit[] | getSplits(org.apache.hadoop.mapred.JobConf conf, int numSplits) |
| protected boolean | isSplitable(org.apache.hadoop.fs.FileSystem fs, org.apache.hadoop.fs.Path filename) |
| protected org.apache.hadoop.fs.FileStatus[] | listStatus(org.apache.hadoop.mapred.JobConf conf) |

protected org.apache.hadoop.fs.FileStatus[] listStatus(org.apache.hadoop.mapred.JobConf conf)
throws IOException
Overrides: listStatus in class org.apache.hadoop.mapred.FileInputFormat<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text>
Throws: IOException

protected boolean isSplitable(org.apache.hadoop.fs.FileSystem fs,
org.apache.hadoop.fs.Path filename)
Overrides: isSplitable in class org.apache.hadoop.mapred.TextInputFormat

public org.apache.hadoop.mapred.InputSplit[] getSplits(org.apache.hadoop.mapred.JobConf conf,
int numSplits)
throws IOException
Specified by: getSplits in interface org.apache.hadoop.mapred.InputFormat<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text>
Overrides: getSplits in class org.apache.hadoop.mapred.FileInputFormat<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text>
Throws: IOException

public org.apache.hadoop.mapred.RecordReader<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text> getRecordReader(org.apache.hadoop.mapred.InputSplit split,
org.apache.hadoop.mapred.JobConf conf,
org.apache.hadoop.mapred.Reporter reporter)
throws IOException
Specified by: getRecordReader in interface org.apache.hadoop.mapred.InputFormat<org.apache.hadoop.io.LongWritable,org.apache.hadoop.io.Text>
Overrides: getRecordReader in class org.apache.hadoop.mapred.TextInputFormat
Throws: IOException

Copyright © 2016. All Rights Reserved.