How does Mapper class identify the SequenceFile as inputfile in hadoop? How does Mapper class identify the SequenceFile as inputfile in hadoop? hadoop hadoop

How does Mapper class identify the SequenceFile as inputfile in hadoop?


You do not need to manually read the sequence file. Just set theinput format class to sequence file:

job.setInputFormatClass(SequenceFileInputFormat.class);

and set the input path to the directory containing yor sequence files.

FileInputFormat.setInputPaths(<path to the dir containing your sequence files>);

You will need to pay attention to the (Key,Value) types of the inputs on the parameterized types of your Mapper class to match the (key,value) tuples inside your sequence file.