Changing number of splits for Hadoop job
You should override method public InputSplit[] getSplits(JobConf job, int numSplits)
in your FileInputFormat1
class. Create your own class based on InputSplit
with rectangle coordinates, so inside FileInputFormat
you can get this information to return correct key/value pairs to mapper.Probably implementation of getSplits
in FileInputFormat
could help you see here.