
Changing number of splits for Hadoop job


You should override the method public InputSplit[] getSplits(JobConf job, int numSplits) in your FileInputFormat1 class. Create your own class that implements InputSplit and carries the rectangle coordinates. The input format can then read those coordinates from each split and return the correct key/value pairs to the mapper. The implementation of getSplits in FileInputFormat is probably a helpful reference.
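As a rough illustration, here is a minimal sketch of what such a split class and the tiling logic could look like. It deliberately uses only the JDK so it can be run without Hadoop on the classpath. RectangleSplit, RectangleTiler and the choice to cut the grid into horizontal bands are all hypothetical names and assumptions, not part of Hadoop or of the original question. In a real job, RectangleSplit would be declared as implementing org.apache.hadoop.mapred.InputSplit (whose methods it mirrors here), and your getSplits override would return the result of tile(...) as an InputSplit[].

```java
import java.io.DataInput;
import java.io.DataOutput;
import java.io.IOException;
import java.util.ArrayList;
import java.util.List;

/** Hypothetical split describing one rectangle of a larger grid. */
class RectangleSplit {
    int x, y, width, height;

    // No-arg constructor: Hadoop creates splits reflectively before calling readFields.
    RectangleSplit() {}

    RectangleSplit(int x, int y, int width, int height) {
        this.x = x;
        this.y = y;
        this.width = width;
        this.height = height;
    }

    // Same shape as Writable.write: serialize the rectangle coordinates.
    public void write(DataOutput out) throws IOException {
        out.writeInt(x);
        out.writeInt(y);
        out.writeInt(width);
        out.writeInt(height);
    }

    // Same shape as Writable.readFields: restore the coordinates in the same order.
    public void readFields(DataInput in) throws IOException {
        x = in.readInt();
        y = in.readInt();
        width = in.readInt();
        height = in.readInt();
    }

    // Same shape as InputSplit.getLength: here, the number of cells in the rectangle.
    public long getLength() {
        return (long) width * height;
    }

    // Same shape as InputSplit.getLocations: no locality hints in this sketch.
    public String[] getLocations() {
        return new String[0];
    }
}

/** Hypothetical helper holding the logic a getSplits override could delegate to. */
class RectangleTiler {
    /** Cuts a totalWidth x totalHeight grid into at most numSplits horizontal bands. */
    static List<RectangleSplit> tile(int totalWidth, int totalHeight, int numSplits) {
        // Never produce more bands than rows, and always produce at least one.
        int bands = Math.max(1, Math.min(numSplits, totalHeight));
        int base = totalHeight / bands;   // rows every band gets
        int extra = totalHeight % bands;  // leftover rows, given to the first bands
        List<RectangleSplit> splits = new ArrayList<>();
        int y = 0;
        for (int i = 0; i < bands; i++) {
            int h = base + (i < extra ? 1 : 0);
            splits.add(new RectangleSplit(0, y, totalWidth, h));
            y += h;
        }
        return splits;
    }
}
```

Inside the record reader that your input format creates for each split, you would then cast the incoming InputSplit back to your own split class and use its coordinates to decide which key/value pairs to emit. Note that numSplits is only a hint from the framework, so returning a different number of splits is fine.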