Do we really need sorting in the MapReduce framework?


Well, yeah, you could group records by key with a hash table, as long as everything fits in memory. But once the amount of data you're working with exceeds your machine's memory, the hash table no longer fits and you have a problem.
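For comparison, here is roughly what the in-memory hash table approach looks like. This is only a sketch in Python: the name `group_in_memory` and the sample pairs are made up for illustration and are not part of Hadoop.

```python
from collections import defaultdict


def group_in_memory(pairs):
    """Group (key, value) pairs by key using a hash table (a dict).

    Every value is held in memory at once, so this only works while
    the whole dataset fits in RAM.
    """
    groups = defaultdict(list)
    for key, value in pairs:
        groups[key].append(value)
    return dict(groups)


# Hypothetical sample data: (key, value) pairs as a mapper might emit them.
pairs = [("b", 1), ("a", 2), ("b", 3)]
grouped = group_in_memory(pairs)
```

No sorting is involved here, but the memory needed grows with the total size of the input.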

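The solution is to write the data out to disk and do an external sort. Once the data is sorted, all records with the same key sit next to each other, so they can be grouped in a single pass without holding the whole dataset in memory.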
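To make the idea concrete, here is a minimal external merge sort sketch in Python. It is not Hadoop's actual implementation. The function names and the `max_lines_in_memory` parameter are assumptions for illustration, and it assumes each record is a newline-terminated text line of the form `key<TAB>value`.

```python
import heapq
import itertools
import os
import tempfile


def _write_run(sorted_lines):
    """Write one sorted chunk (a "run") to a temporary file and return its path."""
    fd, path = tempfile.mkstemp(suffix=".run", text=True)
    with os.fdopen(fd, "w") as f:
        f.writelines(sorted_lines)
    return path


def _read_run(path):
    """Lazily yield the lines of a run file, one at a time."""
    with open(path) as f:
        yield from f


def external_sort(lines, max_lines_in_memory=1000):
    """Yield newline-terminated lines in sorted order.

    At most max_lines_in_memory lines are held in memory while building
    runs; the merge step keeps only one line per run in memory.
    """
    run_paths = []
    it = iter(lines)
    try:
        # Phase 1: cut the input into chunks that fit in memory,
        # sort each chunk, and spill it to disk.
        while True:
            chunk = list(itertools.islice(it, max_lines_in_memory))
            if not chunk:
                break
            chunk.sort()
            run_paths.append(_write_run(chunk))
        # Phase 2: k-way merge of the sorted runs.
        yield from heapq.merge(*(_read_run(p) for p in run_paths))
    finally:
        # Clean up the temporary run files.
        for p in run_paths:
            os.remove(p)


# Hypothetical sample records, deliberately split into two runs.
records = ["b\t1\n", "a\t2\n", "b\t3\n", "a\t4\n"]
sorted_records = list(external_sort(records, max_lines_in_memory=2))
```

After this, lines with the same key are adjacent, so something like `itertools.groupby` over the sorted output can hand each key and its values to a reduce step one group at a time.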