
Write to multiple outputs by key in Scalding on Hadoop, using one MapReduce job


Scalding provides TemplatedTsv (from version 0.9.0rc16 onward), which works the same way as Cascading's TemplateTap.

// In Hadoop mode, this creates one directory per country under the "output" path.
Tsv(args("input"), ('COUNTRY, 'GDP)).read.write(TemplatedTsv(args("output"), "%s", 'COUNTRY))
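
To make the resulting layout concrete, here is a rough plain-Scala sketch of how a "%s" template maps each row's COUNTRY value to its own subdirectory. It does not use Scalding or Cascading at all, and it is not how TemplateTap is implemented internally. The object name, the helper functions, and the sample rows are all made up for illustration.

```scala
// Plain-Scala illustration only; no Scalding or Cascading classes are used.
object TemplateLayoutSketch {
  // Hypothetical sample rows of (COUNTRY, GDP); the values are made up.
  val rows: Seq[(String, Long)] = Seq(
    ("US", 100L),
    ("FR", 30L),
    ("US", 120L)
  )

  // Resolve the directory a row lands in: base path + "/" + formatted template.
  def targetDir(basePath: String, template: String, key: String): String =
    basePath + "/" + template.format(key)

  // Group rows by their resolved directory, mimicking one output directory per key.
  def layout(basePath: String, template: String): Map[String, Seq[(String, Long)]] =
    rows.groupBy { case (country, _) => targetDir(basePath, template, country) }

  // Print each resolved directory with the rows that would be routed to it.
  def main(args: Array[String]): Unit =
    layout("output", "%s").toSeq.sortBy(_._1).foreach { case (dir, rs) =>
      println(s"$dir -> ${rs.mkString(", ")}")
    }
}
```

With a template of "%s" the subdirectory name is just the key itself. A template such as "country=%s" (again, only an example) would put the same rows under directories carrying that prefix.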


This suggestion on the Cascading user group recommends using Cascading's TemplateTap. I am not sure how to connect it to Scalding, though.