java - Generating Multiple Output files with Hadoop 0.20+ -


I am trying to output the results of my reducer to multiple files, the data results are all contained in one file, And the remaining results are divided according to the category of their respective files. I know with 0.18 that you can do it with multiple outputs and it has not been removed. However, I am trying to make my application 0.20+ compliant. Existing multiple output functionality still requires JobConf (which uses my application job, and configuration). Based on the key, how do I generate multiple outputs?

Support for multiple outputs is not in 0.20. You have to use the old API.

It has been added to 0.21 which is currently absent as org.apache.hadoop.mapreduce.lib.output.MultipleOutputs.

Talks about this problem on the mailing list.


Comments

Popular posts from this blog

sql - dynamically varied number of conditions in the 'where' statement using LINQ -

asp.net mvc - Dynamically Generated Ajax.BeginForm -

Debug on symbian -