Not able to parse input using KeyValueTextInputFormat in hadoop mapreduce

If using Hadoop 2.x, parameter is

mapreduce.input.keyvaluelinerecordreader.key.value.separator

Can you share a sample of your input data??

If you are using the new API (hadoop 2.x), I see from the API that the correct parameter to set is mapreduce.input.keyvaluelinerecordreader.key.value.separator.

I.e., use mapreduce, instead of mapred.

UPDATE: It could also be that the delimiter ':' appears more than once in your input. For example, if an input record is key1: : value1 value2 value3, then you would get something like what you describe in your question. If such is the case, then you should choose the delimiter properly, so that it appears exactly once.

hadoop mapreduce delimiter

How to change the default key-value output seperator in Hadoop MapReduce

For KeyValueTextInputFormat the input line should be a key value pair seperated by "\t"

Key1     Value1,Value2

By changing default seperator, You will be able to read as you wish.

For New Api

Here is the solution

//New APIConfiguration conf = new Configuration();conf.set("key.value.separator.in.input.line", ","); Job job = new Job(conf);job.setInputFormatClass(KeyValueTextInputFormat.class);

CodeHunter

Not able to parse input using KeyValueTextInputFormat in hadoop mapreduce

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last