Error handling in Hadoop MapReduce


A few questions to ask when working with error handling:

  1. Should the job be stopped if an error occurs during data validation? Most big data use cases can tolerate a few bad records, but if your use case requires every record to be valid, make that decision explicitly and follow the steps below.

    Sometimes it is better to let the job run while skipping the bad records, and in parallel capture the issues (errors) using the techniques below, then rectify them as you go.

  2. You may be willing to tolerate errors, but only a limited number of times. The number of times a task can fail with an exception before the entire job is killed is controlled by the properties below (see the driver sketch after the defaults):

    For map tasks: the mapreduce.map.maxattempts property

    For reduce tasks: the mapreduce.reduce.maxattempts property

    The default for both is 4.
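
    For example, a driver could raise these limits before submitting the job. This is a minimal sketch; the class name, the job name, and the value 6 are illustrative:

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.mapreduce.Job;

    public class MaxAttemptsDriver {
      public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();
        // allow each map task up to 6 attempts before the job is marked failed
        conf.setInt("mapreduce.map.maxattempts", 6);
        // allow each reduce task up to 6 attempts before the job is marked failed
        conf.setInt("mapreduce.reduce.maxattempts", 6);

        Job job = Job.getInstance(conf, "max-attempts-example");
        // ... configure mapper, reducer, input and output paths as usual ...
        System.exit(job.waitForCompletion(true) ? 0 : 1);
      }
    }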

  3. Handling malformed data.

    Suppose we decide to handle the malformed data. First, define the condition under which a record is considered bad. You can use counters to quickly report the number of bad records.

    In the Mapper class,

    enum Temperature { OVER_10 }

    Inside the map method,

    //parse the record

    if (value > 10) {
        System.err.println("Temperature over 10 degrees for input: " + value);
        context.setStatus("Detected possibly corrupt record: see logs.");
        context.getCounter(Temperature.OVER_10).increment(1);
    }
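
    Putting the fragments together, here is a minimal self-contained sketch of such a mapper. The class name, the input format (one integer temperature per line), and the output types are assumptions for illustration:

    import java.io.IOException;

    import org.apache.hadoop.io.IntWritable;
    import org.apache.hadoop.io.LongWritable;
    import org.apache.hadoop.io.NullWritable;
    import org.apache.hadoop.io.Text;
    import org.apache.hadoop.mapreduce.Mapper;

    public class TemperatureMapper
        extends Mapper<LongWritable, Text, IntWritable, NullWritable> {

      enum Temperature { OVER_10 }

      @Override
      protected void map(LongWritable key, Text value, Context context)
          throws IOException, InterruptedException {
        // parse the record; here we assume each line holds a single integer temperature
        int temperature;
        try {
          temperature = Integer.parseInt(value.toString().trim());
        } catch (NumberFormatException e) {
          // malformed line: skip it rather than failing the task
          return;
        }

        if (temperature > 10) {
          // flag the suspicious record but keep the job running
          System.err.println("Temperature over 10 degrees for input: " + value);
          context.setStatus("Detected possibly corrupt record: see logs.");
          context.getCounter(Temperature.OVER_10).increment(1);
        } else {
          context.write(new IntWritable(temperature), NullWritable.get());
        }
      }
    }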

    With this approach, all records get processed, and the counter is incremented for each bad record. You can see the counter value at the end of the job in the job statistics, through the web UI, or with the shell command below.

    $ mapred job -counter <job_id> '${fully_qualified_class_name}' ${enum_name}

    $ mapred job -counter job_1444655904448_17959 'com.YourMapper$Temperature' OVER_10

    Once you know the impact of the problem, i.e. the number of bad records, the next question is why the records are bad. For this, we need to go to the logs and search for the error messages.

    YARN provides log aggregation, which combines all the logs for an application and stores them in HDFS. They can be retrieved using:

    yarn logs -applicationId <application ID>
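
    For example, assuming the application ID corresponding to the job shown earlier (the ID and grep pattern here are illustrative), you can search the aggregated logs for the message the mapper wrote to stderr:

    $ yarn logs -applicationId application_1444655904448_17959 | grep "Temperature over 10 degrees"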