
Getting existing mapreduce job from cluster (the job could be running or completed)


The problem was with a recent YARN upgrade that required enabling the MR history server on my system; enabling it fixed the issue. I recently upgraded from MR v1 to v2, and with that upgrade all completed jobs are now moved to the history server.
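For reference, a minimal sketch of pointing the MapReduce client at the history server programmatically. The property names are the standard Hadoop ones; the host and port are placeholders, and in practice these values usually live in mapred-site.xml rather than in code:

    import org.apache.hadoop.conf.Configuration;

    // Client-side configuration sketch; normally set in mapred-site.xml.
    Configuration conf = new Configuration();
    conf.set("mapreduce.framework.name", "yarn");
    // IPC address of the JobHistoryServer (10020 is the default port).
    conf.set("mapreduce.jobhistory.address", "historyserver-host:10020");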


You are looking for Cluster.getAllJobStatuses(), which returns a JobStatus[]:

    List<JobStatus> runningJobs = new ArrayList<JobStatus>();
    List<JobStatus> completedJobs = new ArrayList<JobStatus>();
    for (JobStatus job : cluster.getAllJobStatuses()) {
        if (!job.isJobComplete()) {
            runningJobs.add(job);
        } else {
            completedJobs.add(job);
        }
    }

    // list of running JobIDs
    for (JobStatus rjob : runningJobs) {
        System.out.println(rjob.getJobID().toString());
    }

    // list of completed JobIDs
    for (JobStatus cjob : completedJobs) {
        System.out.println(cjob.getJobID().toString());
    }

    // to print out a short report on running jobs:
    // displayJobList(runningJobs.toArray(new JobStatus[0]));
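
For completeness, a rough sketch of obtaining the cluster handle used above, assuming the Hadoop configuration files (core-site.xml, mapred-site.xml, yarn-site.xml) are on the classpath; the class name is just an example:

    import org.apache.hadoop.conf.Configuration;
    import org.apache.hadoop.mapreduce.Cluster;
    import org.apache.hadoop.mapreduce.JobStatus;

    public class ListJobs {
        public static void main(String[] args) throws Exception {
            // Reads cluster connection details from the configuration on the classpath.
            Configuration conf = new Configuration();
            Cluster cluster = new Cluster(conf);
            try {
                // Print every job the cluster (and history server) knows about.
                for (JobStatus job : cluster.getAllJobStatuses()) {
                    System.out.println(job.getJobID() + " complete=" + job.isJobComplete());
                }
            } finally {
                cluster.close();
            }
        }
    }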