How to retrieve YARN logs programmatically using Java

java hadoop apache-spark hadoop-yarn

Yes, you can. YarnClient gives you most of the key information about an application, and you can make REST calls to the Spark History Server API. The endpoint you're looking for is:

/applications/[base-app-id]/logs
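A minimal sketch of calling that endpoint from Java, using only the standard library. The host and the output file name are assumptions for illustration. The /api/v1 prefix and port 18080 are the defaults described in Spark's monitoring documentation. As far as that documentation describes it, this endpoint returns the application's event logs as a zip archive, so the sketch saves the response to a file rather than reading it as text.

```java
import java.io.IOException;
import java.io.InputStream;
import java.net.HttpURLConnection;
import java.net.URI;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.nio.file.StandardCopyOption;

class HistoryServerLogs {

    // Builds the full URL of the logs endpoint for one application.
    // A trailing slash on the server address is tolerated.
    static String logsUrl(String historyServer, String appId) {
        String base = historyServer.endsWith("/")
                ? historyServer.substring(0, historyServer.length() - 1)
                : historyServer;
        return base + "/api/v1/applications/" + appId + "/logs";
    }

    // Sends a GET request and copies the response body (a zip archive) to target.
    static void download(String historyServer, String appId, Path target) throws IOException {
        HttpURLConnection conn =
                (HttpURLConnection) URI.create(logsUrl(historyServer, appId)).toURL().openConnection();
        conn.setRequestMethod("GET");
        try {
            int status = conn.getResponseCode();
            if (status != HttpURLConnection.HTTP_OK) {
                throw new IOException("Unexpected HTTP status " + status);
            }
            try (InputStream in = conn.getInputStream()) {
                Files.copy(in, target, StandardCopyOption.REPLACE_EXISTING);
            }
        } finally {
            conn.disconnect();
        }
    }

    public static void main(String[] args) throws IOException {
        // Hypothetical History Server address and output file name.
        download("http://localhost:18080", "application_1492795815045_3940",
                Paths.get("event-logs.zip"));
    }
}
```

Keeping the URL construction in its own method makes it easy to check the request target without a running History Server.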


I wanted to do it programmatically in Java, so I finally took a look at the code behind the command:

yarn logs -applicationId applicationid

which is in:

src/main/java/org/apache/hadoop/yarn/client/cli/LogsCLI.java

I now retrieve the logs in a string (content). The code is:

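import java.io.ByteArrayOutputStream;
import java.io.PrintStream;
import java.nio.charset.StandardCharsets;
import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.security.UserGroupInformation;
import org.apache.hadoop.yarn.api.records.ApplicationId;
import org.apache.hadoop.yarn.logaggregation.LogCLIHelpers;
import org.apache.hadoop.yarn.util.ConverterUtils;

String applicationId = "application_1492795815045_3940";
ApplicationId appId = ConverterUtils.toApplicationId(applicationId);

LogCLIHelpers logCliHelper = new LogCLIHelpers();
Configuration config = new Configuration();
logCliHelper.setConf(config);

// Aggregated logs are stored per user, so the application owner is needed
String appOwner = UserGroupInformation.getCurrentUser().getShortUserName();

// Write as UTF-8 so the bytes match the charset used to decode them below
ByteArrayOutputStream baos = new ByteArrayOutputStream();
PrintStream ps = new PrintStream(baos, true, StandardCharsets.UTF_8.name());

// Retrieve the logs of all containers (this signature may differ in later Hadoop releases)
logCliHelper.dumpAllContainersLogs(appId, appOwner, ps);
ps.flush();

String content = new String(baos.toByteArray(), StandardCharsets.UTF_8);
System.out.println(content);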


Your shell approach is correct.

Since yarn is already an executable program on your system, the current Java process (i.e., the current JVM) can use it by starting a new child process to do the job.

The following code may help:

import java.io.BufferedReader;
import java.io.IOException;
import java.io.InputStreamReader;

public class YarnLog {

    // Runs "yarn logs -applicationId <appid>" as a child process and prints its output.
    public static void getYarnLog(String appid) throws IOException {
        Process p = Runtime.getRuntime().exec(
                new String[] {"yarn", "logs", "-applicationId", appid});
        // try-with-resources closes the reader even if reading fails
        try (BufferedReader br = new BufferedReader(new InputStreamReader(p.getInputStream()))) {
            String line;
            while ((line = br.readLine()) != null) {
                System.out.println(line);
            }
        }
    }
}

Once the child process has terminated successfully, its output has been printed line by line to standard output; redirect that output to a file if you want to work with the logs as normal files in your current working directory.
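One way to get the logs into a file in the working directory is to let ProcessBuilder redirect the child's output. This is only a sketch under stated assumptions: the class name, method names, and output file name are hypothetical, and it assumes the yarn executable is on the PATH of the JVM's environment. It also checks the application ID against the usual application_<clusterTimestamp>_<sequenceNumber> shape before building the command.

```java
import java.io.File;
import java.io.IOException;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

class YarnLogToFile {

    // YARN application IDs look like application_<clusterTimestamp>_<sequenceNumber>.
    private static final Pattern APP_ID = Pattern.compile("application_\\d+_\\d+");

    // Builds the argument list for the CLI. Rejecting malformed IDs keeps
    // stray arguments out of the command line.
    static List<String> buildCommand(String appId) {
        if (appId == null || !APP_ID.matcher(appId).matches()) {
            throw new IllegalArgumentException("Not a YARN application ID: " + appId);
        }
        return Arrays.asList("yarn", "logs", "-applicationId", appId);
    }

    // Runs the command, sends stdout and stderr to outputFile, and returns the exit code.
    static int dumpToFile(String appId, File outputFile)
            throws IOException, InterruptedException {
        ProcessBuilder pb = new ProcessBuilder(buildCommand(appId));
        pb.redirectErrorStream(true);   // merge stderr into stdout
        pb.redirectOutput(outputFile);  // the child writes directly to the file
        return pb.start().waitFor();
    }

    public static void main(String[] args) throws Exception {
        // Hypothetical output file name in the current working directory.
        File out = new File("application_1492795815045_3940.log");
        int exitCode = dumpToFile("application_1492795815045_3940", out);
        System.out.println("yarn logs exited with code " + exitCode);
    }
}
```

Because the child writes to the file itself, the parent does not have to drain a pipe, and the exit code shows whether the command succeeded.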
