HDFS Delegation token expired even after adding principal to command line
Issue solved! Adding the following config to the spark-submit command line when starting the job fixed it:
--conf spark.hadoop.fs.hdfs.impl.disable.cache=true
Alternatively, you can set this at the YARN configuration level so it applies globally.
I tested it and the job has been running fine for 3 days.
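For reference, a full invocation would look roughly like this (the class name, jar, and master settings are placeholders for your own job):

spark-submit \
  --master yarn \
  --deploy-mode cluster \
  --conf spark.hadoop.fs.hdfs.impl.disable.cache=true \
  --class com.example.MyStreamingJob \
  my-streaming-job.jar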
Thanks
This is a couple of years late, but just in case anybody stumbles on this:
Disabling the FS cache (fs.hdfs.impl.disable.cache=true) means FileSystem#get will create a new FileSystem every time it is called. Instead, it looks like the application master can refresh the delegation token itself if you pass --keytab to spark-submit.
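As a sketch, a long-running job could be submitted like this; the keytab path, principal, class, and jar are illustrative, and to my understanding --keytab is normally paired with --principal:

spark-submit \
  --master yarn \
  --deploy-mode cluster \
  --principal user@EXAMPLE.COM \
  --keytab /path/to/user.keytab \
  --class com.example.MyStreamingJob \
  my-streaming-job.jar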
Even after setting this configuration, the Spark job can still fail. We were facing the same issue.
A delegation token is valid only for 24 hours. YARN renews the token every 24 hours automatically until it reaches the max lifetime (which is 7 days); after that the token cannot be renewed anymore and needs to be reissued, which is why the application fails.
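For reference, those intervals correspond to the standard HDFS delegation token settings; you can check your cluster's actual values (shown here with the usual defaults, in milliseconds) before relying on the 24-hour/7-day figures:

hdfs getconf -confKey dfs.namenode.delegation.token.renew-interval   # default 86400000 ms (24 hours)
hdfs getconf -confKey dfs.namenode.delegation.token.max-lifetime     # default 604800000 ms (7 days)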
This might help to solve the problem: https://community.cloudera.com/t5/Support-Questions/Long-running-Spark-streaming-job-that-crashes-on-HDP-2-3-4-7/td-p/181658