Regarding Hadoop Security via Kerberos Regarding Hadoop Security via Kerberos hadoop hadoop

Regarding Hadoop Security via Kerberos


Are Token and Ticket same ?

No. Tickets are issued by Kerberos and then servers in Hadoop (NameNode or JobTracker) issue tokens to provide authentication within the Hadoop cluster. Hadoop does not rely on Kerberos to authenticate running tasks, for instance, but uses its own tokens that were issued based on the Kerberos tickets.

The Apache doc also "DataNodes do not enforce any access control on accesses to its data blocks.

I'm guessing you're taking that from the JIRA where access control was provided (https://issues.apache.org/jira/browse/HADOOP-4359) via BlockAccessTokens. Assuming this is turned on - which it should be in a secure cluster - one cannot access a block on a datanode without such a token, which is issued by the NameNode after authentication and authorization via Kerberos and HDFS' own file system permissions.


How can I access the Datanode and write data arbitrarily ?

I am not sure what you mean here. Do you mean when the user does not have permission? As Jacob mentionedyou will not get a valid BlockAccessToken unless the user has the permissions to access the Data based on the file system permissions, assuming that you have secure Hadoop cluster.