Databricks : difference between mount and direct access of Data Lake Storage Gen 2 Databricks : difference between mount and direct access of Data Lake Storage Gen 2 azure azure

Databricks : difference between mount and direct access of Data Lake Storage Gen 2


If you mount storage all users on all clusters get access.

If you do not mount and connect directly in the session using either a service principal or a SAS (I don't think a SAS key is officially supported BTW) the user in that session must have access to the credentials to create the connection.

Service Principals can also have low lever permissions applied within the lake, such as restricting to certain folders.

Note that with ADLS Gen2 you now also have the option of passing through the user credentials: https://docs.azuredatabricks.net/spark/latest/data-sources/azure/adls-passthrough.html

I do not know of any performance differences.