Azure Data Lake Gen 1 vs Gen 2

azure azure-data-lake

Basically, think of gen2 as a superset of gen1 plus all of the best parts of blob storage: tiers, HDFS and object store API's and presumably the ability to efficiently handle the management of over 35K files and efficiently dealing with many small sizes and more trickle write type operations.. plus its cheaper.

I'm trying to get some clarity on a few specifics but not finding much in the meantime try these links:

https://azure.microsoft.com/en-us/blog/a-closer-look-at-azure-data-lake-storage-gen2/

https://docs.microsoft.com/en-us/azure/storage/data-lake-storage/introduction

azure azure-data-lake

Azure data lake storage Gen2 is a super set of Azure data lake Gen 1. It also called as a "no-compromise data lake" by Microsoft. Gen 2 extends Azure blob storage capabilities and it is best optimized for analytics workloads. It can store data once and access via existing blob storage and HDFS-compliant file system interfaces with no programming changes or data copying when doing database operations since it supports atomic file and folder operations.
At present, it is only available in West US 2 and West Central US data centers. But it will be expanded into other data centers in the near future according to Microsoft.

azure azure-data-lake

There is a Microsoft doc that talks about the the differences. For Example:

Data Organization:

Gen1

Hierarchical namespace, File and folder support.

Gen2

Hierarchical namespace, container, file and folder support

Geo-redundancy:

Gen1

LRS.

Gen2

LRS, ZRS, GRS, RA-GRS.

Ecosystem:

Gen1

HDInsight (3.6), Azure Databricks (3.1 and above), SQL DW, ADF

Gen2

HDInsight (3.6, 4.0), Azure Databricks (5.1 and above), SQL DW, ADF

CodeHunter

Azure Data Lake Gen 1 vs Gen 2

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last