Azure Data Factory copy activity from Storage to SQL: hangs at 70000 rows


Just to answer my own question in case it helps anyone else:

The issue was with null values. My run hung at 70,000 rows because row 76,560 of my blob source file had a null value in one of the columns. The Hive script I had used to generate the blob file wrote the null value as '\N'. In addition, the corresponding column in my sink SQL table was a FLOAT declared 'NOT NULL'.
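For anyone trying to track down a similar row, here is a minimal sketch of how the offending rows could be located in a local copy of the blob file. The function name `find_null_rows` and the comma delimiter are my own assumptions for illustration (Hive text output often uses a different delimiter), so adjust them to match your file.

```python
import csv
import io

HIVE_NULL = r"\N"  # the token Hive wrote for NULL in my file


def find_null_rows(lines, delimiter=",", null_token=HIVE_NULL):
    """Yield (row_number, column_indexes) for each row containing the null token.

    `lines` is any iterable of text lines, e.g. an open file.
    The delimiter is an assumption; change it to match the blob file.
    """
    reader = csv.reader(lines, delimiter=delimiter)
    for row_number, row in enumerate(reader, start=1):
        null_columns = [i for i, value in enumerate(row) if value == null_token]
        if null_columns:
            yield row_number, null_columns


# Small in-memory sample standing in for the real blob file.
sample = io.StringIO("1,2.5\n2,\\N\n3,4.0\n")
hits = list(find_null_rows(sample))
print(hits)
```

On a real file you would pass `open(path, newline="")` instead of the in-memory sample.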

So I made two changes. First, I added the following property to my blob dataset definition:

"nullValue": "\\N"

Second, I made my SQL table column nullable. The copy now runs to completion and doesn't hang! :)

The real problem is that Data Factory did not raise an error; it just got stuck. It would have been nice if the job had failed with a useful error message telling me which row of the data was the problem. I think it got stuck at 70,000 rather than 76,560 because the write batch size is 10,000 by default.
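The batch arithmetic behind that guess can be sketched as follows. This assumes rows are written in sequential batches and that the batch containing the bad row is never committed; that is my reading of the behaviour, not something I have confirmed. The helper name `rows_committed_before` is made up for illustration.

```python
def rows_committed_before(bad_row, batch_size=10_000):
    """Return how many rows are written before the batch holding `bad_row`.

    Assumes 1-based row numbers and sequential, all-or-nothing batches.
    """
    return (bad_row - 1) // batch_size * batch_size


# Row 76,560 sits in the batch covering rows 70,001 to 80,000,
# so only the first seven batches would have been written.
print(rows_committed_before(76_560))  # 70000
```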