DELETE FROM table_name Cloudera Impala DELETE FROM table_name Cloudera Impala hadoop hadoop

DELETE FROM table_name Cloudera Impala


refer from book -learning-cloudera-impala

Impala does not support dropping or deleting a row in a table. The alternative is to either drop the table or migrate the required data to other tables and then delete the entire original table.

To simulate the effects of an UPDATE or DELETE statement in other database systems, typically you use INSERT or CREATE TABLE AS SELECT to copy data from one table to another, filtering out or changing the appropriate rows during the copy operation.

work around


Short answer: No, DELETE is not supported in Impala. The workaround is to rewrite table data with the data you want to delete not in there.

Cloudera Impala, while it supports SQL and can be used for data warehouse workloads, is not like a traditional RDBMS. Like Hive, it stores its files in HDFS (and is inter-operable with Hive in many ways), and as such, is designed to store very large files in blocks.

Thus, it, like the HDFS it depends on, is not designed to effectively delete data.