COLLECT_SET() in Hive, keep duplicates? COLLECT_SET() in Hive, keep duplicates? hadoop hadoop