Using hive table over parquet in Pig Using hive table over parquet in Pig hadoop hadoop

Using hive table over parquet in Pig


If you have files with 2 different schemas, the following seems to be sensible:

  1. Split up the files, based on which schema they have
  2. Make tables out of them
  3. If desirable, load the individual tables and store them into a supertable