Why is querying Parquet files is slower than text files in Hive?

First I would like to just point out that it is virtually impossible to answer your question with the given details.

Few points:

measuring time in a distributed environment is not the way to determine if something is slow (if you have many queries running and competing for resources you are not measuring what you think you are measuring)
not providing the actual table definition and the queries running against those tables makes this problem impossible to reproduce
not providing the number of rows of the table and the cardinality its individual fields is also not helping

In general, querying Parquet is much faster than querying text files because Parquet employs many things to make read operations much faster. Few of these things:

compression
run length encoding
dictionary encoding

Depending on the use case some of the parameters of things can be tuned to the exact use case.

CodeHunter

Why is querying Parquet files is slower than text files in Hive?

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last