apache phoenix Join query performance

apache hadoop join hbase phoenix

By default, Phoenix uses hash-joins, requiring the data to fit in memory. If you run into problems (with very large tables), you can increase the amount of memory allocated to Phoenix (config setting) or set a query "hint" (ie. SELECT /*+ USE_SORT_MERGE_JOIN*/ FROM ...) to use sort-merge joins which do not have the same requirement. They plan to auto-detect the ideal join algorithm in the future. Additionally, Phoenix currently supports only a subset of join operations.

apache hadoop join hbase phoenix

Did u try the LHS & RHS concept which has been described at the phoenix documentation as a performance optimizing feature(http://phoenix.apache.org/joins.html)? Incase of an inner join the RHS of the join will be built as a hash table in the server cache so please ensure that your smaller table forms the RHS of the inner join.Were the columns u were selecting in the query a part of the secondary index u created?If u have tried the above and still getting a latency in minutes then u need to check the memory of Hbase region servers and whether they are sufficient to serve your query.

CodeHunter

apache phoenix Join query performance

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last