Alternatives to scalding for HBase access from Scala (or Java) Alternatives to scalding for HBase access from Scala (or Java) hadoop hadoop

Alternatives to scalding for HBase access from Scala (or Java)


If you're using scalding (which I recommend) there's a new project with updated cascading and scalding wrappers for accessing HBase. You might want to check it out - https://github.com/ParallelAI/SpyGlass


You may be interested in the Kiji project (https://github.com/kijiproject/). It provides a "schema-ed" layer on top of HBase.

It also has a Scalding adapter (KijiExpress) so that you can do functional collections operations (map, groupby, etc.) on "pipes" of tuples sourced from these schema-ed HBase tables.