How can I store a graph and run PageRank-like analytics on it in HBase? hadoop

How can I store a graph in HBase and run PageRank-like analytics on it?


I think this question on SO could help:

https://stackoverflow.com/questions/9865738/is-it-possible-to-store-graphs-hbase-if-so-how-do-you-model-the-database-to-sup/9867563#9867563

This part of my answer to this question might be of use.

Using HBase/Accumulo as input to Giraph was recently (7 Mar 2012) submitted as a new feature request to Giraph: HBase/Accumulo Input and Output formats (GIRAPH-153)


We use Giraph this way: each vertex stores only the minimum data it needs, we run the graph algorithm with Giraph, and then we join the result with the rich data using Pig. For the PageRank algorithm, each vertex only needs to store its vertex ID and rank, so this approach can scale to almost the billion level.
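To illustrate the idea of keeping the per-vertex state minimal, here is a rough sketch in plain Python. It is not Giraph, Pig, or HBase code; it only simulates the same pattern on a single machine. The function names `pagerank` and `enrich`, the damping factor of 0.85, and the fixed number of supersteps are all assumptions made for this example.

```python
from collections import defaultdict


def pagerank(edges, num_supersteps=30, damping=0.85):
    """Vertex-centric PageRank where the only per-vertex state is id -> rank."""
    out_edges = defaultdict(list)
    vertices = set()
    for src, dst in edges:
        out_edges[src].append(dst)
        vertices.update((src, dst))

    n = len(vertices)
    rank = {v: 1.0 / n for v in vertices}

    for _ in range(num_supersteps):
        incoming = defaultdict(float)  # "messages" received by each vertex
        dangling = 0.0                 # rank held by vertices with no out-edges
        for v in vertices:
            targets = out_edges.get(v)
            if targets:
                share = rank[v] / len(targets)
                for t in targets:
                    incoming[t] += share
            else:
                dangling += rank[v]
        # Spread dangling rank evenly so the ranks keep summing to 1.
        rank = {
            v: (1.0 - damping) / n + damping * (incoming[v] + dangling / n)
            for v in vertices
        }
    return rank


def enrich(ranks, rich_data):
    """Join the computed ranks with the rich per-vertex data (the Pig step above)."""
    return [
        {"id": v, "rank": r, **rich_data.get(v, {})}
        for v, r in sorted(ranks.items())
    ]


if __name__ == "__main__":
    # Hypothetical toy graph and attributes, purely for illustration.
    edges = [("a", "b"), ("b", "c"), ("c", "a"), ("a", "c")]
    rich_data = {"a": {"title": "Page A"}, "b": {"title": "Page B"}}
    for row in enrich(pagerank(edges), rich_data):
        print(row)
```

The point of the split is that the iterative part only ever touches the small `id -> rank` mapping and the edge lists, while the wide attribute data is read once at the end during the join.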