how to use rcfilepigstorage in pig how to use rcfilepigstorage in pig hadoop hadoop

how to use rcfilepigstorage in pig


Include all the twitter dependencies and the pig jars and compile the RCFilePigStorage.java. If you want to change some specific behavior in the code, do the changes also and can rename it to MyRCFilePigStorage.java.

Now take the class files generated after compiling and create a jar file named MyRCUdf.jar. Register this jar in your pigscript.

Register MyRCUdf.jar;* your pig logic*Store 'data' using MyRCFilePigStorage();

EDIT:Consider the following links for twitter dependencies. Take the source code, compile and include the classes generated in your classpath

https://github.com/kevinweil/elephant-bird/blob/master/core/src/main/java/com/twitter/elephantbird/mapreduce/input/MapReduceInputFormatWrapper.java

https://github.com/kevinweil/elephant-bird