R+Hadoop: How to read CSV file from HDFS and execute mapreduce?
You can do something like below:
r.file <- hdfs.file(hdfsFilePath,"r")from.dfs( mapreduce( input = as.matrix(hdfs.read.text.file(r.file)), input.format = "csv", map = ...))
Please give points and hope anybody find it useful.
Note: For details refer to the stackoverflow post :
How to input HDFS file into R mapreduce for processing and get the result into HDFS file