Run first Hadoop (on unix) program (on Windows 7)

java unix hadoop virtual-machine vmware

For the tutorial you're using you should be running this on your Hadoop machine. If you want to run it from another machine you can but if it is Windows you will likely have trouble since the Libraries are built on Linux.

To run a job remotely you could do the following to get started. You will need to put a little effort into Job configuration.

Install the core Hadoop Jar and libraries needed on the Windows machine. You could use a Web Application for this.
Then build your code that will configure and call the Job. This will be straight forward with the exception of needing to set the following values in order for the Job to find the correct HDFS and Map Reducer
conf.set("mapred.job.tracker", "192.168.x.x:54311");
conf.set("fs.default.name", "hdfs://192.168.x.x:54310");

You will need more code than this so read up on Hadoop API here

EDIT: The input file should be placed on or moved into the HDFS file system that is defined within the hdfs-site.xml. If you have not formatted the HDFS file system you will need to do that first. Take a look at the HDFS user guide HERE for more details.

Lastly, if you need additional help you will have to post the code you're trying to run and the error output as well.

Good luck,Pat

CodeHunter

Run first Hadoop (on unix) program (on Windows 7)

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last