
Target Replicas is 10 but found 3 replica(s)


The replication count for files submitted as part of your job (jars, etc.) is controlled by the parameter mapreduce.client.submit.file.replication (or mapred.submit.replication on pre-2.4 clusters) in mapred-site.xml. You can lower it on clusters with fewer than 10 nodes, or just ignore the message from fsck.
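
If you would rather not change mapred-site.xml cluster-wide, the same property can also be set on the job's Configuration at submit time, since the client reads it from the job configuration when it uploads resources. A minimal sketch, assuming a small (e.g. 3-node) cluster where a replication factor of 3 is acceptable for job resources; adjust the value to your cluster size:

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.mapreduce.Job;

public class SubmitWithLowerReplication {
  public static void main(String[] args) throws Exception {
    Configuration conf = new Configuration();
    // Replication factor used for job resources (job.jar, -files, -libjars).
    // The default is 10; lower it to match a small cluster.
    conf.setInt("mapreduce.client.submit.file.replication", 3);

    Job job = Job.getInstance(conf, "submit-replication-example");
    // ... set mapper/reducer and input/output paths, then job.waitForCompletion(true)
  }
}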

FWIW, there is a JIRA for this, but I doubt it will ever get worked on.


You can ignore /tmp/hadoop-yarn/staging/ubuntu/.staging/job_1450038005671_0025/job.jar; it is a job resource, and dfs.replication has no impact on job resources.

  1. Job resources such as jar files and files passed with -files (the distributed cache) are copied to HDFS with a replication factor of 10.
  2. While the job runs, these job resources (the code) are copied down to the containers/tasks that process the data.
  3. Once the job completes, these resources are automatically cleaned up based on retention thresholds.

This is what enables data locality (the code goes to the data) while the job processes its input.
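
To see this on your own cluster, here is a small sketch using the standard FileSystem API. The staging path is the job.jar from this question; the data-file path is a hypothetical example, so substitute a real file of your own.

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.FileSystem;
import org.apache.hadoop.fs.Path;

public class ReplicationCheck {
  public static void main(String[] args) throws Exception {
    FileSystem fs = FileSystem.get(new Configuration());

    // Staged job resource: written with mapreduce.client.submit.file.replication (default 10)
    Path jobJar = new Path("/tmp/hadoop-yarn/staging/ubuntu/.staging/job_1450038005671_0025/job.jar");
    // Ordinary data file: written with dfs.replication (e.g. 3); hypothetical path
    Path dataFile = new Path("/user/ubuntu/input/part-00000");

    System.out.println("job.jar replication:   " + fs.getFileStatus(jobJar).getReplication());
    System.out.println("data file replication: " + fs.getFileStatus(dataFile).getReplication());
  }
}

On a 3-node cluster with default settings, the first line will typically still print 10, which is exactly the mismatch fsck is warning about.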


The HDFS configuration file hdfs-site.xml should contain the dfs.replication property, which sets the block replication factor:

<configuration>
  <property>
    <name>dfs.replication</name>
    <value>3</value>
  </property>
</configuration>

The default location of hdfs-site.xml is /etc/hadoop/hdfs-site.xml.