How to find duplicate filenames (recursively) in a given directory? BASH

Here is another solution (based on the suggestion by @jim-mcnamara) without awk:

Solution 1

#!/bin/sh dirname=/path/to/directoryfind $dirname -type f | sed 's_.*/__' | sort|  uniq -d| while read fileNamedofind $dirname -type f | grep "$fileName"done

However, you have to do the same search twice. This can become very slow if you have to search a lot of data. Saving the "find" results in a temporary file might give a better performance.

Solution 2 (with temporary file)

#!/bin/sh dirname=/path/to/directorytempfile=myTempfileNamefind $dirname -type f  > $tempfilecat $tempfile | sed 's_.*/__' | sort |  uniq -d| while read fileNamedo grep "/$fileName" $tempfiledone#rm -f $tempfile

Since you might not want to write a temp file on the harddrive in some cases, you can choose the method which fits your needs. Both examples print out the full path of the file.

Bonus question here: Is it possible to save the whole output of the find command as a list to a variable?

bash duplicates filenames

Yes this is a really old question.But all those loops and temporary files seem a bit cumbersome.

Here's my 1-line answer:

find /PATH/TO/FILES -type f -printf '%p/ %f\n' | sort -k2 | uniq -f1 --all-repeated=separate

It has its limitations due to uniq and sort:

no whitespace (space, tab) in filename (will be interpreted as new field by uniq and sort)
needs file name printed as last field delimited by space (uniq doesn't support comparing only 1 field and is inflexible with field delimiters)

But it is quite flexible regarding its output thanks to find -printf and works well for me. Also seems to be what @yak tried to achieve originally.

Demonstrating some of the options you have with this:

find  /PATH/TO/FILES -type f -printf 'size: %s bytes, modified at: %t, path: %h/, file name: %f\n' | sort -k15 | uniq -f14 --all-repeated=prepend

Also there are options in sort and uniq to ignore case (as the topic opener intended to achieve by piping through tr). Look them up using man uniq or man sort.

bash duplicates filenames

#!/bin/shdirname=/path/to/checkfind $dirname -type f | while read vodo  echo `basename "$vo"`done | awk '{arr[$0]++; next} END{for (i in arr){if(arr[i]>1){print i}}}

CodeHunter

How to find duplicate filenames (recursively) in a given directory? BASH

Here's my 1-line answer:

Recent Posts

How can I color dots in a xy scatterplot according to column value?

How to update a claim in ASP.NET Identity?

What does {0} mean when initializing an object?

Accessing members of items in a JSONArray with Java

How to log SQL statements in Spring Boot?

Powershell Get-WebSite name parameter is ignored

How to detect scroll to bottom of html element

Java synchronized method

How to test controllers with CodeIgniter?

Detect Visual Composer

Matplotlib: Specify format of floats for tick labels

Rails join a list of strings with commas and "and" before the last