Convert git repository file encoding Convert git repository file encoding git git

Convert git repository file encoding


You can do this with git filter-branch. The idea is that you have to change the encoding of the files in every commit, rewriting each commit as you go.

First, write a script that changes the encoding of every file in the repository. It could look like this:

#!/bin/shfind . -type f -print | while read f; do        mv -i "$f" "$f.recode.$$"        iconv -f iso-8859-1 -t utf-8 < "$f.recode.$$" > "$f"        rm -f "$f.recode.$$"done

Then use git filter-branch to run this script over and over again, once per commit:

git filter-branch --tree-filter /tmp/recode-all-files HEAD

where /tmp/recode-all-files is the above script.

Right after the repository is freshly upgraded from CVS, you probably have just one branch in git with a linear history back to the beginning. If you have several branches, you may need to enhance the git filter-branch command to edit all the commits.