Skip to content

Unicode file name do not checkout correctly on windows #203

Open
@garyvdm

Description

@garyvdm

Steps to reproduce:

dulwich clone https://github.com/garyvdm/git_unicode_files.git
dir git_unicode_files

expected: 1 file named À (which is u'\u00c0')
actual: the file is named À (which is u'\u00c3\u20ac')

the file name is what you get if you do u'\u00c0'.encode('utf8').decode('mbcs'). mbcs it the default filesystem charter encoding used on windows.

The git client handles this correctly. I'll take a look at their source code in the future to try figure out how they handle this.

Metadata

Metadata

Assignees

No one assigned

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions