Skip to content

missing utf-8 BOM lead to codec failures during tests on windows #175

Open
@aogier

Description

@aogier

FYI, as #174 stimulated my curiosity, one of the two test files lacks BOM bytes:

$ hd  -n4 test/texts/sherlock.txt
00000000  ef bb bf 50                                       |...P|
00000004
$ hd  -n4 test/texts/senate-bills.txt
00000000  32 31 73 74                                       |21st|
00000004

and this seems not to please windows' machines (unices will probably have a better autodetection thing). I don't see this as a problem per se, as leaving correct input mangling is probably a user's task but it lowers x-files factor on previous commits ;)

thank you, regards

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions