Description
In the response object I get, all non-ASCII characters are escaped, so I see "\u201cquote\u201d"
instead of '“quote”'
.
It would be nice to have an option to get UTF-8 responses back. And since non-utf-8 environments are rather rare nowadays, I'd suggest that it could be the default.
In a way this is just a minor nit, but it's also about accuracy. If my document says "It’s", then returning "It\u2019s" is incorrect, that's not what it says.
Edit: this probably stems from the default behaviour of the Python json module (ensure_ascii=True
) and if I handle the response myself and manually save the JSON it looks right. So really the issue is that the JSON files produced aren't very human-readable if you have a lot of non-ASCII characters, and if that's not a major concern, feel free to close.
I guess if I really want to be able to see the correct text in the JSON file output (and have used LocalUploaderConfig
) I can always just load the files and save them again with ensure_ascii=False
.