Normalizes Unicode to ASCII equivalents and remove Unicode from AI generated text from ChatGPT, Anthropic, Google and more.
-
Updated
Apr 28, 2025 - Python
Normalizes Unicode to ASCII equivalents and remove Unicode from AI generated text from ChatGPT, Anthropic, Google and more.
Rust Library to convert rich UTF-8 Text into plain ASCII Text
Ruby Gem: Removes invalid UTF8 characters & extra whitespace (carriage returns, new lines, tabs, spaces, etc.) from csv or strings.
🔤 UTF8-16-32 analysis and manipulation library
Add a description, image, and links to the utf8-sanitizer topic page so that developers can more easily learn about it.
To associate your repository with the utf8-sanitizer topic, visit your repo's landing page and select "manage topics."