Unicode Case Folding 

While working on cats-uri, I ran into an issue with how `CIString` was handling certain unicode values which led me to notice it wasn't respecting [Caseless matching](https://www.unicode.org/versions/Unicode14.0.0/ch05.pdf#G21790) from the Unicode standard. As it turns out, neither does `String.equalsIgnoreCase`.

[I'd just about completed a branch](https://github.com/isomarcte/case-insensitive/tree/full-unicode-case-folding) to implement full case folding as defined by the Unicode standard when I ran across [this test](https://github.com/typelevel/case-insensitive/blob/main/tests/shared/src/test/scala/org/typelevel/ci/CIStringSuite.scala#L35).

```scala
  test("character based equality") {
    assert(CIString("ß") != CIString("SS"))
  }
```

Since under the Unicode standard's caseless matching these two strings would compare equal, I'm beginning to think we are intentionally _not_ following the standard here. Is that the case? If so, why? Is it to maintain parity with what the Java standard library is doing with methods like `equalsIgnoresCase`?



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Unicode Case Folding #228

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Unicode Case Folding #228

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions