[proposal] Default database collation to collate=C

When a database is created by the container's entrypoint, it uses the default collation that will be en_US.utf8.

As discussed here: https://github.com/odoo/odoo/pull/25196#issuecomment-396683972, this may be under-optimized with the use of LIKE queries using wildcards (LIKE 'foo%').

We have several possible axes of improvements:

* add text_pattern_ops case by case where necessary
* add trigram indices using pg_trgm which benefits for LIKE '%foo%' queries as well, case by case too however
* create the databases with a C collation and locale en_US.utf8 (collate=C)

This is mainly the last point which should be discussed here.

Pros:
* consistent sorting
* expected general improvement of performance

Cons:

* sorting of accented chars "sounds" wrong for French: Blanche, Béatrice, Claude is going to be sorted as Blanche, Béatrice, Claude instead of Béatrice, Blanche, Claude. Can be resolved with unaccent

My proposal is to change the calls to `createdb` in the image to always create them with collate=C.


Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

[proposal] Default database collation to collate=C #100

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[proposal] Default database collation to collate=C #100

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions