Optimise the `Japanese` calendar #7323

robertbastian · 2025-12-16T12:45:52Z

With the Japanese calendars split (#7334 ), there is some optimisation potential:

For Japanese, we don't need to store the DataPayload with the full map anymore. The five known eras are hardcoded in code anyway, so all we have to store in the calendar object is the potential next era. The start date can be packed, because the year does not need to be i32, we can store it relative to the year 2000. This makes both Japanese and AnyCalendar significantly smaller and Copy. However, as we only store one era, it requires clients to update ICU4X at least once per era. Given that the last eras were 30 and 62 years long, I think that is acceptable.
For Japanese, we don't need to use a lookup map in DTF anymore, we can now assign icu4x_era_indexes and use linear storage. This cuts down era name size by 20kB.

sffc · 2025-12-18T22:31:23Z

We intentionally kept Japanese data-driven since the new eras get announced with short notice and we want people to be able to pull them in if they use dynamic data without having to ship a library update.

robertbastian · 2025-12-18T22:52:25Z

And? This is still data-driven.

sffc

It's an interesting proposal. It means that at most 1 post-reiwa era will be available until the library is updated. The advantage is that the type is a bit smaller and faster.

My concern would be, what if there are two eras in fast succession? Like, if there is a new emperor who leaves power very quickly after ascending to power. In that case, we'd format dates correctly for the latest emperor, which is good, but we would "forget" about the previous emperor until the library is updated. Maybe that's fine?

robertbastian · 2025-12-19T19:54:44Z

We could store two eras. Each era costs 11 bytes. All other "good" calendars are ZSTs, so this basically determines the size of any future enum calendar. 23 bytes might be fine, 34 bytes is probably too big. We have to weigh the risks; I personally don't expect Japan to have any kind of succession drama in the next 30 years - by then ICU4X will probably look different anyway.

sffc · 2025-12-19T20:06:46Z

I think we should support either 1 or many data-driven eras. I don't see a really good reason to support 2 or 3. I'm not comfortable making a unilateral decision in a code review to support just 1.

Manishearth · 2025-12-19T20:16:17Z

I've been considering a change like this for a while, especially if we get rid of JapaneseExtended (then AnyCalendar can be cheaply cloned without an Arc). However, this design was a deliberate choice because we wanted to use ICU4X's data architecture for this. In retrospect: I'm not sure if we needed to, data that updates once every few decades does not need our data architecture. Our hardcoded calendar data is more likely to change than this.

Overall I think we should work on this with care and discussion. Optimizing datetimeformat is a good motivator, and fixing the Arc is a good motivator, but there are other ways to fix the Arc, and there may be other ways to optimize datetime format.

An early implementation of this type had the data being "empty" in cases where there were just five eras. Worth considering optimizations of that shape.

But yes, it would be nice to get rid of all AnyCalendar data loading completely.

sffc · 2025-12-19T20:19:13Z

This still does data loading, I think, it just picks off one value from the data struct instead of storing the whole data struct in order to reduce stack size.

Manishearth · 2025-12-19T20:20:47Z

Yep, I'm idly wondering about designs that remove data loading entirely.

I think the "load data but only store the last era" is a clever fix.

robertbastian force-pushed the japanext2 branch from 0c70d86 to 578eda7 Compare December 16, 2025 12:46

robertbastian mentioned this pull request Dec 17, 2025

Remove JapaneseExtended #7322

Open

robertbastian force-pushed the japanext2 branch 2 times, most recently from 81161a7 to 076477a Compare December 18, 2025 16:08

robertbastian mentioned this pull request Dec 19, 2025

Untangle Japanese calendars #7334

Merged

robertbastian force-pushed the japanext2 branch 5 times, most recently from 6d26de8 to 3a25082 Compare December 19, 2025 14:15

robertbastian added 2 commits December 19, 2025 20:31

optimize era storage

d1ad434

era indices

2d248bb

robertbastian force-pushed the japanext2 branch from 3a25082 to 2d248bb Compare December 19, 2025 19:31

robertbastian marked this pull request as ready for review December 19, 2025 19:31

robertbastian requested review from a team, Manishearth and sffc as code owners December 19, 2025 19:31

sffc reviewed Dec 19, 2025

View reviewed changes

robertbastian requested a review from sffc December 19, 2025 19:54

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Optimise the `Japanese` calendar #7323

Optimise the `Japanese` calendar #7323

robertbastian commented Dec 16, 2025 •

edited

Loading

Uh oh!

sffc commented Dec 18, 2025

Uh oh!

robertbastian commented Dec 18, 2025

Uh oh!

sffc left a comment

Uh oh!

robertbastian commented Dec 19, 2025

Uh oh!

sffc commented Dec 19, 2025

Uh oh!

Manishearth commented Dec 19, 2025 •

edited

Loading

Uh oh!

sffc commented Dec 19, 2025

Uh oh!

Manishearth commented Dec 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Optimise the Japanese calendar #7323

Are you sure you want to change the base?

Optimise the Japanese calendar #7323

Conversation

robertbastian commented Dec 16, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sffc commented Dec 18, 2025

Uh oh!

robertbastian commented Dec 18, 2025

Uh oh!

sffc left a comment

Choose a reason for hiding this comment

Uh oh!

robertbastian commented Dec 19, 2025

Uh oh!

sffc commented Dec 19, 2025

Uh oh!

Manishearth commented Dec 19, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

sffc commented Dec 19, 2025

Uh oh!

Manishearth commented Dec 19, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Optimise the `Japanese` calendar #7323

Optimise the `Japanese` calendar #7323

robertbastian commented Dec 16, 2025 •

edited

Loading

Manishearth commented Dec 19, 2025 •

edited

Loading