CLDR-18144 Tooling for improving formats of locales, especially for menus

Open macchiati opened this issue 9 months ago • 6 comments

CLDR-18144

  • [ ] This PR completes the ticket.

Tool for investigating which languages / regions could have core and extension values. Prints lists of possible breakdowns, which then need to be filtered manually.
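
To make the idea of a "possible breakdown" concrete, here is a minimal sketch. It is not the tool's actual algorithm; the function name `candidate_breakdowns`, the trailing-word heuristic, and the sample names are all assumptions made for illustration. It proposes a core/extension split whenever the last word of a multi-word name is itself a name in the list, which is why the output would still need manual filtering.

```python
from collections import defaultdict


def candidate_breakdowns(names):
    """Hypothetical heuristic: propose core -> [extensions] splits.

    A multi-word name is a candidate when its last word is itself one of
    the known names, e.g. "Sorani Kurdish" -> core "Kurdish", ext "Sorani".
    This is English-specific and deliberately naive.
    """
    known = set(names)
    groups = defaultdict(list)
    for name in names:
        words = name.split()
        if len(words) < 2:
            continue  # single-word names have nothing to split off
        core, extension = words[-1], " ".join(words[:-1])
        if core in known:
            groups[core].append(extension)
    return dict(groups)


# Illustrative sample only; not extracted from CLDR data.
sample = ["French", "Kurdish", "Sorani Kurdish", "German", "Low German"]

for core, extensions in sorted(candidate_breakdowns(sample).items()):
    print(core, extensions)
```

A heuristic like this over-generates on purpose: it only surfaces candidates, and a person decides which splits make sense.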

Resulting recommendations in https://docs.google.com/spreadsheets/d/1bbC5iOFVK0A4Hla-1xIGtgwAAOqtj4PWb0llFc-sgOY/edit?gid=386214050#gid=386214050

ALLOW_MANY_COMMITS=true

macchiati avatar Mar 27 '25 16:03 macchiati

Will languages be able to reorder the extension and the core parts? Do we know if there are languages where, for certain combinations, the extension may appear on different sides of the core word (kinda like in French, where most adjectives follow the noun while a handful precede it)? https://en.wikipedia.org/wiki/French_grammar#Adjectives

AEApple avatar Apr 12 '25 00:04 AEApple

The extension uses the same mechanism as a language combined with its script, region, variant, keys, etc. That is, in a menu where the core/extension mechanism is used (it is optional), someone would see the following (look at the Kurdish lines for the differences):

NEW

...
French
Kurdish (Kurmanji)
Kurdish (Sorani)
Serbian (Cyrillic)
Serbian (Latin)
Swedish (Sweden)
Swedish (Finland)
...

OLD

...
French
Kurdish
Serbian (Cyrillic)
Serbian (Latin)
Sorani Kurdish
Swedish (Sweden)
Swedish (Finland)
...

See also the regions' part of the spreadsheet in the ticket.
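
The NEW and OLD lists above can be sketched in a few lines. This is an illustration under stated assumptions, not CLDR code: `display_name` is a hypothetical helper, and the `"{0} ({1})"` default is assumed to play the role of the locale's display pattern. Because the pattern is just a string with two placeholders, where the extension lands relative to the core is decided by the pattern rather than by the data.

```python
def display_name(core, extension=None, pattern="{0} ({1})"):
    """Hypothetical helper: combine a core name with an optional extension.

    {0} is the core and {1} is the extension, mirroring how a language is
    combined with its script or region in the menus above.
    """
    if extension is None:
        return core
    return pattern.format(core, extension)


# (core, extension) pairs taken from the NEW list in this comment.
entries = [
    ("French", None),
    ("Kurdish", "Kurmanji"),
    ("Kurdish", "Sorani"),
    ("Serbian", "Cyrillic"),
    ("Serbian", "Latin"),
]

# NEW: names sharing a core sort next to each other.
new_menu = sorted(display_name(core, ext) for core, ext in entries)

# OLD: "Sorani Kurdish" sorts under S, away from "Kurdish".
old_menu = sorted(
    ["French", "Kurdish", "Serbian (Cyrillic)", "Serbian (Latin)", "Sorani Kurdish"]
)

print(new_menu)
print(old_menu)
```

Passing a different pattern, for example `display_name("Kurdish", "Sorani", pattern="{1} {0}")`, puts the extension in front of the core instead.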

macchiati avatar Apr 13 '25 01:04 macchiati

The tool is just used to help identify items that might need splitting. It can be cleaned up after submission starts.

On Tue, Apr 15, 2025, 00:50 Shane F. Carr wrote:

I looked at the spreadsheet and there are some interesting findings that should be discussed.

The tool itself looks like a proof-of-concept; is it your intent to eventually land this as part of a test?

View the review on GitHub: https://github.com/unicode-org/cldr/pull/4537#pullrequestreview-2764825391

macchiati avatar Apr 15 '25 12:04 macchiati

@macchiati do you want to land this code? Have there been any changes to the approach since you posted this?

conradarcturus avatar Jun 19 '25 00:06 conradarcturus

I'd like to go ahead and land the tool, because I think it could be useful in the future.

macchiati avatar Jul 28 '25 16:07 macchiati

This is just tooling, with no effect on the release. There is one unrelated test failure; I strongly suspect that rebasing will fix it, but I don't have time now, so I'll do that when I get back.

macchiati avatar Aug 20 '25 21:08 macchiati