feat(i18n): normalize translation files structure and patch zh-TW translations by xiaoran007 · Pull Request #247 · jundot/omlx

xiaoran007 · 2026-03-15T21:47:53Z

This PR standardizes the structure of all i18n translation files under omlx/admin/i18n/ by treating en.json as the single source of truth.

Over time, keys were added or updated in en.json without being consistently synchronized to other locale files, which led to key drift and structural inconsistencies across translations. This PR addresses that maintenance issue by introducing a normalization utility and applying it to the current locale files. It also fills in missing entries for Traditional Chinese.

Changes made

1. Added `scripts/normalize_i18n.py`

This developer utility normalizes locale files against en.json.

It:

uses en.json as the baseline schema
aligns key structure and ordering across locale files
fills missing keys with English fallback values
removes deprecated extra keys
Update based on review: The script now strictly relies on the standard Python json library instead of regex for better robustness and maintainability. All JSON files are now formatted with a standard 2-space indentation.

2. Normalized existing locale files

Applied the normalization script to:

zh.json
zh-TW.json
ja.json
ko.json

This produces a one-time large diff because the current files are being brought into a consistent canonical structure.

3. Completed missing Traditional Chinese entries

I also identified 10 keys missing from zh-TW.json that already existed in zh.json, and added Traditional Chinese translations for them.

Notes for reviewers

The large diff is primarily caused by the initial normalization pass (reordering / restructuring to match en.json), not by broad semantic translation changes. This should be mostly a one-time cleanup and should reduce churn in future i18n updates.

Related Issues

None.

Type of Change

New feature / Enhancement (non-breaking change which adds functionality or improves DX)

Checklist:

No tests required for i18n JSON files/developer scripts, and I have performed a self-review

…matting

jundot

Thanks for the normalization work and the zh-TW translations. Couple of things i noticed:

1. the normalize script parses JSON with regex instead of a JSON parser

scripts/normalize_i18n.py processes locale files line by line using a regex pattern (r'^(\s*)"([^"]+)"(\s*:\s*)(.*)$'). This works for the current flat key-value structure but it's fragile. If the i18n files ever get nested objects or arrays, this breaks silently. Using json.load() to read and json.dump() with sort_keys (or a custom key order from en.json) would be more robust and still preserve the key ordering goal.

2. 2-space to 4-space indent change

The normalization changes all locale files from 2-space to 4-space indentation because it mirrors en.json's formatting. But 2-space is the more common convention for JSON files. If the goal is consistency across all locale files, would it make more sense to update en.json to use 2-space instead? That way the diff for ja/ko/zh files becomes purely key reordering without the indent noise.

xiaoran007 · 2026-03-16T16:40:14Z

Thanks for the great feedback! I agree with both points.

For the Regex Parsing: I've removed the regex logic entirely. The normalize_i18n.py script now strictly uses json library.
For the Indentation: I've updated en.json to use 2-space indentation, and configured the script to output all files with indent=2. This significantly reduced the diff noise and properly aligns everything with standard JSON conventions.

(bwt, I originally used regex trying to preserve the empty lines used for visual grouping in en.json, but I agree that standard and safe JSON serialization is much more important for maintainability).

I've pushed the updated commits.

jundot · 2026-03-21T16:57:11Z

Looks good, thanks for the updates. Merging this now.

I noticed a couple minor things but i'll handle them in a follow-up commit:

en.json still has blank lines between sections but the other locale files don't (since json.dump strips them). Will align these.
The script silently drops keys that exist in locale files but not in en.json. Will add a warning for that.

xiaoran007 added 5 commits March 15, 2026 17:17

chore: add i18n normalization script

34555af

feat(i18n): normalize translation files structure against en.json

2068bcd

feat(i18n): update missing translations in zh-TW.json

9a6bb43

chore: update normalization script to preserve en.json empty line for…

2565463

…matting

feat(i18n): apply empty line formatting to translation files

96a1bad

jundot reviewed Mar 16, 2026

View reviewed changes

chore: revert to pure json library with 2-space indentation

745e455

jundot force-pushed the main branch 7 times, most recently from f6faf2f to c2beead Compare March 21, 2026 05:58

jundot merged commit bf3a3f2 into jundot:main Mar 21, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(i18n): normalize translation files structure and patch zh-TW translations#247

feat(i18n): normalize translation files structure and patch zh-TW translations#247
jundot merged 6 commits intojundot:mainfrom
xiaoran007:feat(i18n)/normalization

xiaoran007 commented Mar 15, 2026 •

edited

Loading

Uh oh!

jundot left a comment

Uh oh!

xiaoran007 commented Mar 16, 2026

Uh oh!

jundot commented Mar 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

xiaoran007 commented Mar 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Changes made

1. Added scripts/normalize_i18n.py

2. Normalized existing locale files

3. Completed missing Traditional Chinese entries

Notes for reviewers

Related Issues

Type of Change

Checklist:

Uh oh!

jundot left a comment

Choose a reason for hiding this comment

Uh oh!

xiaoran007 commented Mar 16, 2026

Uh oh!

jundot commented Mar 21, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

xiaoran007 commented Mar 15, 2026 •

edited

Loading

1. Added `scripts/normalize_i18n.py`