first-contributions icon indicating copy to clipboard operation
first-contributions copied to clipboard

Proper language codes in translated files

Open rammba opened this issue 2 months ago • 7 comments

We should try to set proper language code in translation files. This includes both README and docs translations.

🐞 Problem As I've already noticed in the issue for Bosnian language, some languages doesn't have it's own code in the file name, so it can cause confusion.

💡 Possible solutions Hello @Roshanjossey, maybe we can list all wrong codes in this issue and give a chance to the community to fix the namings. As a reference, we can use ISO 639 list. If a language has two-letter code, use it. If not, use the three-letter code. We can exclude non-regular languages from this list, like pirate and alien languages.

rammba avatar Oct 17 '25 18:10 rammba

Hi i would like to work on it

Vishwaspatel2401 avatar Oct 18 '25 04:10 Vishwaspatel2401

Hello @rammba,

I think README.mli.md is in Bambara, so the language code should be bm, I must first say that I don't know the language, I could be wrong, if there is no native speaker to help identify the language, we can use bm for the time being.

I checked the text in README using both Google Translate and tomedes (a website that claims to support 270 languages), and both detected that the text was in Bambara. (No other language identification website could identify this language, I understand why the list uses three question marks now)

Note: This is just a suggestion.

Sharl0tteIsTaken avatar Oct 27 '25 18:10 Sharl0tteIsTaken

@rammba, thank you for opening this issue.

I agree that we need to standardise language codes. I'll check other existing language code standards too.

One problem I see with this change is existing open issues about things to fix in translation files. All of these have links to which file to change and they will break. @Sharl0tteIsTaken also mentioned this.

My suggestion would be to keep a checklist to track whats updated and what's not and chip away at this gradually. For each language code that's updated, we update corresponding issues and update the checklist. What do you think?

Roshanjossey avatar Oct 27 '25 18:10 Roshanjossey

I think README.mli.md is in Bambara, so the language code should be bm

Hey @Sharl0tteIsTaken, since nobody here knows what README.mli.md represents, maybe we can leave it as it is now. Neither of us is a native speaker of Bambara, so we can't prove it's that language unfortunatelly.

For each language code that's updated, we update corresponding issues and update the checklist. What do you think?

Hey @Roshanjossey, my idea was to keep track of all needed changes in this issue, including checking of finished/fixed language codes. You can create separate issues for all the languages and let the community to fix names. Also, you can create issue when other issues for that language are fixed (alt text & bash codes for now). What do you think?

rammba avatar Oct 28 '25 00:10 rammba

Hey, assing to me. i would like to work on it.

Pranav-0440 avatar Nov 10 '25 18:11 Pranav-0440

Addition to my previous comment (https://github.com/firstcontributions/first-contributions/issues/105711#issuecomment-3429455018):

Since I'm not native Arabic speaker, maybe we should use different codes for already existing Arabic languages:

rammba avatar Dec 09 '25 20:12 rammba