unicodia icon indicating copy to clipboard operation
unicodia copied to clipboard

Japanese: interesting numerals

Open Mercury13 opened this issue 9 months ago • 62 comments

{{nspk|sgw}} in Japanese Image

Mercury13 avatar Apr 09 '25 21:04 Mercury13

More clear: 1.5M = 150·10,000?

Mercury13 avatar Apr 09 '25 21:04 Mercury13

1M = 1,000,000 (M = million) so 1.5M = 1,500,000

In Japanese 1,000,000 = 100万 100,000 = 10万 10,000 = 1万 1,000 = 1千 100 = 百 10 = 十 1 = 一

coolvitto avatar Apr 10 '25 01:04 coolvitto

I haven't had much free time since my wife was hospitalized in February. Once things have settled down, I'll revisit the translation.

coolvitto avatar Apr 10 '25 01:04 coolvitto

Oh, thank you! I’ll address this.

Mercury13 avatar Apr 10 '25 17:04 Mercury13

I realized that Sebat Bet Gurage is displayed in three places, so I moved it to centralized data.

Mercury13 avatar Apr 10 '25 18:04 Mercury13

@coolvitto What about 1500? Is it better to write 1,500, 15百 or 1.5千?

Mercury13 avatar Apr 10 '25 18:04 Mercury13

@coolvitto And do we need 15千人?

Mercury13 avatar Apr 10 '25 19:04 Mercury13

And what’s better in English…

This? Image

Selection between comma and space? Or just space?

Mercury13 avatar Apr 10 '25 19:04 Mercury13

What about 1500?

Numbers are usually used as is. Notations such as 1.5千 are only used for special purposes such as graphs.

Commas in numbers are the same as in English. 1500 = 1,500

coolvitto avatar Apr 11 '25 03:04 coolvitto

@coolvitto Commas same as English — I know. There are more interesting things. How to write “1.25 billion” in Japanese? Or “300 million”?

Mercury13 avatar Apr 11 '25 10:04 Mercury13

…or “350 million”? Extremely large numbers: 350M = 35,000tt.

Mercury13 avatar Apr 11 '25 11:04 Mercury13

Current messages are:

Chinese 1.35bn:(中国語(北方語)といくつかの姉妹言語からなる言語系統、2022 時点の 1.35bn) Spanish 600M:(2023 時点の 60000万) Russian 150M:(15000万 は 2020 と同様にネイティブ) Sebat Bet 1.5…2.2M:(エチオピア、2022 時点の 150〜220万)

(I know, there’s no thousand comma, but I’ll add)

Mercury13 avatar Apr 11 '25 16:04 Mercury13

Arabic: how to change? (38,000万 は 2020s と同様にネイティブ)

Mercury13 avatar Apr 11 '25 16:04 Mercury13

P.S. Is “neitibu” true Japanese term?

Mercury13 avatar Apr 11 '25 16:04 Mercury13

Thai 60M: (2020s 時点の >6,000万)

Mercury13 avatar Apr 11 '25 20:04 Mercury13

@coolvitto Need your help!

Mercury13 avatar Apr 11 '25 22:04 Mercury13

In Japanese, "1.25 billion" would be "12億5千万"(or 12億5000万)

billion -> 10億 million -> 100万

“350 million” -> 3億5千万(or 3億5000万)

1,000,000,000 = billion = 10億 100,000,000 = 100 million = 1億 10,000,000 = 10 million = 1000万 (or 千万) 1,000,000 = million = 100万 (or 百万)

600M = 600 million 150M = 150 million The numbers are not incorrect, but in Japanese, once you get past the thousand mark, it becomes the "億". So, 600M = 6億 150M = 1億5千万

38,000万 = 3億8000万

In Japanese, the word "ネイティブ" (neitibu) is translated as "local person" or "indigenous person," but these days it is often pronounced "ネイティブ" in the English pronunciation.

coolvitto avatar Apr 12 '25 01:04 coolvitto

Interesting. I’ll try to program.

Mercury13 avatar Apr 12 '25 07:04 Mercury13

@coolvitto Quickly added more steps and policy “except1”, got this way for now Spanish 600M:(2023 時点の 6億) Russian 150M:(15,000万 は 2020 と同様にネイティブ) Chinese 1.35bn:(中国語(北方語)といくつかの姉妹言語からなる言語系統、2022 時点の 135千万)

As Unicodia has only two ranges, Sebat Bet and Rejang, I’ll suppress using smaller units in ranges, and that’s probably OK. Rejang 200…350k:(スマトラ、20〜35万)

Just my code does not support more that 1 unit — and maybe I’ll start from smaller units rather than from larger.

2020s is still hanging. Decade rather than year: if the language is big or scattered, several countries cannot conduct a census at once, and # of speakers is aggregated from different sources.

Mercury13 avatar Apr 12 '25 08:04 Mercury13

Wrote Bengali 230M:(バングラデシュ、インド東部、2億3千万 は 2021 と同様にネイティブ)

Mercury13 avatar Apr 12 '25 08:04 Mercury13

And Chinese 1.35 bn is now 中国語(北方語)といくつかの姉妹言語からなる言語系統、2022 時点の 13億5千万)

Mercury13 avatar Apr 12 '25 08:04 Mercury13

	<num-format dec-point="." thou-point="," thou-min-length="4" thou-min-length-dense="4">
		<imprecise>
			<fmt shift="0" text="{1}人" frac="never" />
			<fmt shift="3" text="{1}千" frac="avoid" />
			<fmt shift="4" text="{1}万" frac="never" bigger-unit="億" bigger-subshift="4" />
			<fmt shift="7" text="{1}千万" frac="except1" bigger-unit="億" bigger-subshift="1" />
			<fmt shift="8" text="{1}億" frac="never" />
		</imprecise>
	</num-format>

Mercury13 avatar Apr 12 '25 12:04 Mercury13

@coolvitto 2020s is still hanging. Decade rather than year: if the language is big or scattered, several countries cannot conduct a census at once, and # of speakers is aggregated from different sources.

Mercury13 avatar Apr 12 '25 13:04 Mercury13

@coolvitto Really need your help!

Mercury13 avatar Apr 17 '25 06:04 Mercury13

@coolvitto Need your help!

Mercury13 avatar May 15 '25 08:05 Mercury13

Is there anything I can help you with?

coolvitto avatar May 15 '25 16:05 coolvitto

The main task is… I rewrote auto-formatting # of speakers, and want you to check if OK. Writing approximate numbers is a really strange badly explored thing.

Mercury13 avatar May 16 '25 12:05 Mercury13

@coolvitto TWO. How to write a decade in Japanese, e.g.:

  • 100,000 speakers as of 2010
  • 100,000 speakers as of 2010s

Mercury13 avatar May 16 '25 12:05 Mercury13

@coolvitto Am i right: Chinese (>billion): (中国語(北方語)といくつかの姉妹言語からなる言語系統、2022 時点の 13億5千万人) English (hundreds of millions): (3億8千万人 は 2021 と同様にネイティブ) Russian (a bit smaller): (1億5千万人 は 2020 と同様にネイティブ) Ukrainian (tens of millions): (2024 時点の 3,900万人) Khmer (smaller): (カンボジア、2007 年現在 1,600 万人) Manipuri (<2M): (インド東部、2011 時点の >170万人) Bantawa (<200k): (インド北東部、ネパール、2011 時点の 17万人) Bhumij (tens of thousands): (インド東部、2011 時点の 27千人) And Cherokee (<10,000): (USA、2019 時点の <2,100人)

Mercury13 avatar May 16 '25 12:05 Mercury13

Review of translation — that’s a less urgent job. Internationalization of approximate numbers is a priority.

Mercury13 avatar May 16 '25 12:05 Mercury13