unicodia Japanese: interesting numerals

{{nspk|sgw}} in Japanese

Apr 09 '25 21:04 Mercury13

More clear: 1.5M = 150·10,000?

Apr 09 '25 21:04 Mercury13

1M = 1,000,000 (M = million) so 1.5M = 1,500,000

In Japanese 1,000,000 = 100万 100,000 = 10万 10,000 = 1万 1,000 = 1千 100 = 百 10 = 十 1 = 一

Apr 10 '25 01:04 coolvitto

I haven't had much free time since my wife was hospitalized in February. Once things have settled down, I'll revisit the translation.

Apr 10 '25 01:04 coolvitto

Oh, thank you! I’ll address this.

Apr 10 '25 17:04 Mercury13

I realized that Sebat Bet Gurage is displayed in three places, so I moved it to centralized data.

Apr 10 '25 18:04 Mercury13

@coolvitto What about 1500? Is it better to write 1,500, 15百 or 1.5千?

Apr 10 '25 18:04 Mercury13

@coolvitto And do we need 15千人?

Apr 10 '25 19:04 Mercury13

And what’s better in English…

This?

Selection between comma and space? Or just space?

Apr 10 '25 19:04 Mercury13

What about 1500?

Numbers are usually used as is. Notations such as 1.5千 are only used for special purposes such as graphs.

Commas in numbers are the same as in English. 1500 = 1,500

Apr 11 '25 03:04 coolvitto

@coolvitto Commas same as English — I know. There are more interesting things. How to write “1.25 billion” in Japanese? Or “300 million”?

Apr 11 '25 10:04 Mercury13

…or “350 million”? Extremely large numbers: 350M = 35,000tt.

Apr 11 '25 11:04 Mercury13

Current messages are:

Chinese 1.35bn:（中国語（北方語）といくつかの姉妹言語からなる言語系統、2022 時点の 1.35bn） Spanish 600M:（2023 時点の 60000万） Russian 150M:（15000万は 2020 と同様にネイティブ） Sebat Bet 1.5…2.2M:（エチオピア、2022 時点の 150〜220万）

(I know, there’s no thousand comma, but I’ll add)

Apr 11 '25 16:04 Mercury13

Arabic: how to change? （38,000万は 2020s と同様にネイティブ）

Apr 11 '25 16:04 Mercury13

P.S. Is “neitibu” true Japanese term?

Apr 11 '25 16:04 Mercury13

Thai 60M: （2020s 時点の >6,000万）

Apr 11 '25 20:04 Mercury13

@coolvitto Need your help!

Apr 11 '25 22:04 Mercury13

In Japanese, "1.25 billion" would be "12億５千万"(or 12億5000万)

billion -> 10億 million -> 100万

“350 million” -> 3億5千万(or 3億5000万)

1,000,000,000 = billion = 10億 100,000,000 = 100 million = 1億 10,000,000 = 10 million = 1000万 (or 千万) 1,000,000 = million = 100万 (or 百万)

600M = 600 million 150M = 150 million The numbers are not incorrect, but in Japanese, once you get past the thousand mark, it becomes the "億". So, 600M = 6億 150M = 1億5千万

38,000万 = 3億8000万

In Japanese, the word "ネイティブ" (neitibu) is translated as "local person" or "indigenous person," but these days it is often pronounced "ネイティブ" in the English pronunciation.

Apr 12 '25 01:04 coolvitto

Interesting. I’ll try to program.

Apr 12 '25 07:04 Mercury13

@coolvitto Quickly added more steps and policy “except1”, got this way for now Spanish 600M:（2023 時点の 6億） Russian 150M:（15,000万は 2020 と同様にネイティブ） Chinese 1.35bn:（中国語（北方語）といくつかの姉妹言語からなる言語系統、2022 時点の 135千万）

As Unicodia has only two ranges, Sebat Bet and Rejang, I’ll suppress using smaller units in ranges, and that’s probably OK. Rejang 200…350k:（スマトラ、20〜35万）

Just my code does not support more that 1 unit — and maybe I’ll start from smaller units rather than from larger.

2020s is still hanging. Decade rather than year: if the language is big or scattered, several countries cannot conduct a census at once, and # of speakers is aggregated from different sources.

Apr 12 '25 08:04 Mercury13

Wrote Bengali 230M:（バングラデシュ、インド東部、2億3千万は 2021 と同様にネイティブ）

Apr 12 '25 08:04 Mercury13

And Chinese 1.35 bn is now 中国語（北方語）といくつかの姉妹言語からなる言語系統、2022 時点の 13億5千万）

Apr 12 '25 08:04 Mercury13

	<num-format dec-point="." thou-point="," thou-min-length="4" thou-min-length-dense="4">
		<imprecise>
			<fmt shift="0" text="{1}人" frac="never" />
			<fmt shift="3" text="{1}千" frac="avoid" />
			<fmt shift="4" text="{1}万" frac="never" bigger-unit="億" bigger-subshift="4" />
			<fmt shift="7" text="{1}千万" frac="except1" bigger-unit="億" bigger-subshift="1" />
			<fmt shift="8" text="{1}億" frac="never" />
		</imprecise>
	</num-format>

Apr 12 '25 12:04 Mercury13

@coolvitto 2020s is still hanging. Decade rather than year: if the language is big or scattered, several countries cannot conduct a census at once, and # of speakers is aggregated from different sources.

Apr 12 '25 13:04 Mercury13

@coolvitto Really need your help!

Apr 17 '25 06:04 Mercury13

@coolvitto Need your help!

May 15 '25 08:05 Mercury13

Is there anything I can help you with?

May 15 '25 16:05 coolvitto

The main task is… I rewrote auto-formatting # of speakers, and want you to check if OK. Writing approximate numbers is a really strange badly explored thing.

May 16 '25 12:05 Mercury13

@coolvitto TWO. How to write a decade in Japanese, e.g.:

100,000 speakers as of 2010
100,000 speakers as of 2010s

May 16 '25 12:05 Mercury13

@coolvitto Am i right: Chinese (>billion): （中国語（北方語）といくつかの姉妹言語からなる言語系統、2022 時点の 13億5千万人） English (hundreds of millions): （3億8千万人は 2021 と同様にネイティブ） Russian (a bit smaller): （1億5千万人は 2020 と同様にネイティブ） Ukrainian (tens of millions): （2024 時点の 3,900万人） Khmer (smaller): (カンボジア、2007 年現在 1,600 万人) Manipuri (<2M): （インド東部、2011 時点の >170万人） Bantawa (<200k): （インド北東部、ネパール、2011 時点の 17万人） Bhumij (tens of thousands): （インド東部、2011 時点の 27千人） And Cherokee (<10,000): （USA、2019 時点の <2,100人）

May 16 '25 12:05 Mercury13

Review of translation — that’s a less urgent job. Internationalization of approximate numbers is a priority.

May 16 '25 12:05 Mercury13