From b98dec38e901b63b20e49ec1e9d272e34c863116 Mon Sep 17 00:00:00 2001 From: r12a Date: Thu, 11 Jul 2024 16:34:52 +0100 Subject: [PATCH 1/2] Add more niformation about chinese character sets & combining marks --- resources/index.html | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/resources/index.html b/resources/index.html index dd64bfc..37b7bac 100644 --- a/resources/index.html +++ b/resources/index.html @@ -174,11 +174,11 @@

Chinese Script Overview

Words are not separated by spaces or any other character. There is no case distinction. The visual forms of characters don't interact.

-

In its 'main' category, CLDR lists 2,210 characters for the Simplified Chinese orthography, and 2,180 for Traditional Chinese. Combined, this includes 3,026 unique characters, and an overlap of 1,064 characters. A working set of characters for modern Chinese may include 3 times this number, and the Unicode Standard includes approaching 100,000 Han characters, many of which are archaic or esoteric.

+

In its 'main' category, CLDR lists 2,210 characters for the Simplified Chinese orthography, and 2,180 for Traditional Chinese. Combined, this includes 3,026 unique characters, and an overlap of 1,064 characters. A working set of characters for modern Chinese may include 3 times this number, and number of characters in the Unicode Standard approaches 100,000 Han code points, many of which are archaic or esoteric. In fact, various regions define their own character sets, such as the 3,500 characters in the Tier I Table of 通用规范汉字表 (General Standard Chinese Characters) in Mainland China, the 4,808 characters in the Taiwanese 常用“国字”标准字体表 (Chart of Standard Forms of Common National Characters), the 4,759 characters in 常用字字形表 (Common Chinese Characters) in Hong Kong SAR, or the sets of 欢乐伙伴 ("Happy Buddy") characters for Singaporean primary schools.

The language is tonal, but the tones are not written explicitly.

-

Chinese has no combining marks, but has many punctuation marks. It also has a relatively complex set of typographic rules.

+

As a general rule, Chinese has no combining marks, but ideographic tone marks may be used in contexts such as university literature courses and chinese opera. On the other hand, Chinese has many punctuation marks. It also has a relatively complex set of typographic rules.

From 276afff4b1252347140e4ecbc8c36f67a5d29975 Mon Sep 17 00:00:00 2001 From: r12a Date: Fri, 12 Jul 2024 06:40:49 +0100 Subject: [PATCH 2/2] Fuqiao comments --- resources/index.html | 4 ++-- 1 file changed, 2 insertions(+), 2 deletions(-) diff --git a/resources/index.html b/resources/index.html index 37b7bac..6b9fc0d 100644 --- a/resources/index.html +++ b/resources/index.html @@ -174,11 +174,11 @@

Chinese Script Overview

Words are not separated by spaces or any other character. There is no case distinction. The visual forms of characters don't interact.

-

In its 'main' category, CLDR lists 2,210 characters for the Simplified Chinese orthography, and 2,180 for Traditional Chinese. Combined, this includes 3,026 unique characters, and an overlap of 1,064 characters. A working set of characters for modern Chinese may include 3 times this number, and number of characters in the Unicode Standard approaches 100,000 Han code points, many of which are archaic or esoteric. In fact, various regions define their own character sets, such as the 3,500 characters in the Tier I Table of 通用规范汉字表 (General Standard Chinese Characters) in Mainland China, the 4,808 characters in the Taiwanese 常用“国字”标准字体表 (Chart of Standard Forms of Common National Characters), the 4,759 characters in 常用字字形表 (Common Chinese Characters) in Hong Kong SAR, or the sets of 欢乐伙伴 ("Happy Buddy") characters for Singaporean primary schools.

+

A working set of characters for modern Chinese may include 10,000 characters, and number of characters in the Unicode Standard approaches 100,000 Han code points, many of which are archaic or esoteric. In fact, various regions define their own character sets, such as the 3,500 characters in the Tier I Table of 通用规范汉字表 (General Standard Chinese Characters) in Mainland China, the 4,808 characters in the Taiwanese 常用“国字”标准字体表 (Chart of Standard Forms of Common National Characters), the 4,759 characters in 常用字字形表 (Common Chinese Characters) in Hong Kong SAR, or the sets of 欢乐伙伴 ("Happy Buddy") characters for Singaporean primary schools.

The language is tonal, but the tones are not written explicitly.

-

As a general rule, Chinese has no combining marks, but ideographic tone marks may be used in contexts such as university literature courses and chinese opera. On the other hand, Chinese has many punctuation marks. It also has a relatively complex set of typographic rules.

+

As a general rule, Chinese has no combining marks, but ideographic tone marks may be used in contexts such as university literature courses and Chinese opera. On the other hand, Chinese has many punctuation marks. It also has a relatively complex set of typographic rules.