DR 09-0040: a small test to clarify "chinese language"

suzuki toshiya mpsuzuki at hiroshima-u.ac.jp
Tue May 17 05:38:03 CEST 2011


Dear Chris,

Sorry for my lated action, and thank you for comment about my figure
about the font slot selection algorithm. If anybody has a comment
to improve the figure for the official inclusion, please give me.

In my comment posted for DR 09-0040, I requested a clarification
of the coverage about Chinese locale that changes the accented Latin
for PinYin; "zh-CN" and "zh-TW" will be included but how about
"zh-HK", "zh-MO", "zh-SG" etc etc.

Attached docx is a testing document repeating a text with various
locale specification:

<w:p>
 <w:pPr>
  <w:r w:rsidRPr="008B6412">
   <w:rPr>
    <w:rFonts w:ascii="Courier New" w:eastAsia="SimSun-18030" w:hAnsi="Arial" w:cs="Arabic Typesetting" w:hint="default"/>
    <w:sz w:val="16"/>
    <w:szCs w:val="16"/>
    <w:lang w:val="en-US" w:eastAsia="en-US" w:bidi="en-US"/>
   </w:rPr>
   <w:t>(en-US,hint=default)  Hànyǔ Pīnyīn,uüÜ,úǘǗ‘’‚‛“”„…()[]ΑΒΓЀЁЂ</w:t>
  </w:r>
 </w:pPr>
</w:p>

3 different settings of <w:lang> elements are tested:
    <w:lang w:val="zh-CN" w:eastAsia="en-US" w:bidi="en-US"/>  (questionable usage)
    <w:lang w:val="en-US" w:eastAsia="zh-CN" w:bidi="en-US"/>  (designed usage)
    <w:lang w:val="en-US" w:eastAsia="en-US" w:bidi="zh-CN"/>  (maybe invalid usage)
and various around east/south-east asian territories (CN, TW, HK, MO, SG, and
JP, KR, MY, TH, VN) are tested.

>From the result, the Chinese language is meaningful only when it is set to eastAsia
attribute, and the coverage of Chinese languages would be defined some registered list:
zh-CN, zh-TW, zh-HK, zh-MO, zh-SG are valid, but other combinations with different
territories (like zh-MY) are invalid.
I wish if the list of registered Chinese language is provided in the spec, even if
it is classified as non-normative. In Bidi support clarification, the list of registered
languages would be helpful too (e.g. w:bidi=ar-IQ, or =fa-IR makes the paragraph to
right-to-left mode, but ar-IR cannot).

Regards,
suzuki toshiya, Hiroshima University, Japan
-------------- next part --------------
A non-text attachment was scrubbed...
Name: pinyin-test.docx
Type: application/octet-stream
Size: 14080 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20110517/42703aba/attachment-0001.obj>


More information about the sc34wg4 mailing list