DR 18-0002: SML: Sorting CJK ideographic characters based on Japanese phonetics
Rex Jaeschke
rex at RexJaeschke.com
Sun Jun 24 12:14:17 CEST 2018
Attached is the DR log entry with my proposed note included for review.
The other attachments are examples Murata-san produced when researching this topic. I have chosen to write a simple note *without* including any of his specific examples.
Rex
From: eb2mmrt at gmail.com <eb2mmrt at gmail.com> On Behalf Of MURATA Makoto
Sent: Wednesday, June 6, 2018 10:03 PM
To: Rex Jaeschke <Rex at rexjaeschke.com>
Subject: Examples for kana-based sorting
Rex,
I created four SML documents. They share the same sharedStrings.xml. These SML documents were created by modifying the value of the sortMethod attribute.
The sharedStrings.xml shows some combinations of kana
and CJK ideographic characters.
1) 正直 without Kana
2) 正直 with ショウジキ as Kana
3) 正直 with マサナオ as Kana
4) 政直 with マサナオ as Kana
5) 政直 without Kana
I also added two cells containing す and み, respectively.
The four values exhibit different behaviors. Both sortMethod="none" and the absence of this attribute provides Kana based sorting. In other words, if Kana is present, the CJK ideographic characters are ignored. Meanwhile, sortMethod="pinYin" and sortMethod="stroke" appear to ignore Kana.
It appears that sortMethod="pinYin" is converted to
sortMethod="stroke", when Excel saves the document.
Regards,
Makoto
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: sharedStrings.xml
Type: text/xml
Size: 835 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0001.xml>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Kana-BasedSortingStroke.xlsx
Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
Size: 7867 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0004.xlsx>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Kana-BasedSortingNone.xlsx
Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
Size: 7869 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0005.xlsx>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Kana-BasedSortingPinYin.xlsx
Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
Size: 7867 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0006.xlsx>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Kana-BasedSorting.xlsx
Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
Size: 7855 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0007.xlsx>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: DR-18-0002.docx
Type: application/vnd.openxmlformats-officedocument.wordprocessingml.document
Size: 104533 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0001.docx>
More information about the sc34wg4
mailing list