DR 18-0002: SML: Sorting CJK ideographic characters based on Japanese phonetics

Rex Jaeschke rex at RexJaeschke.com
Sun Jun 24 12:14:17 CEST 2018


Attached is the DR log entry with my proposed note included for review.

 

The other attachments are examples Murata-san produced when researching this topic. I have chosen to write a simple note *without* including any of his specific examples.

 

Rex

 

 

From: eb2mmrt at gmail.com <eb2mmrt at gmail.com> On Behalf Of MURATA Makoto
Sent: Wednesday, June 6, 2018 10:03 PM
To: Rex Jaeschke <Rex at rexjaeschke.com>
Subject: Examples for kana-based sorting

 

Rex,

 

I created four SML documents.  They share the same sharedStrings.xml.  These SML documents were created by modifying the value of the sortMethod attribute.  

 

The sharedStrings.xml shows some combinations of kana 

and CJK ideographic characters.

 

 

1) 正直 without Kana

2) 正直 with ショウジキ as Kana

3) 正直 with マサナオ as Kana

4) 政直 with マサナオ as Kana

5) 政直 without Kana

 

 

I also added two cells containing す and み, respectively.

 

The four values exhibit different behaviors.  Both sortMethod="none" and the absence of this attribute provides Kana based sorting.  In other words, if Kana is present, the CJK ideographic characters are ignored.  Meanwhile, sortMethod="pinYin" and sortMethod="stroke" appear to ignore Kana.

 

It appears that sortMethod="pinYin" is converted to 

sortMethod="stroke", when Excel saves the document.

 

Regards,
Makoto

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0001.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: sharedStrings.xml
Type: text/xml
Size: 835 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0001.xml>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Kana-BasedSortingStroke.xlsx
Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
Size: 7867 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0004.xlsx>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Kana-BasedSortingNone.xlsx
Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
Size: 7869 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0005.xlsx>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Kana-BasedSortingPinYin.xlsx
Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
Size: 7867 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0006.xlsx>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: Kana-BasedSorting.xlsx
Type: application/vnd.openxmlformats-officedocument.spreadsheetml.sheet
Size: 7855 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0007.xlsx>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: DR-18-0002.docx
Type: application/vnd.openxmlformats-officedocument.wordprocessingml.document
Size: 104533 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20180624/5cf85585/attachment-0001.docx>


More information about the sc34wg4 mailing list