<div dir="ltr">Francis,<div><br></div><div>Thanks for your comments. I now understand.</div><div><br></div><div>I did some experiments. Excel appears to escape an underscore </div><div>only when it is the first character of a string matching _[0-9a-zA-F]{4}_</div><div><br></div><div>Regards,</div><div>Makoto</div><div><br></div><div><div><div> <si></div><div> <t>SW_x005F_x3850_CPU</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_x005F_x3850_CPU</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_xzxcv</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_xzxcv_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_xzxcwev_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_xFFFFFF_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_x2000B_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_x3000</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_x005F_x3000_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_x005F_xFFFF_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_xFF_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_x0F_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_xF_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_xG_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_xGG_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_xGGG_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_xGGGG_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_x000G_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_xFFF_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_x005F_xffff_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div><div> <si></div><div> <t>_x005F_xf3f2_</t></div><div> <phoneticPr fontId="1"/></div><div> </si></div></div></div></div><div class="gmail_extra"><br><div class="gmail_quote">2017-11-09 23:50 GMT+09:00 Francis Cave <span dir="ltr"><<a href="mailto:francis@franciscave.com" target="_blank">francis@franciscave.com</a>></span>:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div lang="EN-GB" link="blue" vlink="purple"><div class="m_-5140493970415636338WordSection1"><p class="MsoNormal"><span>Murata-san<u></u><u></u></span></p><p class="MsoNormal"><span><u></u> <u></u></span></p><p class="MsoNormal"><span>I think that this DR is asking how to serialise the literal string “SW_x3850_CPU”, not “SW</span><span lang="EN-US" style="font-family:"Malgun Gothic",sans-serif">㡐</span><span style="font-family:"Malgun Gothic",sans-serif">CPU</span><span>”. If “_xHHHH_” is interpreted as the Unicode character #xHHHH, any literal string in the form “_xHHHH_” has to have the initial “_” escaped, which is what Charlie is saying that Excel does. But does this mean that “_” is always escaped by Excel, or only escaped in certain contexts, such as if followed by “x”? Does this need to be tested? <u></u><u></u></span></p><p class="MsoNormal"><span><u></u> <u></u></span></p><p class="MsoNormal"><span>Kind regards,<u></u><u></u></span></p><p class="MsoNormal"><span><u></u> <u></u></span></p><p class="MsoNormal"><span>Francis<u></u><u></u></span></p><p class="MsoNormal"><span><u></u> <u></u></span></p><p class="MsoNormal"><span><u></u> <u></u></span></p><p class="MsoNormal"><span><u></u> <u></u></span></p><p class="MsoNormal"><b><span lang="EN-US">From:</span></b><span lang="EN-US"> <a href="mailto:eb2mmrt@gmail.com" target="_blank">eb2mmrt@gmail.com</a> [mailto:<a href="mailto:eb2mmrt@gmail.com" target="_blank">eb2mmrt@gmail.com</a>] <b>On Behalf Of </b>MURATA Makoto<br><b>Sent:</b> 09 November 2017 00:55<br><b>To:</b> SC 34 WG4 <<a href="mailto:e-SC34-WG4@ecma-international.org" target="_blank">e-SC34-WG4@ecma-<wbr>international.org</a>><br><b>Subject:</b> Re: DR-16-0022: Shared ML: Escaping strings in ST_Xstring<u></u><u></u></span></p><div><div class="h5"><p class="MsoNormal"><u></u> <u></u></p><div><p class="MsoNormal">>§22.9.2.19, “ST_Xstring (Escaped String)” says:<br>><br>>For all characters that cannot be represented in XML as defined by the <u></u><u></u></p><div><p class="MsoNormal">>XML 1.0 specification, the characters are escaped using the Unicode <u></u><u></u></p></div><div><p class="MsoNormal">>numerical character representation escape character format _xHHHH_, where H<br>>represents a hexadecimal character in the character's value. <u></u><u></u></p></div><div><p class="MsoNormal">>[Example: The Unicode character 8 is not permitted<br>> in an XML 1.0 document, so it must be escaped as _x0008_. end example]<br><br>> But it's not clear from this if all such combinations should be escaped?<br>> or just those in the range [001-031]. Excel itself handles such sequences by<u></u><u></u></p></div><div><p class="MsoNormal">> escaping the first underscore but unfortunately other consumers such as <u></u><u></u></p></div><div><p class="MsoNormal">> OpenOffice do not remove the escaping so I think this needs clarifying.<u></u><u></u></p><div><p class="MsoNormal"><u></u> <u></u></p></div><div><p class="MsoNormal"><u></u> <u></u></p></div><div><p class="MsoNormal">W3C XML clearly defines which character is legal. We should <u></u><u></u></p></div><div><p class="MsoNormal">mention <a name="m_-5140493970415636338_wf-Legalchar" id="m_-5140493970415636338gmail-wf-Legalchar"><b><span style="font-size:13.5pt;font-family:"Arial",sans-serif;color:black">Well-formedness constraint: Legal Character.</span></b></a><u></u><u></u></p></div><div><p class="MsoNormal"><u></u> <u></u></p></div><p class="MsoNormal"><a href="https://www.w3.org/TR/2006/REC-xml-20060816/#wf-Legalchar" target="_blank">https://www.w3.org/TR/2006/<wbr>REC-xml-20060816/#wf-Legalchar</a><u></u><u></u></p></div><div><p class="MsoNormal"><u></u> <u></u></p></div><div><p class="MsoNormal">Or, does this DR ask how we can represent a literal such as "_x2345"?<u></u><u></u></p></div><div><p class="MsoNormal"><u></u> <u></u></p></div><div><p class="MsoNormal">Regards,<u></u><u></u></p></div><div><p class="MsoNormal">Makoto<u></u><u></u></p></div></div><div><p class="MsoNormal"><u></u> <u></u></p><div><p class="MsoNormal">2016-12-07 5:20 GMT+09:00 Rex Jaeschke <<a href="mailto:rex@rexjaeschke.com" target="_blank">rex@rexjaeschke.com</a>>:<u></u><u></u></p><blockquote style="border:none;border-left:solid #cccccc 1.0pt;padding:0cm 0cm 0cm 6.0pt;margin-left:4.8pt;margin-right:0cm"><p class="MsoNormal" style="margin-bottom:12.0pt">Here's a new DR from Charlie.<br><span style="color:#888888"><br><span class="m_-5140493970415636338hoenzb">Rex</span></span><u></u><u></u></p></blockquote></div><p class="MsoNormal"><br><br clear="all"><u></u><u></u></p><div><p class="MsoNormal"><u></u> <u></u></p></div><p class="MsoNormal">-- <u></u><u></u></p><div><p class="MsoNormal"><br>Praying for the victims of the Japan Tohoku earthquake<br><br>Makoto<u></u><u></u></p></div></div></div></div></div></div></blockquote></div><br><br clear="all"><div><br></div>-- <br><div class="gmail_signature" data-smartmail="gmail_signature"><br>Praying for the victims of the Japan Tohoku earthquake<br><br>Makoto</div>
</div>