Rewriting the regexp for @ContentType in opc-contentTypes.xsd

John Haug johnhaug at exchange.microsoft.com
Wed Jun 17 11:51:33 CEST 2015


FYI, there seems to be an errant ‘b’ in:
<!ENTITY qdtext
b   "[\t&#x20;&#x21;&#x23;-\&#x5B;\&#x5D;-&#x7E;&obs-text;]">
    <!-- Note that [ (&#x5B;) and ] (&#x5D;) has to be escaped. -->
Probably just accidentally bumped the keyboard.

I can’t say a word about the RNG.  The published version of the RNG mirrors the XSD, but not now with the change to use a DTD in the XSD.

Also, I don’t recall that we ever discussed changing the definition of ST_Extension, but it’s different here from what is currently published.  Why the change?

John

From: eb2mmrt at gmail.com [mailto:eb2mmrt at gmail.com] On Behalf Of MURATA Makoto
Sent: Tuesday, June 16, 2015 10:02 PM
To: SC34
Subject: Re: Rewriting the regexp for @ContentType in opc-contentTypes.xsd

Folks,

Here is the XSD schema and RELAX NG schema.  I tried to make them more readable.

Regards,
Makoto


2015-06-13 12:28 GMT+09:00 MURATA Makoto <eb2m-mrt at asahi-net.or.jp<mailto:eb2m-mrt at asahi-net.or.jp>>:
John,

If we remove "\" preceding &#x5B; and that preceding\&#x5D;
we will have a syntax error and Xerces-J will report it.  This is
because &#x5B; is interpreted as "[" and &#x5D; is interpreted
as "]" immediately.  But I agree that a comment would be
useful.

Regards,
Makoto


2015-06-06 6:40 GMT+09:00 John Haug <johnhaug at exchange.microsoft.com<mailto:johnhaug at exchange.microsoft.com>>:
Apologies for the late e-mail on this.

Are there errant \ in defining the character ranges (23-5B, 5D-7E)?
<!ENTITY qdtext "[\t&#x20;&#x21;&#x23;-\&#x5B;\&#x5D;-&#x7E;&obs-text;]">

Compare to how a range is defined here:
<!ENTITY VCHAR "&#x21;-&#x7E;">

John

From: eb2mmrt at gmail.com<mailto:eb2mmrt at gmail.com> [mailto:eb2mmrt at gmail.com<mailto:eb2mmrt at gmail.com>] On Behalf Of MURATA Makoto
Sent: Friday, April 10, 2015 11:50 PM
To: SC34
Subject: Re: Rewriting the regexp for @ContentType in opc-contentTypes.xsd

Oops, the schema file was incorrect.

2015-04-11 12:54 GMT+09:00 MURATA Makoto <eb2m-mrt at asahi-net.or.jp<mailto:eb2m-mrt at asahi-net.or.jp>>:
Folks,

I have thoroughly rewritten the regular expression.  it
does not look like a regular expression but looks like a
grammar.  This is because I heavily used parsed entities
such as token, qdtext, and quoted-pair.

Attached please find the schema, and three test files.

I have not finished the RELAX NG schema.  It requires
manual conversion....

Regards,
Makoto



--

Praying for the victims of the Japan Tohoku earthquake

Makoto



--

Praying for the victims of the Japan Tohoku earthquake

Makoto



--

Praying for the victims of the Japan Tohoku earthquake

Makoto
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20150617/56dcd358/attachment-0001.html>


More information about the sc34wg4 mailing list