=?Windows-1252?Q?RE:_DR_09-0061_=97_Shared_MLs, _Shared_Simple_Types:_Cons?= train ST_Panose value set
Chris Rae
Chris.Rae at microsoft.com
Tue Oct 26 23:24:50 CEST 2010
I may have spoken too soon on this one. It would appear there are some values which are acceptable according to the Panose spec which this RegExp does not regard as valid. For example, 05000000000000000000. This seems to be valid according to section 1.5 of the Panose spec (http://www.panose.com/ProductsServices/pan1.aspx) but doesn't match this RegEx. I've pasted the section below.
Suzuki-san - is there a chance you could check that I'm right in this assertion?
Chris
--
Panose: 1.5 Digit values of 0 and 1
The reader will notice that the value 0 and 1 are defined as Any and No Fit for every digit in the PANOSE system. These have specific meanings to the mapper. 0 means match that digit with any available digit. This allows the mapper to handle distortable typefaces such as multiple master fonts in which, for example, weights may be variable or serifs may change. 1 means that the item being classified does not fit within the present system. There are two possible causes of this. First is that there has been no work done on that family of faces, for example at the present time an Arabic cursive font would have the PANOSE number 1 1 1 1 1 1 1 1 1 1 as there has as yet been no work done on Arabic fonts.
-----Original Message-----
From: Chris Rae [mailto:Chris.Rae at microsoft.com]
Sent: 25 October 2010 15:37
To: e-SC34-WG4 at ecma-international.org
Cc: suzuki toshiya (mpsuzuki at hiroshima-u.ac.jp)
Subject: DR 09-0061 — Shared MLs, Shared Simple Types: Constrain ST_Panose value set
http://cid-c8ba0861dc5e4adc.office.live.com/view.aspx/Public%20Documents/2009/DR-09-0061.docx
This is a very simple one indeed. We talked about this at some length at Tokyo - I was under the impression that certain valid Panose values were not accepted by Suzuki-san's RegEx. This, it turns out, was a mistake - I was truncating unused zeros from the front of the strings and in actual fact this is not done (in Office, or in the Panose specification itself). Panose consists of ten byte couplets denoted in hex where leading zeros are always included, even on the first byte.
I used http://www.regexplanet.com/simple/index.html to validate Suzuki-san's sample against some Word documents and it looks like this RegEx does indeed work fine (some sample values: 020F0502020204030204, 02010600030101010101, 020B0604020202020204, 02020603050405020304, 02040503050406030204).
I think we can accept the original solution as proposed by the submitter in the DR. Let's discuss on the next call.
Chris
More information about the sc34wg4
mailing list