Strings referencing to OPC parts (1)

MURATA Makoto eb2m-mrt at asahi-net.or.jp
Tue Aug 24 01:04:46 CEST 2010


I guess that more than one step is required for the
conversion from "H-identifier" to "Z-identifiers".

Can "H-identifiers" appear in 29500 documents?
I guess so when we reference from an OPC
part from another OPC part.

2010/8/24 Chris Rae <Chris.Rae at microsoft.com>:
> Hi Murata-san - this is a great list of questions and I think it'll help to frame discussion. Quick question: Do your "H-identifiers" ever appear in an IS 29500 file, or are they converted into Z-identifiers by an application before the file is created?
>
> Chris
>
> -----Original Message-----
> From: MURATA Makoto (FAMILY Given) [mailto:eb2m-mrt at asahi-net.or.jp]
> Sent: 21 August 2010 06:07
> To: e-SC34-WG4 at ecma-international.org
> Subject: Strings referencing to OPC parts (1)
>
> Dear colleagues,
>
> I think that the terminology in Part 2 is entirely broken.  So, let's not use any term in Part 2 first and make sure we have the same understanding.
>
> First, application programmers  and format designers use strings for identifying OPC parts.  These strings are for human users rather than protocols.  (The syntax of a programming language might force a programmer to introduce some escaping to such strings, though.) Let's call such strings "H-identifiers".
>
> Second, eventually an OPC part is represented by one or more files in a ZIP archive.  Such files also have strings have names.  Let's call such strings "Z-identifiers".
>
> Q0. Do H-identifiers begin with "/" always?
>
> Q1. Can an H-identifier contain "//"?
>
> Q2. Can an H-identifier end with "/"?
>
> Q3. Can an H-identifier contain non-ASCII characters?
>
> Q4. Can an H-identifier contain the space character?
>
> Q5. Can an H-identifier contain "<" (U+003C), ">" (U+003E) and '"' (U+0022)?
>
> Q6. Can an H-identifier contain unwise characters "\" (U+005C), "^" (U+005E),
>     "`" (U+0060), "{" (U+007B), "|" (U+007C) and "}" (U+007D)?
>
> Q7. Can an H-identifier contain the controls (C0 controls, DEL and C1 controls,
>    U+0000 - U+001F U+007F - U+009F)?
>
> Q8. Can an H-identifier contain the Bidi formatting characters
>    (U+200E, U+200F, U+202H-202E)?
>
> Q9. Can an H-identifier contain the Specials (U+FFF0-FFFD)?
>
> Q10. Can an H-identifier contain the Tags (U+E0000-E0FFF)?
>
> Q11. Can an H-identifier contain the Non-characters (U+FDD0-FDEF, U+1FFFE-1FFFF,
>    U+2FFFE-2FFFF, U+3FFFE-3FFFF, U+4FFFE-4FFFF, U+5FFFE-5FFFF, U+6FFFE-6FFFF,
>    U+7FFFE-7FFFF, U+8FFFE-8FFFF, U+9FFFE-9FFFF, U+AFFFE-AFFFF, U+BFFFE-BFFFF,
>    U+CFFFE-CFFFF, U+DFFFE-DFFFF, U+EFFFE-EFFFF, U+FFFFE-FFFFF, U+10FFFE-10FFFF)?
>
> Q12. Can an H-identifier contain the Surrogate code units (U+D800-U+DFFF)?
>
> Q13. Which character is allowed as part of a Z-identifier?
>
> My answer to Q3 thru 13 is:
>
>   Q3: Y, Q4 thru 12: N
>
> But I can live with
>
>   Q3: Y, Q4 thru 12: Y
>
> I am against any other combinations.
>
> My answers to Q0, Q1, Q2 are Y, Y, and Y, respectively.
>
> Q13 should be clear from the ZIP specification referenced from Part 2, but it is not at clear.
>
> That spec simply says:
>
>>       file name: (Variable)
>>
>>           The name of the file, with optional relative path.
>>           The path stored should not contain a drive or
>>           device letter, or a leading slash.  All slashes
>>           should be forward slashes '/' as opposed to
>>           backwards slashes '\' for compatibility with Amiga
>>           and Unix file systems etc.  If input came from standard
>>           input, there is no file name field.  If encrypting
>>           the central directory and general purpose bit flag 13 is set
>>           indicating masking, the file name stored in the Local Header
>>           will not be the actual file name.  A masking value consisting
>>           of a unique hexadecimal value will be stored.  This value will
>>           be sequentially incremented for each file in the archive. See
>>           the section on the Strong Encryption Specification for details
>>           on retrieving the encrypted file name.
>
> Some people thought this allows any encoding such as Shift_JIS.  The result is zero interoperability.
>
> I think that we have to create our own definition of Z-strings, which should cause no problems to every important implementation of the ZIP file format.  Part 2 appears to implicitly provide such a definition by specifying a conversion procedure in Appendix A.3.
>
> Cheers,
> Makoto
>
>



-- 
 --
国際大学
村田 真 <EB2M-MRT at asahi-net.or.jp>


More information about the sc34wg4 mailing list