Sketch of ISO/IEC 30114-2 Information Technology -- Extensions of Office Open XML File Formats -- Character repertoire checking

MURATA Makoto (FAMILY Given) eb2m-mrt at asahi-net.or.jp
Tue Aug 23 17:07:39 CEST 2011


Dear colleagues,

Here is my sketchy draft.  Comments welcome.

Cheers,
Makoto
--------------------------------------------------------------------
Sketch of ISO/IEC 30114-2  Character Repertoire Checking

Foreword

....

ISO/IEC 30114 consists of the following parts, under the general title Information technology --
Extensions of Office Open XML File Formats:

Part 1: Guidelines
Part 2: Character repertoire checking

Annex A forms a normative part of this part of ISO/IEC 30114.

Introduction

ISO/IEC 29500 is designed to allow safe extensions. An OOXML extension 
is safe if OOXML documents containing this extension are guaranteed to
conform to ISO/IEC 29500 and are thus guaranteed to be processable by
conformant applications of ISO/IEC 29500.  Safe extensions of ISO/IEC
29500 may or may not be standardized.  ISO/IEC 30114 provides a
collection of standardized safe extensions of OOXML.

This part of ISO/IEC 30114 provides a standardized safe extension for
character repertoire checking.  For example, it can be used to ensure
that some cells in OOXML Spreadsheet documents contain characters in
ISO 8859-1 only.
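
As a rough illustration of the ISO 8859-1 example above, the check on a single cell's text could be sketched as follows. This is not taken from the draft: the function names are hypothetical, and the sketch assumes that "characters in ISO 8859-1" means the graphic characters at code points 0x20-0x7E and 0xA0-0xFF. A real implementation would evaluate a CREPDL script rather than hard-code the ranges.

```python
def in_iso_8859_1(text: str) -> bool:
    """Return True if every character is a graphic character of ISO 8859-1.

    Assumption: the repertoire is taken to be code points 0x20-0x7E and
    0xA0-0xFF; control characters are treated as outside the repertoire.
    """
    for ch in text:
        cp = ord(ch)
        if not (0x20 <= cp <= 0x7E or 0xA0 <= cp <= 0xFF):
            return False
    return True


def offending_characters(text: str) -> list[str]:
    """List the characters of text that fall outside the repertoire."""
    return [ch for ch in text if not in_iso_8859_1(ch)]


if __name__ == "__main__":
    for sample in ["Café", "naïve €5", "東京"]:
        print(repr(sample), in_iso_8859_1(sample), offending_characters(sample))
```

Reporting the offending characters, and not only a yes/no answer, is a design choice of this sketch; the draft does not say how violations should be reported.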

The structure of this part of ISO/IEC 30114 is as follows. ...

 
1. Scope

This part of the multi-part standard provides an extension for
embedding CREPDL scripts (as specified in ISO/IEC 19757-7) as OPC parts
within OOXML documents, associating such OPC parts with certain
locations in OOXML documents, and examining whether or not text chunks
at these locations are in the specified repertoires.

Issue: This draft covers two locations: sdtPr of WML and c of SML.  Should
we try to provide more?

2. Conformance
  2.1 Data conformance
  An OOXML document is conformant to this part of the standard if it
  conforms to Part 1 or Part 4 of ISO/IEC 29500 and also satisfies the
  requirements stated in this part.
  2.2 Application conformance
  To be supplied.

3. Normative References

ISO/IEC 29500-1:2011
ISO/IEC 29500-1:2011 AMD1
ISO/IEC 29500-3:2011
ISO/IEC 29500-4:2011
ISO/IEC 29500-4:2011 AMD1
ISO/IEC 19757-2
ISO/IEC 19757-4
ISO/IEC 19757-7

4. Terms and definitions

OPC part
Alternate Content Blocks

5. Additional Requirements
    5.1 Additional OPC Part
    5.1.1 CREPDL OPC Part

     Media type: application/xml
     Root namespace: http://purl.oclc.org/dsdl/crepdl/ns/structure/1.0
     Source Relationship:? 

    An instance of this OPC part type shall be a CREPDL script as specified in ISO/IEC 19757-7.

    A package is permitted to contain zero or more CREPDL OPC parts, each of which shall be the target of an 
    explicit relationship in c of SML or characterRepertoire (see 5.2.1.1) of WML.
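
The root-namespace requirement listed above can be checked mechanically. The following is a minimal sketch, not part of the draft: the function name and the sample part are hypothetical, and the element name `char` with its content is used only as a plausible placeholder for a CREPDL script; ISO/IEC 19757-7 is the authority on the actual syntax. The check covers only the namespace of the root element, not validity of the script.

```python
import xml.etree.ElementTree as ET

# Root namespace given for the CREPDL OPC part in 5.1.1.
CREPDL_NS = "http://purl.oclc.org/dsdl/crepdl/ns/structure/1.0"

# Hypothetical part content, for illustration only.
SAMPLE_PART = (
    b'<char xmlns="http://purl.oclc.org/dsdl/crepdl/ns/structure/1.0">'
    b"[a-z]</char>"
)


def has_crepdl_root(part_bytes: bytes) -> bool:
    """Return True if the root element is in the CREPDL namespace."""
    root = ET.fromstring(part_bytes)
    # ElementTree spells qualified names as "{namespace}localname".
    return root.tag.startswith("{" + CREPDL_NS + "}")


if __name__ == "__main__":
    print(has_crepdl_root(SAMPLE_PART))
    print(has_crepdl_root(b"<char>[a-z]</char>"))
```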

    5.2. Extensions using Markup Compatibility and Extensions
          5.2.1 Ignorable elements and attributes
          5.2.1.1 WML

          Introduce a new child element of <w:sdtPr>.  This
          element references one of the OPC parts sketched above.

        <sdtPr>
             ....
            <foo:characterRepertoire r:id="..."/>
             ...
         </sdtPr>

          Issue: sdtPr appears in WordprocessingML_Comments,
          WordprocessingML_Endnotes, WordprocessingML_Footer,
          WordprocessingML_Footnotes, WordprocessingML_Glossary_Document,
          WordprocessingML_Header, and WordprocessingML_Main_Document.
          Should we allow characterRepertoire in all of these OPC parts?

          5.2.1.2 SML
          Introduce an ignorable attribute @cr:id on <c> elements in SML.
          This attribute references a CREPDL OPC part.  The CREPDL script
          in this part constrains the text content of the current cell.

          @cr:id is very similar to @r:id defined in shared-relationshipReference.xsd, 
          but we use a different namespace "http://purl.oclc.org/ooxml/officeDocument/relationshipsCharRepertoire" 
          for making this attribute ignorable.
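
To make the SML proposal concrete, here is a small sketch of how a consumer might locate cells that carry the proposed attribute. It is illustrative only: the sample worksheet, the relationship id "rId7", and the function name are hypothetical, and the sketch assumes the strict SpreadsheetML namespace and the usual mc:Ignorable declaration from Markup Compatibility and Extensions. Resolving the relationship id to a CREPDL OPC part and running the script are left out.

```python
import xml.etree.ElementTree as ET

SML_NS = "http://purl.oclc.org/ooxml/spreadsheetml/main"
CR_NS = "http://purl.oclc.org/ooxml/officeDocument/relationshipsCharRepertoire"
MC_NS = "http://schemas.openxmlformats.org/markup-compatibility/2006"

# Hypothetical worksheet: cell A1 carries cr:id, cell B1 does not.
SAMPLE_SHEET = f"""<worksheet xmlns="{SML_NS}"
    xmlns:mc="{MC_NS}" xmlns:cr="{CR_NS}" mc:Ignorable="cr">
  <sheetData>
    <row r="1">
      <c r="A1" t="inlineStr" cr:id="rId7"><is><t>Cafe</t></is></c>
      <c r="B1"><v>42</v></c>
    </row>
  </sheetData>
</worksheet>"""


def cells_with_repertoire(sheet_xml: str) -> dict[str, str]:
    """Map cell references (e.g. "A1") to the relationship id in cr:id."""
    root = ET.fromstring(sheet_xml)
    result = {}
    for cell in root.iter(f"{{{SML_NS}}}c"):
        rel_id = cell.get(f"{{{CR_NS}}}id")
        if rel_id is not None:
            result[cell.get("r")] = rel_id
    return result


if __name__ == "__main__":
    print(cells_with_repertoire(SAMPLE_SHEET))
```

Because the cr prefix is declared ignorable, a consumer that does not understand this extension can drop the attribute and still process the worksheet, which is the point of using a separate namespace.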

          Issue: <c> appears in SpreadsheetML_Calculation_Chain and SpreadsheetML_Worksheet.
          Should we allow @cr:id in both OPC parts?

          5.2.1.3 PML
          None.

          5.2.2 Alternate Content Blocks
          This part of the standard does not use alternate content blocks.

    5.3. Extensions using OOXML-defined extension elements
    This part of the standard does not use OOXML-defined extension elements, namely extLst.

Annex A: Schemas (Normative)

A.1 NVDL Schema (normative)

wmlExtension.nvdl
smlExtension.nvdl

Note: They should invoke NVDL scripts in Part 3 rather than
WordprocessingML_Main_Document.rnc or SpreadsheetML_Worksheet.rnc.

A.2 RELAX NG schema (normative)

relationshipsCharRepertoireAttribute.rnc (invoked by wmlExtension.nvdl)

relationshipsCharRepertoire.rnc (invoked by smlExtension.nvdl)

A.3 W3C XML Schema schema (normative)

relationshipsCharRepertoire.xsd

relationshipsCharRepertoireAttribute.xsd
-------------- next part --------------
A non-text attachment was scrubbed...
Name: relationshipsCharRepertoire.rnc
Type: application/octet-stream
Size: 273 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20110824/548d1e95/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: relationshipsCharRepertoire.xsd
Type: application/octet-stream
Size: 665 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20110824/548d1e95/attachment-0001.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: relationshipsCharRepertoireAttribute.rnc
Type: application/octet-stream
Size: 227 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20110824/548d1e95/attachment-0002.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: relationshipsCharRepertoireAttribute.xsd
Type: application/octet-stream
Size: 574 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20110824/548d1e95/attachment-0003.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smlExtension.nvdl
Type: application/octet-stream
Size: 1061 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20110824/548d1e95/attachment-0004.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: wmlExtension.nvdl
Type: application/octet-stream
Size: 1233 bytes
Desc: not available
URL: <http://mailman.vse.cz/pipermail/sc34wg4/attachments/20110824/548d1e95/attachment-0005.obj>

