DII workshop in London, May 18

Caroline Arms caar at loc.gov
Fri Apr 10 16:02:27 CEST 2009


Murata-san and Doug,

Just FYI on the validity of included binary data:
This is an area of great interest to libraries and archives, and there is already a tool, JHOVE, that handles several common binary formats, including TIFF and JPEG images. See http://hul.harvard.edu/jhove/

Many institutions building digital archives have several years of experience with it; they typically use the tool to validate and characterize digital content as it is submitted to the archive. Based on that experience, an improved architecture is now under development. See http://confluence.ucop.edu/display/JHOVE2Info/Home

Should you actually be interested in following up with the project leader (the same for both developments -- he moved from Harvard to the University of California) I can make the introduction.

Caroline Arms

>>> "MURATA Makoto (FAMILY Given)" <eb2m-mrt at asahi-net.or.jp> 04/09/09 9:34 PM >>>
(Not wearing my convenor hat)

29500 already separates document conformance from application
conformance. I propose to introduce some layers for each.

1. Layers for data conformance

ZIP conformance
  Is the data a correct zip file?
OPC conformance
  Is the data a correct OPC package?
XML conformance
  Are the included XML documents well-formed?
OOXML schema conformance (transitional/strict)
  XSD conformance 
    Are the included XML documents valid against OOXML XSD schemas?
  RNG conformance
    Are the included XML documents valid against OOXML RNG schemas?
Conformance to other formats such as JPEG
  Is the included binary data correct?
OOXML full conformance (transitional/strict)
  Does the OOXML document satisfy all requirements specified in 
  29500?

2. Layers for application conformance

ZIP conformance
  Can the application produce and/or consume correct ZIP files?
OPC conformance
  Can the application produce and/or consume correct OPC packages?
XML conformance
  Can the application produce and/or consume well-formed XML documents 
  as parts of OPC packages?
OOXML schema conformance (transitional/strict)
  Can the application produce and/or consume valid XML documents 
  as parts of OPC packages?
Conformance to other formats such as JPEG
  Can the application produce and/or consume correct binary data?
OOXML full conformance
  Does the application satisfy all requirements specified in 
  29500?


Some of the above layers are hard to test, while others are quite easy. 
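
As a rough illustration of the easier data-conformance layers, here is a minimal Python sketch using only the standard library. It is not part of 29500 or of any existing tool. The names check_layers and make_package are hypothetical. The OPC step checks only one necessary condition (the presence of a [Content_Types].xml part), so it is far from a full OPC check, and schema validation and binary-format checks are left out entirely.

```python
import io
import zipfile
import xml.etree.ElementTree as ET


def check_layers(data: bytes) -> dict:
    """Run three of the easier data-conformance layers on a candidate package.

    Hypothetical helper: returns one boolean per layer. Later layers are
    only attempted when the ZIP layer succeeds.
    """
    results = {"zip": False, "opc": False, "xml": False}

    # Layer 1: ZIP conformance. Can the archive be opened at all?
    try:
        zf = zipfile.ZipFile(io.BytesIO(data))
    except zipfile.BadZipFile:
        return results

    with zf:
        # testzip() returns the first member with a bad CRC, or None.
        if zf.testzip() is not None:
            return results
        results["zip"] = True

        names = zf.namelist()

        # Layer 2 (partial): a ZIP-based OPC package carries a
        # [Content_Types].xml part. Necessary, not sufficient.
        results["opc"] = "[Content_Types].xml" in names

        # Layer 3: every XML part must be well-formed.
        # Assumption: XML parts are recognised by file extension only.
        xml_ok = True
        for name in names:
            if name.endswith((".xml", ".rels")):
                try:
                    ET.fromstring(zf.read(name))
                except ET.ParseError:
                    xml_ok = False
        results["xml"] = xml_ok

    return results


def make_package(parts: dict) -> bytes:
    """Build an in-memory ZIP from {part name: content}, for trying the check."""
    buf = io.BytesIO()
    with zipfile.ZipFile(buf, "w") as zf:
        for name, content in parts.items():
            zf.writestr(name, content)
    return buf.getvalue()
```

Calling check_layers(make_package({"[Content_Types].xml": "<Types/>", "word/document.xml": "<doc/>"})) reports all three layers as passing, while arbitrary non-ZIP bytes fail at the first layer. Schema validation (XSD or RNG) and full conformance would need an external validator and much more work.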


Cheers,
Makoto <EB2M-MRT at asahi-net.or.jp>




