<html xmlns:v="urn:schemas-microsoft-com:vml" xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40"><head><meta http-equiv=Content-Type content="text/html; charset=us-ascii"><meta name=Generator content="Microsoft Word 14 (filtered medium)"><style><!--
/* Font Definitions */
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri","sans-serif";}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:purple;
text-decoration:underline;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Calibri","sans-serif";
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-family:"Calibri","sans-serif";}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext="edit" spidmax="1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext="edit">
<o:idmap v:ext="edit" data="1" />
</o:shapelayout></xml><![endif]--></head><body lang=EN-US link=blue vlink=purple><div class=WordSection1><p class=MsoNormal>Just FYI. The OOXML record in PRONOM, the primary database of file format signatures used by the archival community, includes a signature:<o:p></o:p></p><p class=MsoNormal><a href="http://apps.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=detailReport&id=910&strPageToDisplay=signatures">http://apps.nationalarchives.gov.uk/PRONOM/Format/proFormatSearch.aspx?status=detailReport&id=910&strPageToDisplay=signatures</a> <o:p></o:p></p><p class=MsoNormal>which takes advantage of the growth hint bytes actually found in .xlsx, .docx, and .pptx (etc.) files to recognize OOXML files from the Zip-based package without unpacking it. <o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>The DROID tool, which is driven by this information, should be viewed as more flexible than the traditional magic number tool, but it is basically used in archive ingest workflows for tasks like (a) checking that a file&#8217;s content appears to match its file extension, (b) distinguishing between file types that use the same extension, and (c) triage (for example, to decide whether to invoke more complex characterization or validation steps).<o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>For more on DROID, see <a href="http://www.nationalarchives.gov.uk/information-management/manage-information/preserving-digital-records/droid/">http://www.nationalarchives.gov.uk/information-management/manage-information/preserving-digital-records/droid/</a><o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal> Caroline<o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>Caroline Arms<o:p></o:p></p><p class=MsoNormal>Library of Congress Contractor<o:p></o:p></p><p class=MsoNormal>Co-compiler of Sustainability of Digital Formats resource <a href="http://www.digitalpreservation.gov/formats/"><span 
style='color:blue'>http://www.digitalpreservation.gov/formats/</span></a><o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p><p class=MsoNormal>** Views expressed are personal and not necessarily those of the institution **<o:p></o:p></p><p class=MsoNormal><o:p> </o:p></p></div></body></html>