Misplaced Pages

Office Open XML: Difference between revisions

Article snapshot taken from Wikipedia with creative commons attribution-sharealike license. Give it a read and then ask your questions in the chat. We can research this topic together.
Browse history interactively← Previous editNext edit →Content deleted Content addedVisualWikitext
Revision as of 07:56, 30 September 2009 editScientus (talk | contribs)5,503 edits Support: weasel-worded OR← Previous edit Revision as of 08:23, 30 September 2009 edit undoScientus (talk | contribs)5,503 edits Undid revision 315657766 by HAl (talk) Enough FUD user:HAl, Microsoft Press Release even supports sentence.Next edit →
Line 73: Line 73:
] originally developed the specification as a compatible successor to its earlier ] and ].<ref>{{ cite web | url = https://www.microsoft.com/presspass/features/2005/nov05/11-21Ecma.mspx | title = Q&A: Microsoft Co-Sponsors Submission of Office Open XML Document Formats to Ecma International for Standardization | date = 2005-11-21 | publisher = Microsoft }}</ref> and in 2005 submitted it to ] Technical Committee TC45, where it was standardized as ECMA-376 in 2006. In April 2008, after incorporating the proposed changes from ] members during ] a revised version of the specification was approved as ] ISO/IEC 29500:2008,<ref name="ISOIECapproval">{{ cite web | url = http://www.iso.org/iso/pressrelease.htm?refid=Ref1123 | title = ISO/IEC DIS 29500 receives necessary votes for approval as an International Standard | date = 2008-04-02 | publisher = ISO }}</ref> published in November 2008,<ref>{{ cite web | url = http://www.iso.org/iso/pressrelease.htm?refid=Ref1181 | title = Publication of ISO/IEC 29500:2008, Information technology&nbsp;— Office Open XML formats | date = 2008-11-18 | accessdate = 2008-11-19 | publisher = ISO | author = ISO/IEC }}</ref> and as ECMA-376 Office Open XML File Formats - 2nd edition (December 2008). ISO/IEC 29500:2008 is a 4 part standard specification.<ref name=free_ISO/><ref name="StandardECMA376">{{ cite web | url = http://www.ecma-international.org/publications/standards/Ecma-376.htm | title = Standard ECMA-376 | publisher = Ecma-international.org | accessdate = 2009-05-19 }}</ref> ] originally developed the specification as a compatible successor to its earlier ] and ].<ref>{{ cite web | url = https://www.microsoft.com/presspass/features/2005/nov05/11-21Ecma.mspx | title = Q&A: Microsoft Co-Sponsors Submission of Office Open XML Document Formats to Ecma International for Standardization | date = 2005-11-21 | publisher = Microsoft }}</ref> and in 2005 submitted it to ] Technical Committee TC45, where it was standardized as ECMA-376 in 2006. In April 2008, after incorporating the proposed changes from ] members during ] a revised version of the specification was approved as ] ISO/IEC 29500:2008,<ref name="ISOIECapproval">{{ cite web | url = http://www.iso.org/iso/pressrelease.htm?refid=Ref1123 | title = ISO/IEC DIS 29500 receives necessary votes for approval as an International Standard | date = 2008-04-02 | publisher = ISO }}</ref> published in November 2008,<ref>{{ cite web | url = http://www.iso.org/iso/pressrelease.htm?refid=Ref1181 | title = Publication of ISO/IEC 29500:2008, Information technology&nbsp;— Office Open XML formats | date = 2008-11-18 | accessdate = 2008-11-19 | publisher = ISO | author = ISO/IEC }}</ref> and as ECMA-376 Office Open XML File Formats - 2nd edition (December 2008). ISO/IEC 29500:2008 is a 4 part standard specification.<ref name=free_ISO/><ref name="StandardECMA376">{{ cite web | url = http://www.ecma-international.org/publications/standards/Ecma-376.htm | title = Standard ECMA-376 | publisher = Ecma-international.org | accessdate = 2009-05-19 }}</ref>


Starting with ], the Office Open XML file formats (ECMA-376) have become the default file format of ].<ref>{{ cite web | url = http://www.courierpress.com/news/2008/oct/24/microsofts-future-lies-somewhere-beyond-the/ | title = Microsoft's future lies somewhere beyond the Vista by Evansville Courier & Press | publisher = Courierpress.com | accessdate = 2009-05-19 }}</ref><ref>{{ cite web | url = http://knowledge.wharton.upenn.edu/article.cfm?articleid=1795 | title = Rivals Set Their Sights on Microsoft Office: Can They Topple the Giant? - Knowledge@Wharton | publisher = Knowledge.wharton.upenn.edu | accessdate = 2009-05-19}}</ref> Microsoft has stated that the planned ] will be the first version to implement the ISO/IEC 29500:2008 compliant version of Office Open XML. Starting with ], the Office Open XML file formats (ECMA-376) have become the default file format of ].<ref>{{ cite web | url = http://www.courierpress.com/news/2008/oct/24/microsofts-future-lies-somewhere-beyond-the/ | title = Microsoft's future lies somewhere beyond the Vista by Evansville Courier & Press | publisher = Courierpress.com | accessdate = 2009-05-19 }}</ref><ref>{{ cite web | url = http://knowledge.wharton.upenn.edu/article.cfm?articleid=1795 | title = Rivals Set Their Sights on Microsoft Office: Can They Topple the Giant? - Knowledge@Wharton | publisher = Knowledge.wharton.upenn.edu | accessdate = 2009-05-19}}</ref> However, due to the changes introduced in a later version, Office 2007 is not entirely in compliance with ISO/IEC 29500:2008.<ref name=ODFA_communityone>{{ cite web | url = http://www.odfalliance.org/resources/IssueBriefImplementations.pdf | title = OOXML Implementations: A Community of One | publisher = ODF Alliance | date = 2008-02-20 | accessdate = 2009-05-19 }}</ref><ref name="SupportedFormats">{{ cite web | url = http://www.microsoft.com/Presspass/press/2008/may08/05-21ExpandedFormatsPR.mspx | title = Microsoft Expands List of Formats Supported in Microsoft Office | publisher = Microsoft.com | date = 2008-05-21 | accessdate = 2009-05-19 }}</ref><ref name=cwO14>{{ cite web | last = Lai | first = Eric | url = http://www.computerworld.com/action/article.do?command=viewArticleBasic&taxonomyName=Protocols+and+Standards&articleId=9089258&taxonomyId = 141&pageNumber=1 | title = FAQ: Office 14 and Microsoft's support for ODF | publisher = Computerworld.com | date = 2008-05-27 | accessdate = 2009-05-19 }}</ref><ref name=notOOXML>{{ cite web | author = Andy Updegrove | url = http://consortiuminfo.org/standardsblog/article.php?story=20080521092930864 | title = Microsoft Office 2007 to Support ODF&nbsp;— and not OOXML | publisher = ConsortiumInfo.org | accessdate = 2009-05-19 }}</ref> Microsoft has stated that the planned ] will be the first version to implement the ISO/IEC 29500:2008 compliant version of Office Open XML.


== Background == == Background ==

Revision as of 08:23, 30 September 2009

Template:Distinguish2

Office Open XML Document
Filename extension .docx or .docm
Internet media type application/vnd.
openxmlformats-officedocument.
wordprocessingml.
document
Developed byMicrosoft, Ecma, ISO/IEC
Type of formatDocument file format
Extended fromXML, DOC, WordProcessingML
StandardECMA-376, ISO/IEC 29500
WebsiteECMA-376, ISO/IEC 29500:2008
Office Open XML Presentation
Filename extension .pptx
Internet media type application/vnd.
openxmlformats-officedocument.
presentationml.
presentation
Developed byMicrosoft, Ecma, ISO/IEC
Type of formatPresentation
Extended fromXML, PPT
StandardECMA-376, ISO/IEC 29500
WebsiteECMA-376, ISO/IEC 29500:2008
Office Open XML Workbook
Filename extension .xlsx
Internet media type application/vnd.
openxmlformats-officedocument.
spreadsheetml.
sheet
Developed byMicrosoft, Ecma, ISO/IEC
Type of formatSpreadsheet
Extended fromXML, XLS, SpreadsheetML
StandardECMA-376, ISO/IEC 29500
WebsiteECMA-376, ISO/IEC 29500:2008

Office Open XML (also referred to as OOXML, or Open XML) is a file format for representing spreadsheets, charts, presentations and word processing documents. An Office Open XML file is a ZIP-compatible OPC package containing XML documents and other resources. Office Open XML standard is an IEC/ISO standard; the specification is freely available from ISO for individual and non-redistributable use only. or freely to copy without any restriction from the Ecma International website.

Microsoft originally developed the specification as a compatible successor to its earlier binary and Office 2003 XML file formats. and in 2005 submitted it to Ecma International Technical Committee TC45, where it was standardized as ECMA-376 in 2006. In April 2008, after incorporating the proposed changes from JTC 1 members during the standardization process of Office Open XML a revised version of the specification was approved as International Standard ISO/IEC 29500:2008, published in November 2008, and as ECMA-376 Office Open XML File Formats - 2nd edition (December 2008). ISO/IEC 29500:2008 is a 4 part standard specification.

Starting with Microsoft Office 2007, the Office Open XML file formats (ECMA-376) have become the default file format of Microsoft Office. However, due to the changes introduced in a later version, Office 2007 is not entirely in compliance with ISO/IEC 29500:2008. Microsoft has stated that the planned Microsoft Office 2010 will be the first version to implement the ISO/IEC 29500:2008 compliant version of Office Open XML.

Background

Prior to the 2007 edition, the core applications of the Microsoft Office software suite (primarily Word, Excel, and PowerPoint) by default stored their data in binary files. Historically, these files were difficult for other applications to interoperate with, due to the lack of publicly available information. Before 2007 Microsoft offered these binary format specifications under a royalty-free license and since 2007 the formats are directly downloadable from their site under a 'covenant not to sue' as part of its Open Specification Promise. Due to Microsoft previously keeping their file formats secret, other office software had great difficulty obtaining full levels of interoperability. Microsoft came under increasing pressure to adopt an open file format, in particular several nations adopted rules that official documents should be in an open format.

In 2000, Microsoft released an initial version of an XML-based format for Microsoft Excel, which was incorporated in Office XP. In 2002, a new file format for Microsoft Word followed. The Excel and Word formats – known as the Microsoft Office XML formats – were later incorporated into the 2003 release of Microsoft Office.

In May 2004, governments and the European Union recommended to Microsoft that they publish and standardize their XML Office formats through a standardization organization. Microsoft announced in November 2005 that it would standardize the new version of their XML-based formats through Ecma International, as "Ecma Office Open XML".

Standardization

Main article: Standardization of Office Open XML

Office Open XML was standardized first by Ecma International as ECMA-376 (published 7 December 2006). This standard was then fast-tracked in the Joint Technical Committee 1 of ISO and IEC where it became an International Standard, ISO/IEC 29500:2008 (published November 2008).

Licensing

Availability of the standard

Ecma International provides free access to the specifications that also "can be freely copied by all interested parties without restrictions".

ISO has added the document to its "list of freely available standards"; ISO permits everyone to download the standard for free also allowing them to print one copy of the standard and states that users may not further share their copies.

Royalty free licensing on patented technology

Microsoft, the biggest contributor to the standard, has provided additional royalty free licensing for any possible technologies required to implement or use Office Open XML.

Under the Ecma International code of conduct in patent matters, participating and approving member organisations of ECMA are required to make available their patent rights on a Reasonable and Non Discriminatory (RAND) basis. While making patent rights available on a RAND basis is considered a common minimum patent condition for a standard, international standardization has a clear preference for royalty-free patent licensing.

Microsoft, a main contributor to the standard, provided a Covenant Not to Sue for its patent licensing. The covenant received a mixed reception, with some (like the Groklaw blog) identifying problems and others (such as Lawrence Rosen, an attorney and lecturer at Stanford Law School) endorsing it.

Microsoft has added the format to their Open Specification Promise in which

"Microsoft irrevocably promises not to assert any Microsoft Necessary Claims against you for making, using, selling, offering for sale, importing or distributing any implementation to the extent it conforms to a Covered Specification "

subject to certain restrictions. Office Open XML can therefore be used under the Covenant Not to Sue or the Open Specification Promise which both are forms of royalty free licensing.

The Open Specification Promise was included in documents submitted to ISO/IEC in support of the ECMA-376 fast track submission. Ecma International asserted that, "The OSP enables both open source and commercial software to implement ."

File format and structure

Office Open XML main components

Office Open XML uses a file package conforming to the Open Packaging Conventions. This format uses mechanisms from the ZIP file format and contains the individual files that form the basis of the document. In addition to Office markup, the package can also include embedded files such as images, videos, or other documents.

Document markup languages

An Office Open XML file may contain several documents encoded in specialized markup languages corresponding to applications within the Microsoft Office product line. Office Open XML defines multiple vocabularies using 27 namespaces and 89 schema modules.

The primary markup languages are:

  • WordprocessingML for word-processing
  • SpreadsheetML for spreadsheets
  • PresentationML for presentations

Shared markup language materials include:

  • Office Math Markup Language (OMML)
  • DrawingML used for vector drawing, charts, and for example, text art (additionally, though deprecated, VML is supported for drawing)
  • Extended properties
  • Custom properties
  • Variant Types
  • Custom XML data properties
  • Bibliography

In addition to the above markup languages custom XML schemas can be used to extend Office Open XML.

The XML Schema of Office Open XML emphasizes reducing load time and improving parsing speed. In a test with applications current in April 2007, XML-based office documents were slower to load than binary formats. To enhance performance, Office Open XML uses very short element names for common elements and spreadsheets save dates as index numbers (starting from 1899 or from 1904). In order to be systematic and generic, Office Open XML typically uses separate child elements for data and metadata (element names ending in Pr for properties) rather than using multiple attributes, which allows structured properties. Office Open XML does not use mixed content but uses elements to put a series of text runs (element name r) into paragraphs (element name p). The result is terse and highly nested in contrast to HTML, for example, which is fairly flat, designed for humans to write in text editors and is more congenial for humans to read.

Office MathML (OMML)

Office Math Markup Language is a mathematical markup language which can be embedded in WordprocessingML, with intrinsic support for including word processing markup like revision markings, footnotes, comments, images and elaborate formatting and styles. The OMML format is different from the World Wide Web Consortium (W3C) MathML recommendation that does not support those office features, but is partially compatible through XSL Transformations.

The following Office MathML example defines the fraction: π 2 {\displaystyle {\frac {\pi }{2}}}

<m:oMathPara>
  <m:oMath>
    <m:f>
      <m:num><m:r>
        <m:t>π</m:t>
      </m:r></m:num>
      <m:den><m:r>
        <m:t>2</m:t>
      </m:r></m:den>
    </m:f>
  </m:oMath>
</m:oMathPara>

DrawingML

Example of DrawingML charts

DrawingML is the vector graphics markup language used in Office Open XML documents. Its major features are the graphics rendering of text elements, graphical vector-based shape elements, graphical tables and charts.

The DrawingML table is the third table model in Office Open XML (next to the table models in WordprocessingML and SpreadsheetML) and is optimized for graphical effects and its main use is in presentations created with PresentationML markup. DrawingML contains graphics effects (like shadows and reflection) that can be used on the different graphical elements that are used in DrawingML. In DrawingML you can also create 3d effects, for instance to show the different graphical elements through a flexible camera viewpoint. It is possible to create separate DrawingML theme parts in an Office Open XML package. These themes can then be applied to graphical elements throughout the Office Open XML package.

DrawingML is unrelated to the other vector graphics formats such as SVG. These can be converted to DrawingML to include natively in an Office Open XML document. This is a different approach to that of the OpenDocument format, which uses a subset of SVG, and includes vector graphics as separate files.

A DrawingML graphic's dimensions are specified in English Metric Units (EMUs). It is so called because it allows an exact common representation of dimensions originally in either English or Metric units. This unit is defined as 1/360,000 of a centimeter and thus there are 914,400 EMUs per inch, and 12,700 EMUs per point. This unit was chosen so that integers can be used to accurately represent most dimensions encountered in a document. Floating point cannot accurately represent a fraction that is not a sum of powers of two and the error is magnified when the fractions are added together many times, resulting in misalignment. As an inch is exactly 2.54 centimeters, or 127/50, 1/127 inch is an integer multiple of a power-of-ten fraction of the meter (2×10 m). To accurately represent (with an integer) 1 μm = 10 m, a divisor of 100 is further needed. To accurately represent the point unit, a divisor of 72 is needed, which also allows divisions by 2, 3, 4, 6, 8, 9, 12, 18, 24, and 36 to be accurate. Multiplying these together gives 127×72×100 = 914,400 units per inch; this also allows exact representations of multiples of 1/100 & 1/32 inch. According to Rick Jelliffe, programmer and standards activist (ISO, W3C, IETF), EMUs are a rational solution to a particular set of design criteria.

Container structure

Main article: Open Packaging Conventions
Container structure of Part 2 of the Ecma Office Open XML standard, ECMA-376

Office Open XML documents are stored in Open Packaging Convention (OPC) packages, which are ZIP files containing XML and other data files, along with a specification of the relationships between them. Depending on the type of the document, the packages have different internal directory structures and names. An application will use the relationships files to locate individual sections (files), with each having accompanying metadata, in particular MIME metadata.

A basic package contains an XML file called .xml at the root, along with three directories: _rels, docProps, and a directory specific for the document type (for example, in a .docx word processing package, there would be a word directory). The word directory contains the document.xml file which is the core content of the document.

.xml
This file provided MIME type information for parts of the package, using defaults for certain file extensions and overrides for parts specificied by IRI.
_rels
This directory contains relationships for the files within the package. To find the relationships for a specific file, look for the _rels directory that is a sibling of the file, and then for a file that has the original file name with a .rels appended to it. For example, if the content types file had any relationships, there would be a file called .xml.rels inside the _rels directory.
_rels/.rel
This file is where the package relationships are located. Applications look here first. Viewing in a text editor, one will see it outlines each relationship for that section. In a minimal document containing only the basic document.xml file, the relationships detailed are metadata and document.xml.
docProps/core.xml
This file contains the core properties for any Office Open XML document.
word/document.xml
This file is the main part for any Word document.

Relationships

Relationship files in Office Open XML

An example relationship file (word/_rels/document.xml.rels), is:

<?xml version="1.0" encoding="UTF-8" standalone="yes" ?>
<Relationships
  xmlns="http://schemas.microsoft.com/package/2005/06/relationships">
  <Relationship Id="rId1"
    Type="http://schemas.microsoft.com/office/2006/relationships/image"
    Target="http://en.wikipedia.org/images/wiki-en.png"
    TargetMode="External" />
  <Relationship Id="rId2"
    Type="http://schemas.microsoft.com/office/2006/relationships/hyperlink"
    Target="http://www.wikipedia.org"
    TargetMode="External" />
</Relationships>

As such, images referenced in the document can be found in the relationship file by looking for all relationships that are of type http://schemas.microsoft.com/office/2006/relationships/image. To change the used image, edit the relationship.

Hyperlink relations

The following code shows an example of inline markup for a hyperlink:

<w:hyperlink r:id="rId2" w:history="1"
  xmlns:r="http://schemas.openxmlformats.org/officeDocument/2006/relationships"
  xmlns:w="http://schemas.openxmlformats.org/wordprocessingml/2006/main">

In this example, the Uniform Resource Locator (URL) is represented by "rId2". The actual URL is in the accompanying relationships file, located by the corresponding "rId2" item. Linked images, templates, and other items are referenced in the same way.

Embedded or linked media file relations

Pictures can be embedded or linked using a tag:

<v:imagedata w:rel="rId1" o:title="example" />

This is the reference to the image file. All references are managed via relationships. For example, a document.xml has a relationship to the image. There is a _rels directory in the same directory as document.xml, inside _rels is a file called document.xml.rels. In this file there will be a relationship definition that contains type, ID and location. The ID is the referenced ID used in the XML document. The type will be a reference schema definition for the media type and the location will be an internal location within the ZIP package or an external location defined with a URL.

Document properties

Office Open XML uses the Dublin Core Metadata Element Set and DCMI Metadata Terms to store document properties. Dublin Core is a standard for cross-domain information resource description and is defined in ISO 15836:2003.

Core properties

An example document properties file (docProps/core.xml) that uses Dublin Core metadata, is:

<?xml version="1.0" encoding="UTF-8" standalone="yes"?>
<cp:coreProperties xmlns:cp="http://schemas.openxmlformats.org/package/2006/metadata/core-properties"
  xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:dcterms="http://purl.org/dc/terms/"
  xmlns:dcmitype="http://purl.org/dc/dcmitype/" xmlns:xsi="http://www.w3.org/2001/XMLSchema-instance">
  <dc:title>Office Open XML</dc:title>
  <dc:subject>File format and structure</dc:subject>
  <dc:creator>Misplaced Pages</dc:creator>
  <cp:keywords>Office Open XML, Metadata, Dublin Core</cp:keywords>
  <dc:description>Office Open XML uses ISO 15836:2003</dc:description>
  <cp:lastModifiedBy>Misplaced Pages</cp:lastModifiedBy>
  <cp:revision>1</cp:revision>
  <dcterms:created xsi:type="dcterms:W3CDTF">2008-06-19T20:00:00Z</dcterms:created>
  <dcterms:modified xsi:type="dcterms:W3CDTF">2008-06-19T20:42:00Z</dcterms:modified>
  <cp:category>Document file format</cp:category>
  <cp:contentStatus>Final</cp:contentStatus>
</cp:coreProperties>

Structure of the standard

ISO/IEC 29500:2008

The ISO/IEC standard is structured into four parts, each of which are independent standards. Part 2 specifying Open Packaging Conventions is, for example, used by other files formats including XPS and Design Web Format. The ISO/IEC 29500:2008 version is also known ECMA-376 2nd edition (2008)

Part 1. Fundamentals and Markup Language Reference (5560 pages)
Contains conformance definitions
Contains the reference material for WordprocessingML, SpreadsheetML, PresentationML, DrawingML, Shared MLs and Custom XML Schema, defining every element and attribute including the element hierarchy (parent/child relationships)
XML schemas for the markup languages are declared as XSD and (non-normatively) using RELAX NG
Defines the custom XML data storing facility
Schemas are included in the text
Part 2. Open Packaging Conventions (129 pages)
Defines the Open Packaging Conventions (package model, physical package)
Defines core properties, thumbnails and digital signatures
XML schemas for the OPC are declared as XML Schema Definitions (XSD) and (non-normatively) using RELAX NG (ISO/IEC 19757-2)
Part 3. Markup Compatibility and Extensibility (40 pages)
Describes extension facilities of OpenXML documents and specifies elements and attributes by which applications with different extensions can interoperate
Extensibility rules are expressed using NVDL (ISO/IEC 19757-4)
Part 4. Transitional Migration Features (1464 pages)
Contains reference material of legacy and transitional interest
Contains controversial material such as compatibility settings and the graphics markup language VML
Contains list of differences between ECMA-376 1st edition and ISO/IEC 29500:2008 (ECMA-376 2nd edition)

The standard specifies two levels of document and application conformance, strict and transitional for each of WordprocessingML, PresentationML and SpreadsheetML. The standard also specifies applications descriptions of base and full.

ECMA-376 1 edition (2006)

The ECMA standard is structured in five parts to meet the needs of different audiences.

Part 1. Fundamentals
Vocabulary, notational conventions and abbreviations
Summary of primary and supporting markup languages
Conformance conditions and interoperability guidelines
Constraints within the Open Packaging Conventions that apply to each document type
Part 2. Open Packaging Conventions
Defines the Open Packaging Conventions (package model, physical package)
Defines core properties, thumbnails and digital signatures
XML schemas for the OPC are declared as XML Schema Definitions (XSD) and (non-normatively) using RELAX NG (ISO/IEC 19757-2)
Part 3. Primer
Informative (non-normative) introduction to WordprocessingML, SpreadsheetML, PresentationML, DrawingML, VML and Shared MLs, providing context and illustrating elements through examples and diagrams
Describes the custom XML data storing facility within a package to support integration with business data
Part 4. Markup Language Reference
Contains the reference material for WordprocessingML, SpreadsheetML, PresentationML, DrawingML, Shared MLs and Custom XML Schema, defining every element and attribute including the element hierarchy (parent/child relationships)
XML schemas for the markup languages are declared as XSD and (non-normatively) using RELAX NG
Defines the custom XML data storing facility
Part 5. Markup Compatibility and Extensibility
Describes extension facilities of OpenXML documents and specifies elements and attributes by which applications with different extensions can interoperate
Extensibility rules are expressed using NVDL (ISO/IEC 19757-4)

Compatibility between versions

The intent of the changes from ECMA-376 1st edition to ISO 29500 was that a valid ECMA-376 document would be a valid ISO 29500 "transitional" document, but one change introduced at the BRM (refusing to allow further values for xsd:boolean) had the effect of breaking backwards compatibility for most documents. A fix for this has been suggested to JTC1/SC34/WG4, and was approved in June 2009 to go forward as a recommendation for the first amendment to OOXML.

Support and criticism

This article's "criticism" or "controversy" section may compromise the article's neutrality. Please help rewrite or integrate negative information to other sections through discussion on the talk page. (September 2009)


Support

  • Former Gnome Foundation board member Miguel de Icaza, who started the GNOME and Mono projects, showed support for the Office Open XML document format, stating “OOXML is a superb standard and yet, it has been FUDed so badly by its competitors that serious people believe that there is something fundamentally wrong with it.”
  • According to Ars Technica, Office Open XML gained a number of supporters including some who are directly affiliated with the OpenDocument format.
  • Patrick Durusau, the editor of the OpenDocument standard, has been an advocate of the standardization of Office Open XML.

Criticism of ECMA-376 1 edition

The ODF Alliance UK Action Group has stated that with OpenDocument an ISO standard for Office files already exists. Further, they argue that the Office Open XML file-format is heavily based on Microsoft's own Office applications and is thus not vendor-neutral, and that it has inconsistencies with existing ISO standards such as time and date formats and color codes.

In August 2007, the Linux Foundation published a blog post calling upon ISO National Bodies to vote “No, with comments” on OOXML. It said, ”OOXML is a direct port of a single vendor’s binary document formats. It avoids the re-use of relevant existing international standards (e.g. several cryptographic algorithms, VML, etc.). It lists a large number of “Compatibility Settings” for legacy applications (e.g. footnoteLayoutLikeWW8, autoSpaceLikeWord95, useWord97LineBreakRules, etc.) which would be difficult for other developers to implement and hardly what one would find in an aspirational, consolidated best practices document. There are literally hundreds of technical flaws that should be addressed before standardizing OOXML including continued use of binary code tied to platform specific features, propagating bugs in MS-Office into the standard, proprietary units, references to proprietary/confidential tags, unclear IP and patent rights, and much more.”

Specific criticism

  • ECMA-376 1 edition does not conform to ISO 8601:2004 "Representation of Dates and Times." It requires that implementations replicate a Lotus 1-2-3 bug that dictates that 1900 is a leap year, which in fact it isn't. Products complying with ECMA-376 would be required to use the WEEKDAY() spreadsheet function, and therefore assign incorrect dates to some days of the week, and also miscalculate the number of days between certain dates. ECMA-376 2nd edition (ISO/IEC 29500) does use 8601:2004 "Representation of Dates and Times".
  • ECMA-376 1 edition "Embedded Object Alternate Image Requests Types" and "Clipboard Format Types" in the standard refer back to Windows Metafiles or Enhanced Metafiles – each of which are proprietary formats that have hard-coded dependencies on the Windows itself. It should instead have referenced the platform neutral standard ISO/IEC 8632 "Computer Graphics Metafile".
  • Office Open XML (ISO/IEC 29500:2008 Part 4 - Transitional Migration Features) contains specific compatibility settings used when converting existing office documents to Office Open XML, most notably: autoSpaceLikeWord95, footnoteLayoutLikeWW8, lineWrapLikeWord6, mwSmallCaps, shapeLayoutLikeWW8, suppressTopSpacingWP, truncateFontHeightsLikeWP6, uiCompat97To2003, useWord2002TableStyleRules, useWord97LineBreakRules, wpJustification and wpSpaceWidth. The compatibility settings are fully documented in the ISO/IEC 29500 / Ecma 2nd edition specifications.
  • Office Open XML uses unique tag for each compatibility setting. Currently, the only application’s compatibility settings are the applications that the standard’s authors have decided to include. For other application’s compatibility settings to be added, further tag names would need to be defined in the specification, potentially creating thousands of them, each having nothing to do with interoperability.
  • Use of DrawingML and the transitional-use-only VML instead of W3C recommendation SVG. VML did not become a W3C recommendation.
  • Use of Office MathML instead of W3C recommendation MathML. MathML is a W3C recommendation for the "inclusion of mathematical expressions in Web pages" and "machine to machine communication" that has been around since about 1999. However, most mathematicians continue to use the much older TeX format as their main method for typesetting complex mathematical formulae. TeX is not an ISO standard, but is fully documented and is the de facto standard for typesetting mathematical expressions.
  • Office Open XML does not define a macro language, leaving this aspect to be application-defined.
  • The standard is long, with the version submitted to ISO comprising 6,546 pages. The need and appropriateness of such length has been questioned. Google remarks that the OpenDocument specification is 867 pages long and achieves the same goals. If ISO were to give OOXML with its 6546 pages the same level of review that other standards have seen, it would take 18 years (6576 days for 6546 pages) to achieve comparable levels of review to the existing ODF standard (871 days for 867 pages). Considering that OOXML had had a significantly shorter review period than comparable standards, Google sees "reports about inconsistencies, contradictions and missing information hardly surprising."

Adoption

Several countries have formally announced either adoption, or the evaluation of adoption of OOXML.

Belgium
Belgium's Federal Public Service for Information and Communication Technology in 2006 was evaluating the adoption of the Office Open XML format. It already then confirmed that it would consider all ISO standards to be open standards, specifically mentioning Office Open XML as such a possible future ISO standard.
Denmark
In June 2007, the Danish Ministry of Science, Technology and Innovation recommended that beginning with January 1, 2008 public authorities must support at least one of the two word processing document formats Office Open XML and ODF in all new IT solutions, where appropriate.
Germany
In Germany the Office Open XML standard is currently under observation by the governmental office for standards in public IT („Koordinierungs- und Beratungsstelle der Bundesregierung für Informationstechnik in der Bundesverwaltung“ (KBSt). The latest release of „SAGA“ (Standards and Architectures for E-Government-Applications) includes Office Open XML file formats. The standard may be used to exchange complex documents when further processing is required.
Japan
On June 29, 2007, the government of Japan published a new interoperability framework which gives preference to the procurement of products that follow open standards. On July 2 the government declared that they hold the view that formats like Office Open XML which organizations such as Ecma International and ISO had also approved was, according to them, an open standard . Also, they said that it was one of the preferences, whether the format is open, to choose which software the government shall deploy.
Lithuania
Lithuanian Standards Board has adopted the ISO/IEC 29500:2008 Office Open XML format standard as Lithuanian National standard. The decision was made by Technical Committee 4 Information Technology on March 5, 2009. The proposal to adopt the Office Open XML format standard was submitted by Lithuanian Archives Department under the Government of the Republic of Lithuania.
Norway
Norway's Ministry of Government Administration and Reform is evaluating the adoption of the Office Open XML format. The ministry put the document standard under observation in December 2007.
Sweden
Kingdom of Sweden has adopted Office Open XML as a 4 part Swedish National Standard SS-ISO/IEC 29500:2009.
Switzerland
In July 2007, the Swiss Federal Council announced adherence SAGA.ch e-Government standards mandatory for its departments as well as for cantons, cities and municipalities. The latest version of SAGA.ch includes Office Open XML file formats.
United States of America
On April 15, 2009, the ANSI-accredited INCITS organisation voted to adopt ISO/IEC 29500:2008 as an American National Standard.
The state of Massachusetts has been examining its options for implementing XML-based document processing. In early 2005, Eric Kriss, Secretary of Administration and Finance in Massachusetts, was the first government official in the United States to publicly connect open formats to a public policy purpose: "It is an overriding imperative of the American democratic system that we cannot have our public documents locked up in some kind of proprietary format, perhaps unreadable in the future, or subject to a proprietary system license that restricts access." Since 2007 Massachusetts classifies Office Open XML as "Open Format" and has amended its approved technical standards list — the Enterprise Technical Reference Model (ETRM) — to include Office Open XML. Massachusetts now formally endorses Office Open XML formats for its public records.

Application support

Main article: Office Open XML software

The Office Open XML format is used in free software and in proprietary software. This includes office suites (both traditional and web-based) and individual applications such as word-processors, spreadsheets, presentation, and data management applications.

The latest version of Office Open XML is formally known as either ISO/IEC 29500:2008 or as ECMA-376 2nd edition (December 2008). IS 29500 is subject to an active maintenance program through the committee ISO/IEC SC34 WG4, which is working on a revision for minor edits (a Corrigendum) and a revision for substantive technical issues (an Amendment)

Microsoft, whose products currently only support the version of Office Open XML originally submitted as an ECMA standard, has committed to using the ISO/IEC 29500 standard in their products and has also committed to participate in the maintenance of this standard.

Microsoft has stated that Microsoft Office 2010 will be the first version of Microsoft Office to support ISO/IEC 29500, though no release date has been announced.

On July 28, 2008 Murray Sargent, a software development engineer in the Microsoft Office team confirmed that Word 2007 will have a service pack release that enables it to read and write ISO standard Office Open XML files. However, the 2009 Service Pack 2 release did not expose any IS 29500 features in its user interface.

In a ZDNet article Alex Brown, leader of the ISO/IEC group in charge of deciding maintenance processes for any ISO/IEC 29500 Standard, stated

"I am hoping that Microsoft Office will shortly be brought into line with the 29500 specification, and will stay that way".

On March 13, 2008 Doug Mahugh, a Senior Product Manager at Microsoft specializing in Office client interoperability and the Open XML file formats confirmed that version 1.0 of the Open XML Format SDK

"will definitely be 100% compliant with the final ISO/IEC 29500 spec, including the changes accepted at the BRM".

In a ComputerWorld interview from 2008, Doug Mahugh said that

"Microsoft would continue to update the SDK to make sure that applications built with it remained compliant with an Open XML standard as changes were made in the future".

Microsoft Open XML Format SDK contains a set of managed code libraries to create and manipulate Office Open XML files programmatically. Version 1.0 was released on June 10, 2008 and incorporates the changes made to the Office Open XML specification made during the current ISO/IEC standardization process. Version 2 of the Open XML SDK supports validating Office Open XML documents against the Office Open XML schema, as well as searching in Office Open XML documents.

PowerTools for Open XML, a set of free open source cmdlets for Windows PowerShell to create and manipulate Office Open XML files at the command line or using shell scripts.

See also

References

  1. ^ Microsoft. "Register file extensions on third party servers". microsoft.com. Retrieved 2009-09-04.
  2. ^ "Freely Available Standards". ITTF (ISO). 2008-11-18.
  3. "Q&A: Microsoft Co-Sponsors Submission of Office Open XML Document Formats to Ecma International for Standardization". Microsoft. 2005-11-21.
  4. "ISO/IEC DIS 29500 receives necessary votes for approval as an International Standard". ISO. 2008-04-02.
  5. ISO/IEC (2008-11-18). "Publication of ISO/IEC 29500:2008, Information technology — Office Open XML formats". ISO. Retrieved 2008-11-19.
  6. ^ "Standard ECMA-376". Ecma-international.org. Retrieved 2009-05-19.
  7. "Microsoft's future lies somewhere beyond the Vista by Evansville Courier & Press". Courierpress.com. Retrieved 2009-05-19.
  8. "Rivals Set Their Sights on Microsoft Office: Can They Topple the Giant? - Knowledge@Wharton". Knowledge.wharton.upenn.edu. Retrieved 2009-05-19.
  9. "OOXML Implementations: A Community of One" (PDF). ODF Alliance. 2008-02-20. Retrieved 2009-05-19.
  10. ^ "Microsoft Expands List of Formats Supported in Microsoft Office". Microsoft.com. 2008-05-21. Retrieved 2009-05-19.
  11. Lai, Eric (2008-05-27). = 141&pageNumber=1 "FAQ: Office 14 and Microsoft's support for ODF". Computerworld.com. Retrieved 2009-05-19. {{cite web}}: Check |url= value (help)
  12. Andy Updegrove. "Microsoft Office 2007 to Support ODF — and not OOXML". ConsortiumInfo.org. Retrieved 2009-05-19.
  13. "How to extract information from Office files by using Office file formats and schemas". Microsoft. 2007-03-27. Retrieved 2007-07-10.
  14. "Microsoft Office Binary (doc, xls, ppt) File Formats". Microsoft. 2008-02-15. Retrieved 2008-04-05.
  15. ^ Brian Jones (2007-01-25). "History of office XML formats (1998–2006)". MSDN blogs.
  16. Telematics between Administrations Committee based on IDA expert group on open document formats (2004-05-25). "TAC approval on conclusions and recommendations on open document formats". IDABC — European eGovernment Services. Retrieved 2007-07-30.
  17. "Microsoft Co-Sponsors Submission of Office Open XML Document Formats to Ecma International for Standardization". Microsoft. 2005-11-21.
  18. "Ecma International approves Office Open XML standard" (Press release). Ecma International. December 7, 2006. Retrieved 2006-12-08.
  19. "Ecma International". Ecma International. 2009-06. {{cite web}}: Check date values in: |date= (help)
  20. "Ecma formal publications". Ecma-international.org. Retrieved 2009-05-19.
  21. Adam Farquhar (2007-05). "Office OpenXML" (PDF). Britisch Standard institute. {{cite web}}: Check date values in: |date= (help)
  22. "Massachusetts Enterprise Technical Reference Model – Version 4.0". Commonwealth of Massachusetts. 2007-08-01.
  23. http://www.itscj.ipsj.or.jp/sc34/open/1026.pdf
  24. "Office Open XML wird als weltweiter ISO/IEC-Standard anerkannt". ntz Fachzeitschrift für Informations- und Kommunikationstechnik (3–4). Berlin: VDE-Verlag: 10. 2008. ISSN 0948-728X.
  25. "India > News > Software > ISO puts Microsoft's OOXML Standard on Hold". Techtree.com. 2008-06-12. Retrieved 2009-05-19.
  26. "What is Ecma International". Ecma International.
  27. "Freely Available Standards". ISO. Retrieved 22 September 2009.
  28. "Code of Conduct in Patent Matters". Ecma International.
  29. "Microsoft Covenant Regarding Office 2003 XML Reference Schemas". Microsoft. Retrieved 2006-07-11.
  30. "2 Escape Hatches in MS's Covenant Not to Sue". Groklaw. Retrieved 2007-01-29.
  31. Berlind, David (November 28, 2005). "Top open source lawyer blesses new terms on Microsoft's XML file format". ZDNet. Retrieved 2007-01-27.
  32. "Microsoft Open Specification Promise". Microsoft. 2006-09-12. Retrieved 2007-04-22.
  33. "Licensing conditions that Microsoft offers for Office Open XML". Jtc1sc34.org. 2006-12-20. Retrieved 2009-05-19.
  34. "Microsoft Word — Responses to Comments and Perceived Contradictions.doc" (PDF). Retrieved 2009-09-16.
  35. George Ou (2007-04-27). "MS Office 2007 versus Open Office 2.2 shootout". ZDnet.com. Retrieved 2007-04-27.
  36. Jesper Lund Stocholm (2008-01-29). "Do your math — OOXML and OMML". A Mooh Point blog. Retrieved 2008-02-12.
  37. Murray Sargent (2007-06-05). "Science and Nature have difficulties with Word 2007 mathematics". MSDN blogs. Retrieved 2007-07-31.
  38. David Carlisle (2007-05-09). "XHTML and MathML from Office 2007". David Carlisle. Retrieved 2007-09-20.
  39. Wouter Van Vugt (2008-11-01). "Open XML Explained e-book". Openxmldeveloper.org. Retrieved 2007-09-14.
  40. Monday April 16, 2007 5:24AM by Rick Jelliffe in Technical (2007-04-16). "Why EMUs? - O'Reilly XML Blog". Oreillynet.com. Retrieved 2009-05-19.{{cite web}}: CS1 maint: numeric names: authors list (link)
  41. Tom Ngo (December 11, 2006). "Office Open XML Overview" (PDF). Ecma International. p. 6. Retrieved 2007-01-23.
  42. "ISO — You searched for "29500" in title and abstract". International Organization for Standardization. 2009-06-05.
  43. "Re-introducing on/off-values to ST-OnOff in OOXML Part 4". Retrieved 2009-09-29.
  44. "OOXML and Office 2007 Conformance: a Smoke Test". Retrieved 2009-09-29.
  45. "Minutes of the Copenhagen Meeting of ISO/IEC JTC1/SC34/WG4" (PDF). 2009-06-22. Retrieved 2009-09-29. page 15
  46. "TC 45 Office Open XML formats committee (includes participation list)". Ecma-international.org. Retrieved 2009-09-16.
  47. "Microsoft PowerPoint — BSI OOXML v1b.ppt" (PDF). Retrieved 2009-09-16.
  48. "Microsoft to standardize Office formats — CNET News". News.cnet.com. 2005-11-22. Retrieved 2009-09-16.
  49. "Meeting the Challenge: Office Open XML and PDF/A — Digital Preservation (Library of Congress)". Digitalpreservation.gov. 2008-04-02. Retrieved 2009-09-16.
  50. "GNOME Foundation Helping OOXML?". Slashdot. Retrieved 2009-05-19.
  51. "Slashdot | de Icaza calls OOXML a "Superb Standard"". Linux.slashdot.org. Retrieved 2009-05-19.
  52. ^ Paul, Ryan (2008-04-01). "Microsoft's Office Open XML now an official ISO standard". Ars Technica. Retrieved 2009-09-16.
  53. "OpenXML: A Poster Child for Open Standards Development?" (PDF). Retrieved 2009-09-16.
  54. ^ "ODF/OOXML technical white paper". Free Software Magazine. 2 Mai 2007. {{cite web}}: Check date values in: |date= (help)
  55. John Cherry (14 March 2008). "OOXML — vote "No, with comments"".
  56. Kyd, Charley (October 2006). "How to Work With Dates Before 1900 in Excel". ExcelUser. Retrieved 2009-09-16.
  57. ^ "The Contradictory Nature of OOXML". ConsortiumInfo.org.
  58. "ECMA-376 2nd edition Part 1 (3. Normative references)". Ecma-international.org. Retrieved 2009-09-16.
  59. "New set of proposed dispositions posted, including more positive changes to the Ecma Office Open XML formats – Dispositions now proposed for more than half of National Bodies' comments". Ecma-international.org. 2007-12-11. Retrieved 2009-09-16.
  60. "ODF/OOXML technical white paper — A white paper based on a technical comparison between the ODF and OOXML formats". Free Software Magazine.
  61. "ECMA-376 2nd edition Part 4 (paragraph 9.7.3)". Ecma-international.org. Retrieved 2009-09-16.
  62. "ODF/OOXML technical white paper — A white paper based on a technical comparison between the ODF and OOXML formats". Free Software Magazine. ... OOXML chose this route. Rather than create an application-definable configuration tag there is a unique tag for each setting ... Currently, the only application's unique settings that are catered for are the applications that the standard's authors have decided to include, ... For other applications to be added, further tag names would need to be defined in the specification, potentially creating thousands of them, each having nothing to do with interoperability ...
  63. "The X Factor". reddevnews.com. 2007. {{cite web}}: Unknown parameter |month= ignored (help)
  64. "VML — the Vector Markup Language". W3.org. 1998-05-13. Retrieved 2009-05-19.
  65. "Microsoft Office dumped by Science and Nature". ZDNet Australia. 18 June 2007.
  66. "MathML W3C standard". W3.
  67. "I have seen the second complete copy of OOXML specification proposal".
  68. ^ "Google's Position on OOXML as a Proposed ISO Standard" (PDF). Google. 2008-02. {{cite web}}: Check date values in: |date= (help)
  69. "OOXML: What's the big deal?". IBM. 2008-02-19.
  70. bnitz (2007-01-23). "Whoah! Microsoft's 6000 page OOXML standard ECMA fast-tracked by Feb 5?". Sun Microsystems.
  71. "The Contradictory Nature of OOXML". ConsortiumInfo.org. 2007-01-17.
  72. "Microsoft accused of rigging OOXML votes". zdnet. 2007-08-30.
  73. "Developers warned over OOXML patent risk". zdnet. 2008-02-15.
  74. Bob Sutor (IBM) (2007-05-24). "In case you were wondering how big 6000 pages is: OpenXML/OOXML/whatever".
  75. "FFII opposes Fasttrack adoption of Microsoft OOXML format as ISO standard". Foundation for a Free Information Infrastructure. 2007-01-29.
  76. Rob Weir (2009-01-29). "The Formula for Failure".
  77. "Microsoft's Office Open XML now an official ISO standard". arstechnica. 2008-04-01.
  78. "OOXML Will Pass Amid Massive Irregularities". slashdot. 2008-03-31.
  79. "Stop the broken Microsoft proprietary OOXML format from becoming an ISO standard". Opera Software. 2007-09-15.
  80. "FED13321-docsPeterStrickx.indd" (PDF). Retrieved 2009-09-16.
  81. "Bilag 8 – Sammenligning af rapporten om "Estimering af omkostningerne ved indførelse af Office Open XML (OOXML) og Open Document Format (ODF) i centraladministrationen" i forhold til de spørgsmål, der skal belyses i de økonomiske konsekvensvurderinger, jf. rapporten om "Anvendelse af åbne standarder i det offentlige"". Vtu.dk. Retrieved 2009-05-19.
  82. "SAGA 4.0" (PDF). Retrieved 2009-09-16.
  83. Gardner, David (2007-07-10). "Office Software Formats Battle Moves To Asia". Information Week. Retrieved 2007-07-27.
  84. "Interoperability framework for information systems (in Japanese)". Ministry of Economy, Trade and Industry, Japan. 2007-06-29. Retrieved 2007-07-27.
  85. "Latest News". Open XML Community. Retrieved 2009-05-19.
  86. "Referansekatalog for IT-standarder i offentlig sektor". regjeringen.no. Retrieved 2009-05-19.
  87. "SS-ISO/IEC 29500-1:2009". Sis.se. 2009-01-19. Retrieved 2009-09-16.
  88. "SS-ISO/IEC 29500-2:2009". Sis.se. Retrieved 2009-09-16.
  89. "SS-ISO/IEC 29500-3:2009". Sis.se. Retrieved 2009-09-16.
  90. "SS-ISO/IEC 29500-4:2009". Sis.se. Retrieved 2009-09-16.
  91. "eCH — Downloads | Standards/Normes | eCH-0014 d SAGA.ch". Ech.ch. Retrieved 2009-05-19.
  92. "INCITS Letter Ballot 3025". INCITS. 2009-04-15.
  93. "Informal comments on Open Formats". Web.archive.org. Retrieved 2009-09-16.
  94. "Cover Pages: Major Revision of Massachusetts Enterprise Technical Reference Model (ETRM)". Xml.coverpages.org. Retrieved 2009-05-19.
  95. "The actual work we did in Prague".
  96. "An Open Letter from Chris Capossela, Senior Vice President, Microsoft Office". Retrieved 2008-05-26. ... we are committed to supporting the Open XML specification that is approved by ISO/IEC in our products. ... We are committed to the healthy maintenance of the standard once ratification takes place so that it will continue to be useful and relevant to the rapidly growing number of implementers and users around the world.
  97. "Murray Sargent: Math in Office : Improved MathML support in Word 2007". Blogs.msdn.com. 2008-07-28. Retrieved 2009-05-19.
  98. "Microsoft Office 2007 fails OOXML conformance test".
  99. by dmahugh. "Open XML SDK roadmap". Blogs.msdn.com. Retrieved 2009-05-19.
  100. Lai, Eric (2008-03-12). "Microsoft releasing OOXML SDK". Computerworld.com. Retrieved 2009-05-19.
  101. Microsoft (2008-06-10). "Open XML Format SDK". Microsoft.
  102. Erika Ehrli (2008-06-10). "Announcing the Open XML Format SDK 1.0". Computerworld.
  103. ^ Doug Mahugh. "Open XML SDK roadmap". MSDN Blogs. Retrieved 2008-03-23.
  104. Eric Lai (2008-03-12). "Microsoft releasing OOXML SDK". Microsoft.
  105. "PowerTools for Open XML". Blogs.msdn.com. Retrieved 2009-05-19.
  106. "PowerTools license". Codeplex.com. Retrieved 2009-05-19.
  107. "Microsoft Public License (Ms-PL)". Opensource.org. 2007-10-15. Retrieved 2009-05-19.

External links

Standards of Ecma International
Application interfaces
File systems (tape)
File systems (disk)
Graphics
Programming languages
Radio link interfaces
Other
List of Ecma standards (1961 – present)
ISO standards by standard number
List of ISO standardsISO romanizationsIEC standards
1–9999
10000–19999
20000–29999
30000+
Categories: