XML News in 2009

The W3C has posted a new working draft of HTML 5. "This specification defines the 5th major revision of the core language of the World Wide Web: the Hypertext Markup Language (HTML). In this version, new features are introduced to help Web application authors, new elements are introduced based on research into prevailing authoring practices, and special attention has been given to defining clear conformance criteria for user agents in an effort to improve interoperability." There's also a new draft of HTML 5 differences from HTML 4.The latter contains a convenient list of changes since the January 22 draft:

The data member of ImageData objects has been changed from an array to a CanvasPixelArray object.
Shadows are now required from implementations of the canvas element and its API.
Security model for canvas is clarified.
Various changes to the processing model of canvas have been made in response to implementation and author feedback. E.g. clarifying what happens when NaN and Infinity are passed and fixing the definitions of arc() and arcTo().
innerHTML in XML was slightly changed to improve round-tripping.
The toDataURL() method on the canvas element now supports setting a quality level when the media type argument is image/jpeg.
The poster attribute of the video element now affects its intrinsic dimensions.
The behavior of the type attribute of the link element has been clarified.
Sniffing is now allowed for link when the expected type is an image.
A section on URLs is introduced dealing with how URL values are to be interpreted and what exactly authors are required to do. Every feature of the specification that uses URLs has been reworded to take the new URL section into account.
It is now explicit that the href attribute of the base element does not depend on xml:base.
It is now defined what the behavior should be when the base URL changes.
URL decomposition DOM attributes are now more aligned with Internet Explorer.
The xmlns attribute with the value http://www.w3.org/1999/xhtml is now allowed on all HTML elements.
data-* attributes and custom attributes on the embed element now have to match the XML Name production and cannot contain a colon.
Web Socket API is introduced for bidirectional communication with a server. It is currently limited to text messages.

The default value of volume on media elements is now 1.0 rather than 0.5.
event-source was renamed to eventsource because no other HTML element uses a hyphen.
A message channel API has been introduced augmenting postMessage().
A new element named bb has been added. It represents a user agent command that the user can invoke.
The addCueRange() method on media elements has been modified to take an identifier which is exposed in the callbacks.
It is now defined how to mutate a DOM into an infoset.
The parent attribute of the Window object is now defined.
The embed element is defined to do extension sniffing for compatibilty with servers that deliver Flash as text/plain. (This is marked as an issue in the specification to figure out if there is a better way to make this work.)

The embed can now be used without its src attribute.
getElementsByClassName() is defined to be ASCII case-insensitive in quirks mode for consistency with CSS.
In HTML documents localName no longer returns the node name in uppercase.
data-* attributes are defined to be always lowercase.
The opener attribute of the Window object is not to be present when the page was opened from a link with target="_blank" and rel="noreferrer".
The top attribute of the Window object is now defined.
The a element now allows nested flow content, but not nested interactive content.
It is now defined what the header element means to document summaries and table of contents.
What it means to fetch a resource is now defined.
Patterns are now required for the canvas element.
The autosubmit attribute has been removed from the menu element.
Support for outerHTML and insertAdjacentHTML() has been added.
xml:lang is now allowed in HTML when lang is also specified and they have the same value. In XML lang is allowed if xml:lang is also specified and they have the same value.
The frameElement attribute of the Window object is now defined.
An event loop and task queue is now defined detailing script execution and events. All features have been updated to be defined in terms of this mechanism.
If the alt attribute is omitted a title attribute, an enclosing figure element with a legend element descendant, or an enclosing section with an associated heading must be present.
The irrelevant attribute has been renamed to hidden.
The definitionURL attribute of MathML is now properly supported. Previously it would have ended up being all lowercase during parsing.
User agents must treat US-ASCII as Windows-1252 for compatibility reasons.
An alternative syntax for the DOCTYPE is allowed for compatibility with some XML tools.
Data templates have been removed (consisted of the datatemplate, rule and nest elements).
The media elements now support just a single loop attribute.
The load() method on media elements has been redefined as asynchronous. It also tries out files in turn now rather than just looking at the type attribute of the source element.
A new member called canPlayType() has been added to the media elements.
The totalBytes and bufferedBytes attributes have been removed from the media elements.
The Location object gained a resolveURL() method.
The q element has changed again. Punctation is to be provided by the user agent again.
Various changes were made to the HTML parser algorithm to be more in line with the behavior Web sites require.
The unload and beforeunload events are now defined.
The IDL blocks in the specification have been revamped to be in line with the upcoming Web IDL specification.
Table headers can now have headers. User agents are required to support a headers attribute pointing to a td or th element, but authors are required to only let them point to th elements.
Interested parties can now register new http-equiv values.
When the meta element has a charset attribute it must occur within the first 512 bytes.
The StorageEvent object now has a storageArea attribute.
It is now defined how HTML is to be used within the SVG foreignObject element.
The notification API has been dropped.
How [[Get]] works for the HTMLDocument and Window objects is now defined.
The Window object gained the locationbar, menubar, personalbar, scrollbars, statusbar and toolbar attributes giving information about the user interface.
The application cache section has been significantly revised and updated.
document.domain now relies on the Public Suffix List. [PSL]
A non-normative rendering section has been added that describes user agent rendering rules for both obsolete and conforming elements.
A normative section has been added that defines when certain selectors as defined in the Selectors and the CSS3 Basic User Interface Module match HTML elements. [SELECTORS] [CSS-UI]

Web Forms 2.0, previously a standalone specification, has been fully integrated into HTML 5 since last publication. The following changes were made to the forms chapter:

Support for XML submission has been removed.
Support for form filling has been removed.
Support for filling of the select and datalist elements through the data attribute has been removed.
Support for associating a field with multiple forms has been removed. A field can still be associated with a form it is not nested in through the form attribute.
The dispatchFormInput() and dispatchFormChange() methods have been removed.
Repetition templates have been removed.
The inputmode attribute has been removed.
The input element in the File Upload state no longer supports the min and max attributes.
The allow attribute on input elements in the File Upload state is no longer authorative.
The pattern and accept attributes for textarea have been removed.
RFC 3106 is no longer explicitly supported.
The submit() method now just submits, it no longer ensures the form controls are valid.
The input element in the Range state now defaults to the middle, rather than the minimum value.
The size attribute on the input element is now conforming (rather than deprecated).
object elements now partake in form submission.
The type attribute of the input element gained the values color and search.
The input element gained a multiple attribute which allows for either multiple e-mails or multiple files to be uploaded depending on the value of the type attribute.
The input, button and form elements now have a novalidate attribute to indicate that the form fields should not be required to have valid values upon submission.
When the label element contains an input it may still have a for attribute as long as it points to the input element it contains.
The input element now has an indeterminate DOM attribute.
The input element gained a placeholder attribute.

The W3C XML Schema Working Group has posted possibly the third last call working drafts of XML Schema 1.1 Part 1: Structures and XML Schema Definition Language (XSD) 1.1 Part 2: Datatypes. According to the structures draft,

The major revisions since the previous public working draft include the following:

Attribute declarations can now be marked {inheritable} (see Inherited Attributes (§3.3.5.6)) and the values of inherited attributes are accessible in the XDM data model instance constructed for checking assertions (see Assertions (§3.13)) and for conditional type assignment (see Type Alternatives (§3.12)).
Among other consequences, this allows conditional type assignment and assertions to be sensitive to the inherited value of the xml:lang attribute and thus to the language of the element's contents. This change was introduced to resolve issue 5003 Applicability of <alternative> element to xml:lang, raised by the W3C Internationalization Core Working Group.
Three priority feedback requests have been added in this draft (and those in previous drafts have for the most part been retained). They concern:
content-model subsumption involving all-groups (see issue 5293 Subsumption)
detection of certain schema errors at instance-validation time (see issue 5293 Subsumption).
the interaction of block and substitution groups (see issue 6382 Substitution group and "block").

The <override> facility has been revised to make the new declarations and definitions in the <override> element override matching declarations and definitions not only in the schema document directly pointed to by the schemaLocation attribute, but also in other schema documents referred to using <include> or <override> in that schema document.
This allows the use of <override> even in cases where the declarations and definitions to be overridden are spread widely across a tightly interconnected set of schema documents, and thus makes it more widely useful.
The discussion of <include> and <override> has been revised to eliminate an ambiguity in earlier versions of this spec regarding the meaning of cyclic dependencies formed by use of <include> and <override>: such cyclic dependencies are now clearly allowed and have a well defined meaning.
Schema processors are now explicitly recommended to provide a user option to control whether the processor attempts to dereference schema locations indicated in schemaLocation attributes in the instance document being validated; this resolves issue 5476 xsi:schemaLocation should be a hint, should be MAY not SHOULD.
The semantics of attributes vc:typeAvailable and vc:typeUnavailable have been revised to make them complementary and to ensure that if two constructs in the input to conditional inclusion processing are marked vc:typeAvailable="A B C" and vc:typeUnavailable="A B C", respectively, then exactly one of the two will be chosen. This change resolves issue 5905 vc:typeAvailable and vc:typeUnavailable.
Schema processors are now encouraged to issue warnings for unrecognized attributes in the vc namespace. Attributes may be added to that namespace in the future, so unrecognized attributes are not errors, but until that time, the presence of unrecognized attributes is likely to indicate a problem in the input. This change resolves issue 5904 Unknown attributes in vc namespace.
The constraints on the XML representation of schemas have been reformulated to allow them to be checked on schema documents in isolation, rather than requiring knowledge of the rest of the schema into which they will be embedded. The consequence is that some errors are caught not in the XML representation constraints but by having the XML mapping rules generate faulty components so that the error can be detected at the component level. These changes resolve issue 6235 Restriction from xs:anySimpleType.
The formal grammar for the XPath expressions used in conditional type assignment has been revised to allow both cast expressions and constructor functions, and to resolve a formal ambiguity in the earlier versions of the grammar; these changes resolve issue 5907 Problem with BNF for type alternatives.
The <schema> element is now declared with open content in the schema for schema documents. This change addressess issue 5930 defaultOpenContent in the S4SD.
The setting blockDefault="http://www.w3.org/TR/2009/WD-xmlschema11-1-20090130/#all" has been removed from the schema for schema documents; this change resolves issue 6120 Reconsider blockDefault=http://www.w3.org/TR/2009/WD-xmlschema11-1-20090130/#all.
Skip wildcards are now excluded from the Element Declarations Consistent (§3.8.6.3) constraint, and that constraint now also takes open content into account; these changes resolve issue 5940 Element Declarations Consistent.
Annotations given in the XML form of identity-constraint declarations with ref attributes are now retained in the ·post-schema-validation infoset· form of the containing element declaration. This change resolves issue 6144 annotation on IDC with a 'ref' attribute is lost.
All wildcard unions are now expressible, and wildcard union is used to combine multiple attribute wildcards, rather than wildcard intersection; this change resolves issue 6163 3.10.6.3 Attribute Wildcard Union .
Namespace fixup is now explicitly required in some places where it is needed but was not mentioned before; these changes resolve issue 6445 Namespace fixup and default namespace and issue 6465 Namespace fixup and inherited attributes.
The term "context-determined-declaration" has been replaced with the term ·locally declared type·; this resolves issue 4690 Editorial: 'context-determined declarations' needs more work.
The namespace prefixes used to refer to well known namespaces have been changed and are now more consistent; this resolves issue 4316 Make sure namespace prefixes are used appropriately throughout structures.
Numerous small changes were made in the interests of clarity, completeness, consistency, and precision, and to correct typographical errors.

In the datatypes spec,

The major changes since version 1.0 include:

Support for XML 1.1 has been added. It is now implementation defined whether datatypes dependent on definitions in [XML] and [Namespaces in XML] use the definitions as found in version 1.1 or version 1.0 of those specifications.
A new primitive decimal type has been defined, which retains information about the precision of the value. This type is aligned with the floating-point decimal types which will be part of the next edition of IEEE 754.
In order to align this specification with those being prepared by the XSL and XML Query Working Groups, a new datatype named anyAtomicType which serves as the base type definition for all primitive atomic datatypes has been introduced.
The conceptual model of the date- and time-related types has been defined more formally.
A more formal treatment of the fundamental facets of the primitive datatypes has been adopted.
More formal definitions of the lexical space of most types have been provided, with detailed descriptions of the mappings from lexical representation to value and from value to ·canonical representation·.
The validation rule Datatype Valid (§4.1.4) has been recast in more declarative form. A paraphrase of the constraint in procedural terms, which corrects some errors in the previous versions of this document, has been added as a note.
The rules governing partial implementations of infinite datatypes have been clarified.
Various changes have been made in order to align the relevant parts of this specification more closely with other relevant specifications, including especially the corresponding sections of [XSD 1.1 Part 1: Structures].
Changes since the previous public Working Draft include the following:

At the suggestion of the W3C OWL Working Group, a explicitTimezone facet has been added to allow date/time datatypes to be restricted by requiring or forbidding an explicit time zone offset from UTC, instead of making it optional. The dateTimeStamp datatype has been defined using this facet.
At the suggestion of the W3C Internationalization Core Working Group, most references to "time zone" have been replaced with references to "time zone offset"; this resolves issue 4642 Terminology: zone offset versus time zone.
The reference to the Unicode Database [Unicode Database] has been updated from version 4.1.0 to version 5.1.0, at the suggestion of the W3C Internationalization Core Working Group
Owing to an editorial error, the previous Working Draft had an error in the definition of the lexical spaces for float and double, which excluded the literal '+INF' from their ·lexical spaces·. This error has been corrected.
References to various other specifications have been updated.

Comments are due by February 20.

2009 XML News

1.1 Full-Text Search and XML