Symbols[ Top ]
( ) (parentheses) in element declarations, 39
( ) parentheses in element declarations, 39
& (ampersand)
entity reference for (&), 49
escaping in element character data, 20
in CDATA sections, 21
<? and ?>, delimiting processing instructions, 22
<!-- and -->, delimiting X ML comments, 22
< (angle bracket, left)
escaping in element character data, 20
' (apostrophe)
entity reference for ('), 20, 49
* (asterisk)
CSS universal selector, 224
node test, 463
suffix for element names, 38, 383
XPath arithmetic operator, 176
XPath wildcard, 169
@ (at sign), selecting an attribute, 461
]]> (CDATA end delimiter), 22
: (colon)
in XML namespaces, 64
:: (colon, double), in unabbreviated location paths, 173
.. (double period), selecting parent element, 460
= (equals sign)
separating attribute name-value pairs, 16
= (equals sign), CSS attribute value selector, 225
/ (forward slash)
//, selecting from context node descendants, 171
building compound location paths, 460
combining location steps with, 170
root location path, 166, 460
> (greater-than sign)
entity reference for (>), 20, 49
escaping, need for, 21
> (greater-than sign), CSS child element selector, 224
- (hyphen)
in XML names, 19
< (less-than sign)
entity reference for (<), 49
in CDATA sections, 21
- (minus sign), XPath arithmetic operator, 176
% (percent sign)
in parameter entity references, 54
. (period)
.. (double period), selecting parent element, 171
in XML names, 19
selecting context node, 171
. (period), selecting context node, 460
+ (plus sign)
CSS sibling element selector, 224
suffix for element names, 38, 383
XPath arithmetic operator, 176
? (question mark), suffix for element names, 38, 383
" (quotation marks, double)
enclosing values in attribute name-value pairs, 16
entity reference for straight ("), 20, 49
' (quotation marks, single)
enclosing values in attribute name-value pairs, 16
entity reference for straight ('), 20, 49
# (sharp sign), CSS attribute selector, 225
@ sign
attributes, selecting in XPath location steps, 167
~= (tilde equals sign), CSS attribute selector, 225
_ (underscore)
in XML names, 19
|= (vertical bar equals sign), CSS attribute selector, 225
| (vertical bar), multiple XPath matches, 170
@* (XPath wildcard), 169
[,��,��] (square brackets), CSS attribute selector, 224
A[ Top ]
abbreviated location paths, 173, 459
absolute location paths, 166, 460
building compound location paths, 171
id( ) function and, 180
absolute units of length (CSS properties), 229
alternate pseudo-attribute, 120
alternate pseudoattribute, 223
& (ampersand) entity reference
HTML 4.0 entity sets and, 606
XML specification for, 374
ancestor axis, 174, 462
ancestor-or-self axis, 174, 462
annotation elements in schemas, 282
anonymous types, 289
Antenna House XSL Formatter, 243
ANY element type, 42
Apache web server
MIME types for XML, 72
Apache XML Project
Cocoon (XSLT processor), 147
FOP (formatting program), 242
Xalan XSLT processor, 147
Xerces-J (parser), 281
' (apostrophe) entity reference
HTML 4.0 entity sets and, 607
XML specification for, 374
apostrophe ('), entity reference for (&apos:), 20
appendChild(,��,��) (DOM), 565
appendData(,��,��) (DOM), 506
Apple Computer character sets, 78
applications, XML, 4
application/xml, 13
application/xml MIME type, 72
Arabic Unicode block, 618
arcs, 191-195
multiple arcs from one arc element, 192-194
role attributes, 194
title attributes, 194
arithmetic operators in XPath, 176
Armenian Unicode block, 617
Arrows Unicode block, 620
ASCII character set, 605
ATTLIST declarations, 42
default declaration for attributes, 48
xml:lang attribute, 85
Attr interface (DOM), 338
attributes for, 502
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy, 501
attribute axis, 174, 462
attribute declarations, 42-49
attribute groups and, 286
attribute type
IDREFS, 47
attribute types, 43-48
CDATA, 44
ENTITIES, 48
ENTITY, 47
enumeration, 45
ID, 45
IDREF, 46
NMTOKEN, 44
NMTOKENS, 44
NOTATION, 48
defaults, 48
schemas and, 285
attribute names
document permanence and, 106
prefixes and, 63
attribute nodes
@* wildcard match, 170
default template rule for, 154
template rules and, 155
attribute types, 383
attribute value templates, 159
attribute values
ensuring validity of, 393
external entities and, 390
no, 390
ATTRIBUTE_NODE (DOM node type) and nodeType attribute, 561, 564
attributes, 16-18, 66
child elements vs., 17
default declarations for
normal attribute values and, 397
default namespaces and, 387
deprecated, XHTML DTD legacy module, 124
in XML declarations, 24-26
encoding attribute, 25
standalone attribute, 25
version attribute, 24
location steps, 167, 461
matching multiple, 170
nodes
XPath data model and, 458
notation, 396
selecting, 224
special, 385
unique names for, 390
unprefixed, namespaces and, 66
XLink, declaring in DTD, 197
XML elements and, 502
attributes attribute (Node interface), 558
Attributes interface (SAX), 581
axes, 173, 459, 461
abbreviated and unabbreviated location paths, 174
B[ Top ]
background-color property (CSS), 232
Backus-Naur Form (BNF) grammar, 495
bare name XPointers, 202
big-endian UCS-2, 75, 80
binding prefixes to URIs, 64-67
block areas (XSL-FO), 235
flowing content into pages, 239
splitting, 235
block elements (CSS display property), 227
%Block.extra entity reference, 127
block-level items in XML documents, 91
%Block.mix; entity reference, 125
BNF (Backus-Naur Form) grammar, 495
boolean( ) function (XPath), 182
Boolean expressions in predicates, 172
Boolean functions (XPath), 182
Booleans
used by SAX features, 359
XPath expressions, 177, 458
Borden, Jonathan, 250
border-color property (CSS), 232
Bosak, Jon, 9
Bray, Tim, 9, 250
Brown University, XML Validation Form, 34
byte-order mark (Unicode), 75, 80
bytes, representing code points in, 71
C[ Top ]
C0 controls in ASCII character sets, 605
C1 controls in ISO-8859 character sets, 605
Canadian French (xml:lang attribute), 85
Canonical XML, 11, 317
Cascading Style Sheets (CSS), 218
color properties, 232
display property, 227
font properties, 229
selectors, 224
style rules for, 221
syntax of, 220
text properties, 230
transformation-based presentation vs., 108
units of length, 228
web browser support for, 248
XML documents and, 222
Cascading Style Sheets (CSS)stylesheets, 218
Cascading Style Sheets Level 1 (CSS1), 220
Cascading Style Sheets Level 2 (CSS2), 220
web browser support for, 248
Cascading Style Sheets Level 3 (CSS3), 220
Cascading Style Sheets (see CSS)
case sensitivity in XML, xv, 14
catalogs, XML, 33
<![CDATA[ and ]]> (CDATA delimiters), 21
CDATA attribute type, 44
CDATA sections, 21
attribute type, 383
in XHTML, browser handling of, 117
LexicalHandler interface and, 598
XML specification for, 375
CDATASection interface (DOM), 340
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy, 501
CDATA_SECTION_NODE (DOM node type) and nodeType attribute, 561, 564
ceiling( ) function (XPath), 183
character data, 13
comments in, 22
escaping special characters with references, 20
parsed, 20
character data characters (Unicode), 603
character encodings, 25
autodetection of, 378
definition of, 71
Unicode, 75
character references, 81-84
for HTML 4.0 entity sets, 607
predefined, 374
in XHTML, 117
XML specification for, 390
character sets, 25, 601
ASCII, 605
converting between, 79
Cp1252, 606
definition of, 71
in external parsed entities, 73
ISO, 76
metadata, 71
national standard, 78
platform-dependent, 78
Cp1252, 79
MacRoman, 79
Unicode, 74
XML document default, 80
XML-defined, 74
character tables for Unicode, 603
CharacterData interface (DOM), 338
attributes for, 505
methods for, 505
charset pseudo-attribute, 119
charset pseudoattribute, 223
child axis, 174, 461
child elements, 14
attributes vs., 17
determining the number of, 38
location steps, 166, 460
metainformation about narrative documents, 91
specified in element declarations, 37
childNodes attribute (Node interface), 558
Chinese characters
using subcodes for dialects, 84
UTF-8/UTF-16 encodings, 76, 601
choices in element declarations, 39
enclosed in parentheses, 39
Clark, James, 9, 603
cloneNode(,��,��) (DOM), 566
Cocoon (Apache XML Project XSLT processor), 147
code points, 71
unassigned, 604
:(colon), use within names, 373
color property (CSS), 232
command-line programs
for document validation, 35
command-line XSLT processors, 147
comment( ) location step, 169
Comment interface (DOM), 339, 509
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy, 501
comment nodes
default template rule for, 156
XPath data model and, 458
comment(,��,��)
node test, 463
COMMENT_NODE (DOM node type) and nodeType attribute, 561
comments
DTDs, making sense of, 57
IGNORE directive acting as, 56
LexicalHandler interface and, 598
loopholes in XML specification, 317
vs. annotations in schemas, 282
in XML documents, 22
communications protocols and XML, 263
comparison operators in XPath, 177
complex types, 289
deriving
by extension, 304
by restriction, 305
occurrence constraints, 291
concat( ) function (XPath), 179
conditional inclusion, declarations, 56
conditional sections, 385
parameter entity nesting and, 398
constraints
namespace, 399
validity, 392
well-formedness, 388
container nodes, 205
contains( ) function (XPath), 181
content, element
any type, 300
empty elements, 291
mixed, 298
simple, 293
content nodes (DOM Core), 335
content (of elements), 13
escaping special characters in, 20
mixed content, 15
content specifications (in element declarations), 37
ContentHandler interface (SAX), 351, 581
methods called by parser, 352
overriding methods in, 358
Content-Type field (HTTP header), 72
context node, 165, 166, 461
changing, using here(,��,��) and origin(,��,��), 208
selecting from all descendants with //, 171
selecting, using a period (.), 460
selecting with . (period), 171
unabbreviated location paths, 174
Coptic Unicode block, 616
count( ) function (XPath), 179
country codes, 85
Cp1252 character set, 79, 606
converting to UTF-8 encoding, 80
createAttribute(,��,��) (DOM), 513
createAttributeNS(,��,��) (DOM), 514
createCDATASection(,��,��) (DOM), 515
createComment(,��,��) (DOM), 515
createDocument(,��,��) (DOM), 534
createDocumentFragment(,��,��) (DOM), 516, 521
createDocumentType(,��,��) (DOM), 535
createElement(,��,��) (DOM), 516
createElementNS(,��,��) (DOM), 516
createEntityReference(,��,��) (DOM), 517
createProcessingInstruction(,��,��) (DOM), 518
createTextNode(,��,��) (DOM), 518
CSS (Cascading Style Sheets), 9
Level 1 (CSS1), 118
Level 2 (CSS2), 118
XHTML DTD style attribute module, 124
CSS1 (Cascading Style Sheets Level 1), 220
CSS2 (Cascading Style Sheets Level 2), 220
web browser support for, 248
CSS3 (Cascading Style Sheets Level 3), 220
Currency Symbols Unicode block, 619
Cyrillic Unicode block, 617
D[ Top ]
data attribute
CharacterData interface, 505
ProcessingInstruction interface, 574
data format, XML, 261
data typing, in XML documents, 92
database, XML vs., 6
decimal character references, 81
decimal format patterns, 478, 495
declarations, 24-26
attribute, 42-49
commenting out, 56
DeclHandler interface and, 596
default, 397
document type, 32
element, 30, 37-42
element type, 394
encoding, 72, 378
inside XML declaration, 25
external
including, 302
modifying, 303
general entity, 49, 380
notation
names must be unique, 399
unparsed entities and, 399
notations, 52
parameter entity references, 54
standalone, 379
standalone attribute, 25
text, 51, 73
version attribute, 24
xml:lang attribute, 85
DeclHandler interface (SAX), 597
default declarations for attributes, 48
XML specification for, 397
default namespaces, setting, 67, 387
default values, XML specification for, 384
DefaultHandler class (SAX), 588
deleteData(,��,��) (DOM), 506
dereferencing
parameter entities, 54
derived types
controlling, 306
deriving
by extension, 304
by restriction, 305
using, 305
descendant axis, 174, 461
descendant-or-self axis, 462
Devanagari Unicode block, 618
Dingbats Unicode block, 621
disciplines and professions, standard DTDs for, 59
display property (CSS), 227
div (XPath arithmetic operator), 176
DocBook, 95-98
entity sets and, 84
structure of documents, 91
doctype attribute (Document interface), 510
DOCTYPE declaration, 32
document element, 15
Document interface (DOM), 335
attributes for, 510
methods for, 512
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy, 501
document model modules, defining parameter entity references with, 133
Document Object Model (see DOM)
Document Style Sheet and Semantics Language (DSSSL), 108
document type declaration, 32
document type definition, 29
document type definitions (see DTDs)
documentElement attribute (Document interface), 510
DocumentFragment interface (DOM), 337
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy, 501
DOCUMENT_FRAGMENT_NODE (DOM node type) and nodeType attribute, 562
DOCUMENT_NODE (DOM node type) and nodeType attribute, 561, 565
documents, XML, 12, 89-109
associating RDF document with, 143
attaching links to (see XLinks)
converting to HTML for servers, 122
embedding non-XML content in, 52
narrative
DocBook, 95-98
TEI, 92-95
narrative document structures, 90
nodes in, 154
OpenOffice, 98-101
permanence of, 105
SGML's legacy, 89
transforming and presenting, 107-109
valid, 4
well-formed, 4
WordprocessingML, 101-105
DocumentType interface (DOM)
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy, 501
DOCUMENT_TYPE_NODE (DOM node type) and nodeType attribute, 561, 565
DOM Core
content nodes, 335
structural nodes, 333
structure of, 328
DOM (Document Object Model), xii, 10, 325
document structure, 332
generic vs. specific interfaces, 328
IDL (Interface Definition Language) notation and, 326
Node interface, 500
object hierarchy, 501
parsing a document with, 344
reference, 500
sample application, 345
strengths/weaknesses, 327
TrAX and, 498
DOM Level 1, 500
DOM Level 3, 345
DOMException interface, 531
DOMImplementation interface, 340
DOMSTRING_SIZE_ERR exception (DOM), 531
DSSSL (Document Style Sheet and Semantics Language), 10, 108
DTDHandler interface (SAX), 582
DTDs, 5, 28-59
ATTLIST declarations of xml:lang attribute, 85
attribute declarations, 42-49
attribute types, 43-48
defaults, 48
conditional inclusion of declarations, 56
document permanence and, 106
element declarations, 37-42
ANY keyword, 42
child elements, 37
choices, 39
empty elements, 41
mixed content, 41
number of children, 38
parentheses in, 39
#PCDATA, 37
sequences, 37
external DTD subsets
text declarations in, 73
general entities
external, parsed, 50-52
general entity declarations, 49
namespaces and, 69
narrative XML documents, 58, 90, 92
parameter entities, 53-56
syntax of, 54
record-like documents, 10, 57
standalone attribute in XML declaration, 25
standard, locating, 59
TEI (Text Encoding Initiative), 92
validating XML documents, 279
validation, 29-36
document type declaration, 32
example DTD, 29-32
internal subsets, 33
validating a document, 34
for XHTML, 115
division into modules, 122-124
XLinks, 197
XML specification for, 379
Dublin Core metadata vocabulary, 61
binding prefixes to namespace URIs, 65
improving web searches, 140-143
E[ Top ]
EBNF (Extended Backus-Naur Form) grammar, 400
electronics symbols, Unicode block for, 620
element content
any type, 300
empty elements, 291
mixed, 298
simple, 293
ELEMENT declaration, 37
element declarations, 30, 37-42
abstract attribute, 307
ANY (content specification), 42
child elements, 37
choices, 39
complex types and, 289
constructing, 394
empty elements, 41
matching elements with DTD rules, 393
mixed content, 41
number of children, 38
occurrence constraints, 291
parentheses in, 39
#PCDATA, 37
schemas and, 284
sequences, 37
substitutionGroup attribute, 306
Element interface (DOM), 337
methods for, 540
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy, 501
element names
document permanence and, 106
matching, 389
in namespaces and DTDs, 69
prefixes and, 63
qualified, 387
suffixes for, 38
unqualified, 387
element nodes
default template rule for, 156
XPath data model and, 457
element type declarations
duplicates types not allowed, 394
uniqueness required for, 394
XML specification for, 382
ELEMENT_NODE (DOM node type) and nodeType attribute, 561, 564
elements, 3, 13
arc (XLinks), 191-195
attributes, 16-18
attributes and, 502
calculating the value of, 150
case sensitivity in, xv
complex types, 289
controlling placement of, 299
deprecated, XHTML DTD legacy module, 124
empty, 13
enforcing uniqueness, 307
location steps, 166, 459
matching multiple, 170
matching names, 389
minOccurs/maxOccurs attributes, 291
mixed content of, 15
in narrative-oriented documents, 91
of schemas, 413
order of traversal, changing with XSLT template, 151
overlapping not allowed, 14
parents and children, 14
root element, 15
sibling, 14
unprefixed, namespaces and, 66
XLink
xlink:type attribute, 185
XML specification for, 386
empty elements, 13
declaring, 41
NOTATION attributes not available for, 397
schemas and, 291
web browser problems with, 116
XML specification for, 386
encoding declaration, 25, 72
XML specification for, 378
encodings, list of, 74
ending resource, 184
enterprise applications and XML, 262
entities
DTD fragments that define, 83
external parsed
text declarations and, 73
external unparsed
declaring notations, 399
general
external, parsed, 50-52
general entity declarations, 49
parameter, 53-56
syntax of, 54
entities attribute (DocumentType interface), 523
ENTITIES attribute type, 48, 383
ENTITY attribute type, 47, 383
embedding unparsed entities in documents, 53
unparsed entities in DTD and, 396
ENTITY declaration, 49, 374
absent from entity references, 391
Entity interface (DOM), 334
attributes for, 550
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy, 501
entity references, 20
attribute values and external entities, 390
CDATA secttions and, 21
LexicalHandler interface and, 598
listing of, 20
loopholes in XML specification, 317
places for allowed usage, 21
simplifying document parsing, 375
unparsed entity names not allowed, 391
web browser problems with, 117
with no entity declarations, 391
XHTML 1.1 DTD, 125
XML specification for, 375
entity sets
HTML 4.0, 606
XHTML 1.0 DTD, 83, 607
ENTITY_NODE (DOM node type) and nodeType attribute, 561
EntityReference interface (DOM), 339, 552
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy, 501
ENTITY_REFERENCE_NODE (DOM node type) and nodeType attribute, 561
EntityResolver interface (SAX), 582
enumeration attribute type, 45
assigning a value to, 397
enumeration facet element, 295, 422
ErrorHandler interface (SAX), 583
errors
ErrorHandler interface (SAX), 583
SAXException class and, 585
validity, 28, 35
well-formedness, 26, 28
EUC-JP encoding, 602
event-based parsers, 312
events (XHTML DTD module), 124
exceptions, 585
expat parser and support for character encodings, 603
expressions, XPath, 151, 176-178
Booleans, 177
numbers, 176
strings, 177
(see also location paths; location steps)
Extended Backus-Naur Form (EBNF) grammar, 400
Extended Interfaces (DOM Core), 328
extended links, 189-196
arcs, 191-195
multiple arcs from one arc element, 192-194
role attributes, 194
title attributes, 194
local resources, 195
locator elements, 190
title elements, 195
XPointers and, 202
Extensible Linking Language (see XLink)
Extensible Stylesheet Language (see XSL; XSL-FO; XSLT)
Extensible Stylesheet Language Transformations (see XSLT)
extent attributes (XSL-FO), 237
external declarations
including, 302
modifying, 303
external DTD subsets, 34
general entity references and, 83
parameter entity references and, 55
text declarations and, 73
XML specification for, 381, 389
external general entities
parsed, 50-52
attribute values and, 390
no recursion allowed, 391
SAX core feature, 595
text declarations and, 73
unparsed, 52
declaring notations, 399
external parameter entity references, 55
SAX core feature, 595
F[ Top ]
facet elements
enumeration, 295, 422
length, 295, 425
maxExclusive, 296, 426
maxInclusive, 296, 426
maxLength, 426
minExclusive, 296
minInclusive, 296, 427
pattern, 297, 428
totalDigits, 296, 435
whiteSpace, 295, 436
facets, 294
false( ) function (XPath), 177, 182
Feynman, Richard, 164, 166, 173
XML document describing, 144
filesystems
MIME types in, 72
filters
event-based parsers and, 313
SAX, 361
final attribute, 307
firstChild attribute (Node interface), 559
#FIXED attributes (XLink), 197
#FIXED default declaration, 384
explicit value must match default value, 398
#FIXED default declaration (attributes), 48
floating-point doubles (in XPath), 176
floor( ) function (XPath), 183
flow child element (XSL-FO), 239
%Flow.mix; entity reference, 125
fo, 236, 237, 239
following axis, 174, 462
following-sibling axis, 174, 462
font properties (CSS), 229
font-family property (CSS), 229
font-stretch property (CSS), 230
font-style property (CSS), 230
font-variant property (CSS), 230
font-weight property (CSS), 230
FOP (Apache XML Project formatting program), 242
format-number( ) function (XSLT), 181
formatting objects, 233
forms, XHTML DTD modules for, 123
fragment identifiers, 199
frames modules (XHTML DTD), 123
frameset DTD (XHTML), 116
French
subcodes for dialects, 84
functions
XPath, 178, 464
Boolean, 182
node-set, 179
number, 182
string, 180
XSLT, 494
Fundamental Interfaces (DOM Core), 328
G[ Top ]
general entities
external parsed, 50-52
attribute values and, 390
no recursion allowed, 391
text declarations and, 73
external unparsed
declaring notations, 399
internal, 380
general entity declarations, 49, 380
declaring, 398
general entity references
XML specification for, 376
General Punctuation Unicode block, 619
Geometric Shapes Unicode block, 621
getAttribute(,��,��) (DOM), 541
getAttributeNode(,��,��) (DOM), 542
getAttributeNodeNS(,��,��) (DOM), 542
getAttributeNS(,��,��) (DOM), 542
getElementById(,��,��) (DOM), 519
getElementsByTagName(,��,��) (DOM)
Document interface, 519
Element interface, 543
getElementsByTagNameNS(,��,��) (DOM)
Document interface, 519, 521
Element interface, 543
getException( ) (SAX), 585
getFeature(,��,��) (SAX), 360
getNamedItem(,��,��) (DOM), 554
getNamedItemNS(,��,��) (DOM), 555
getProperty(,��,��) (SAX), 360
glyph areas (XSL-FO), 235
Goldfarb, Charles F., 8, 89
Goodman, Danny, 537
grammar for XML documents, 400
Greek Extended Unicode block, 619
Greek Unicode block, 616
> (greater-than) entity reference
HTML 4.0 entity sets and, 607
XML specification for, 374
GUI programs for XML document validation, 36
H[ Top ]
hasAttribute(,��,��) (DOM)
Element interface, 543
hasAttributeNS(,��,��) (DOM)
Element interface, 544
hasAttributes(,��,��) (DOM), 567
hasChildNodes(,��,��) (DOM), 567
hasFeature(,��,��) (DOM), 536
Hebrew Unicode block, 617
hexadecimal character references, 81
hexadecimal numeric character references
< (for <), 20
hidden elements (CSS display property), 228
HIERARCHY_REQUEST_ERR exception (DOM), 532
href pseudo-attribute, 118
href pseudoattribute, 223
HTML
converting document to XHTML, 111-115
serving XML pages in, 122
SGML and, 9
structure of documents, 91
vs. XHTML, 111
(see also XHTML)
HTML 4.0 entity sets, 606
HTML forms, XHTML DTD modules, 123
HTTP headers
Content-Type field, 72
I[ Top ]
IANA (Internet Assigned Numbers Authority)
character sets registered with, 74
MIME types for XSLT stylesheets, 118
IBM character sets, 78
id( ) function (XPath), 180
ID attribute type, 45
default for, 395
in element type declarations, 383
only one per element type, 395
values must be unique, 395
IDL (Interface Definition Language) and DOM, 326, 500
IDREF attribute type, 46
in element type declarations, 383
matching IDs in document, 395
IDREFS attribute type, 47
in element type declarations, 383
IE, 120
IGNORE directive, 56
illegal characters (Unicode), 604
image modules (XHTML DTD), 123
implementation attribute (Document interface), 511
#IMPLIED default declaration, 384
#IMPLIED default declaration (attributes), 48
importNode(,��,��) (DOM), 520
inbound links, 196
INCLUDE directive, 56
INDEX_SIZE_ERR exception (DOM), 531
Indian languages, Unicode block for, 618
inline areas (XSL-FO), 235
%Inline.extra entity reference, 127
%Inline.mix; entity reference, 125
InputSource class (SAX), 585
insertBefore(,��,��) (DOM), 568
insertData(,��,��) (DOM), 507
instance documents
attributes of, 454
complex types and, 289
controlling namespace qualification, 288
derived types in, 305
empty elements and, 292
namespace issues with, 280
substitution groups and, 306
Interface Definition Language (IDL) and DOM, 326, 500
internal DTD subsets, 33
general entity references and, 83
parameter entity references and, 55, 389
XML specification for, 381
internal general entities, 380
internalSubset attribute (DocumentType interface), 524
International Phonetic Alphabet (IPA) Extensions Unicode block, 615
internationalization, 71-85
character references, 81-84
character sets
Unicode, 74
XML-defined, 74
character-set metadata, 71
converting between character sets, 79
default character set, XML documents, 80
encoding declaration, 72
ISO character sets, 76
platform-dependent character sets, 78
xml:lang attribute, 84-85
Internationalized Resource Identifier (IRI), 60
Internet Assigned Numbers Authority (IANA)
character sets registered with, 74
Internet Assigned Numbers Authority (see IANA)
Internet Explorer (IE)
direct display of XML in, 120
MSXML XSLT processor, 147
support for XHTML, 116
text/xsl MIME type, 118
XHTML, support of, 117
XSLT, lack of support for, 146
intrinsic events module (XHTML DTD), 124
INUSE_ATTRIBUTE_ERR exception (DOM), 533
invalid XML documents, 4
INVALID_ACCESS_ERR exception (DOM), 533, 534
INVALID_CHARACTER_ERR exception (DOM), 532
INVALID_MODIFICATION_ERR exception (DOM), 533
INVALID_STATE_ERR exception (DOM), 533
IPA (International Phonetic Alphabet) Extensions Unicode block, 615
IRI (Internationalized Resource Identifier), 60
ISO character sets, 76
ISO-10646-UCS-2 encoding, 75, 602
ISO-10646-UCS-4 encoding, 602
ISO-2022-JP encoding, 602
ISO-3166 (Codes for the Representation of Names of Countries), 85
ISO-639 (Codes for the Representation of Names of Languages), 84
ISO-8859 character sets, 605
ISO-8859-1 encoding, 77, 602
vs. Cp1252 Windows character set, 606
ISO-8859-10 encoding, 77, 602
ISO-8859-11 encoding, 78, 602
ISO-8859-12 encoding, 602
ISO-8859-13 encoding, 78, 602
ISO-8859-14 encoding, 78, 602
ISO-8859-15 encoding, 78, 602
ISO-8859-16 encoding, 78, 602
ISO-8859-2 encoding, 77, 602
ISO-8859-3 encoding, 77, 602
ISO-8859-4 encoding, 77
ISO-8859-5 encoding, 77, 602
ISO-8859-6 encoding, 77, 602
ISO-8859-7 encoding, 77, 602
ISO-8859-8 encoding, 77, 602
ISO-8859-9 encoding, 77, 602
isSupported(,��,��) (DOM), 570
item(,��,��) (DOM), 555, 573
J[ Top ]
Japanese characters
national standard character sets for, 602
UTF-8/UTF-16 encodings, 76, 601
Java
examples/bindings for DOM Core objects, 501
language-specific bindings for, 326
TrAX and, 498
Java applets
applet module (XHTML DTD), 122
Java Development Kit's native2ascii tool, 80
Java servlets, converting XML documents to HTML, 122
JavaScript
XHTML DTD scripting module, 124
XML parser accessed from, 120
Jelliffe, Rick, 251
K[ Top ]
keys and references in schemas, 308
Korean characters
national standard character set for, 78
UTF-8/UTF-16 encodings, 76, 601
L[ Top ]
language codes, 84
subcodes for geographic regions, 84
last( ) function (XPath), 179
lastChild attribute (Node interface), 560
Latin Extended Additional Unicode block, 619
Latin Extended-A Unicode block, 615
Latin Extended-B Unicode block, 615
Latin-0 encoding, 78, 602
Latin-1 characters entity set, 83, 607
Latin-1 encoding, 77, 602, 606
Latin-10 encoding, 78
Latin-2 encoding, 77, 602
Latin-3 encoding, 77, 602
Latin-4 encoding, 77, 602
Latin-5 encoding, 77, 602
Latin-6 encoding, 77, 602
Latin-7 encoding, 78, 602
Latin-8 encoding, 78, 602
Latin-9 encoding, 78, 602
layout-master-set element (XSL-FO), 236, 239
legacy encodings, 74
legacy module (XHTML DTD), 124
length attribute
CharacterData interface, 505
NamedNodeMap interface, 554
NodeList interface, 573
length facet element, 295, 425
length of XML names, 19
length, units of, 228
Letter-like Symbols Unicode block, 619
line areas (XSL-FO), 235
linkbases, 196
links
attaching to XML documents (see XLinks)
following, 187
XHTML DTD link module, 124
XPointers in, 201
list elements (CSS display property), 227
literal data characters, 149
literal default declaration (attributes), 49
literal result elements, 149
attribute value templates and, 159
little-endian UCS-2, 75, 81
local parts, 64
local resources (extended links), 195
local-name( ) function (XPath), 180
localName attribute (Node interface), 558, 560
location paths, 165-170, 459
abbreviated, 173
absolute, not starting from root, 180
attribute location steps, 167
axes, listing of, 174
building compound, 170, 460
child element location steps, 166
combining with location steps with |, 170
compound, 170-172
predicates, 172
root location path, 166
unabbreviated, 173-176
using predicates in, 463
location steps, 168
attribute, 167, 461
axis, node test, and predicates, 173
child element, 460
child element location steps, 166
combining with /, 170
combining with location paths, using |, 170
comment( ), 169
processing-instruction( ), 169
text( ), 169
locator elements, 190
Locator interface (SAX), 583
LocatorImpl class (SAX), 589
Lorie, Ray, 8, 89
< (entity reference for <), 20
< (less-than) entity reference
HTML 4.0 entity sets and, 606
XML specification for, 374
M[ Top ]
Macintosh systems
character sets specific to, 79
pure ASCII documents, use of, 80
MacRoman character set, 79
markup, 13
describing a document's structure, 106
in entities, 50
input vs. output languages, 107
in narrative XML documents, 91
markup languages
programming languages vs., 5
master pages for XSL-FO, laying out, 237
matching (see pattern matching)
Mathematical Operators Unicode block, 620
MathML (Mathematical Markup Language), xiii
entity sets, 84
namespaces
need for, 60
syntax for, 64
support by Mozilla, 121
maxExclusive facet element, 296, 426
maxInclusive facet element, 296, 426
media pseudo attribute, 119
media pseudoattribute, 223
Megginson, David, 10, 580
metadata
character set, 71
embedding in page with RDF, 61
WordprocessingML document, 101
metainformation
about narrative XML documents, 90
XHTML DTD module, 124
metamarkup language, 3
Microsoft
XSLT namespaces, 146
Microsoft and character sets, 78
Microsoft Internet Explorer (see Internet Explorer)
Microsoft Word
saving files as Unicode, 79
WordprocessingML, 101-105
MIME media types
for stylesheets, 223
MIME types, 72
specifying for stylesheets with type pseudo-attribute, 118
for XML applications, 13
minExclusive facet element, 296
minInclusive facet element, 296, 427
Miscellaneous Symbols Unicode block, 621
Miscellaneous Technical Unicode block, 620
%Misc.extra entity reference, 127
mixed content
declaring mixed content element, 41
in elements, 15
schemas and, 298
mod (XPath arithmetic operator), 176
modes (XSLT), 157-159
Modular XHTML, 110
authoring compound documents with, 122-139
Modularization of XHTML (W3C specification), 126
modules
XHTML DTDs, 122-124
modules, XHTML, 122
monetary symbols, Unicode block for, 619
Mosher, Ed, 8, 89
Mozilla
direct display of XML in, 121
XHTML, support for, 117
(see also Netscape)
MSXML XSLT processor, 147
Muellner, Leonard, 95
multilingual documents, 84
Murray-Rust, Peter, 10
N[ Top ]
name( ) function (XPath), 180
name attribute
Attr interface, 502
DocumentType interface, 524
name characters (Unicode), 603
name identification module (XHTML DTD), 124
name start characters (Unicode), 603
name tokens, 44
NamedNodeMap interface (DOM)
methods for, 553
names
case sensitivity in, 14
element
suffixes added to, 38
namespace names, 63
qualified
XML specification for, 387
unqualified, 387
XML, 18-20, 190
length of, 19
punctuation characters in, 18
specification for, 373
starting character of, 19
target, 22
URIs, conventions for using, 64
namespace axis, 174, 462
namespace names, 63
namespace nodes
default template rule for, 156
location paths and, 169
XPath, 165
namespace prefixes (see namespaces; prefixes)
NAMESPACE_ERR exception (DOM), 533
namespaces, xii, 9, 60-70
constraints for, 399
controlling qualification, 288
DTDs and, 69
element/attribute declarations, 382
how parsers handle, 69
nodes
XPath data model and, 458
prefixes, 288
purposes of, 60
RDDL and namespace URLs, 250
reasons for using, 60
schemas and, 280, 286, 412
setting default, 387
syntax of, 63-68
binding prefixes to URIs, 64-67
local part, 64
prefixes, 63
qualified name, 64
setting default, 67
target namespaces, 286
URIs, 67
returned by SAX parsers, 594
WordprocessingML, 101
XML specification for, 387
XPath
node set functions for, 180
XPointers and, 204
XSLT, 120, 145, 159-161
URIs for, 146
XSLT and, 472
NamespaceSupport class (SAX), 589
namespace-uri( ) function (XPath), 180
namespaceURI attribute (Node interface), 560
name-value pairs, 16
narrative XML documents, 15
attributes in, 17
DocBook, 95-98
DTDs for, 58
mixed content elements, 41
structure of, 90
TEI (Text Encoding Initiative), 92-95
national standard character sets, 78
native2ascii tool (Java Development Kit), 80
natures of related resources, 255
NDATA declaration, 52
Netscape
direct display of XML in, 121
RSS DTD, 33
support for XHTML, 116
XHTML, support of, 117
(see also Mozilla)
network transport protocols, XML vs., 6
nextSibling attribute (Node interface), 560
NMTOKEN attribute type, 44, 383
values must be legal name tokens, 396
NMTOKENS attribute type, 44, 383
values must be legal name tokens, 396
NO_DATA_ALLOWED_ERR exception (DOM), 532
node( ) (XPath wildcard), 169
Node interface (DOM), 329, 500
attributes for, 558
methods for, 565
node sets in XPath, 459
node tests in location paths, 459, 462
node tests in location steps, 173
node(,��,��)
node test, 463
nodeName attribute (Node interface), 561
nodes in XML documents
DOM and, 329
XPath and, 163
XSLT and, 154
node-set functions, XPath, 179
nodeType attribute (Node interface), 330, 561
nodeValue attribute (Node interface), 562
NO_MODIFICATION_ALLOWED_ERR exception (DOM), 532
normalize(,��,��) (DOM)
Element interface, 544
Node interface, 571
normalize-space( ) function (XPath), 182
not( ) function (XPath), 182
NOTATION attribute type, 48
not available for empty elements, 397
only one per element type, 396
notation attributes, 396
NOTATION declaration, 52
names must be unique, 399
unparsed entities and, 399
Notation interface (DOM), 334
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy for, 501
notation syntax of XML, 322
notationName attribute (Entity interface), 550
NOTATION_NODE (DOM node type) and nodeType attribute, 562, 565
notations attribute (DocumentType interface), 524
NOT_FOUND_ERR exception (DOM), 532
NOT_SUPPORTED_ERR exception (DOM), 532
number( ) function (XPath), 183
numbers in XPath, 176, 459
numeric character references
< (for <), 20
numeric functions (XPath), 182
O[ Top ]
OASIS XML catalog format, 33
object hierarchy (DOM), 501
object module (XHTML DTD), 123
object reference (DOM Level 2 Core), 501
object serialization and XML, 267
OCR-A characters, Unicode block for, 620
online DTDs, list of, 59
online validators, 34
OpenOffice, 98-101
Opera
XHTML, support of, 117
Optical Character Recognition Unicode block, 620
org.xml.sax package, 580
org.xml.sax.ext package, 596
outbound links, 196
output formats, 107
output formats, transformed XML documents, 108
ownerDocument attribute (Node interface), 563
ownerElement attribute (Attr interface), 502, 503, 510, 511, 512, 541, 550, 551, 552
P[ Top ]
param module (XHTML DTD), 123
parameter entities, 53-56
connecting XHTML DTD modules, 122
syntax of, 54
parameter entities, XML specification for, 379
parameter entity declarations
declaring, 398
parameter entity references, 54
connecting XHTML modules, 122
defining, with document model modules, 133
external DTD subsets and, 55
in DTD only, 392
including declarations in, 389
internal subsets and, 389
location in DTD is important, 376
mixing XHTML into applications, 125
for namespace prefixes, 69
proper nesting with
conditional sections, 398
markup declarations, 392
parenthesized groups, 394
redefining, 55
XLink elements and, 197
parent axis, 174, 462
parent elements, 14
selecting with .. (double period), 460
parentheses in element declarations, 39
parentNode attribute (Node interface), 563
parse(,��,��) (SAX), 350
parsed character data (#PCDATA), 31, 37
parsed general entities
attribute values and, 390
no recursion allowed, 391
text declarations and, 73
parsers, 7
checking documents for well-formedness, 26
checking the character set, 80
comparing namespace URIs, 67
determining character encoding, 72
handling of namespaces, 69
SAX, development of, 10
validating, 8
comparing document to its DTD, 28
XML-defined character sets, 74
parsers, validating, 344
associating schema documents with instance documents, 281
PassiveTeX library, 243
pattern facet element, 297, 428
pattern matching
match attribute (xsl:template), 148
text( ) node test, 154
#PCDATA keyword, 31, 37
PCs
Mac-specific character sets on, 79
PDF format for XSL-FO documents, 248
permanence of XML documents, 105
PHP, processing instructions in, 23
pictographic symbols, Unicode block for, 621
Pinard, Francois, 80
pixels (relative unit of length), 229
platform-dependent character sets, 78
points in XPointers, 204
portability
XML data format, 6
position( ) function (XPath), 179
Post Schema Validation Infoset (PSVI), 412
preceding axis, 174, 462
preceding-sibling axis, 174, 462
predefined character references, 374
predicates
in location paths, 172, 463
predicates (in location steps), 173
prefix attribute (Node interface), 563
prefixed names in XPointers, 204
prefixes
binding to URIs, 64-67
changing with parameter entity references, 69
controlling namespace qualification, 288
default namespaces and, 67, 388
element names and, 63
namespace constraints for, 399
xlink, mapping to namespace URI, 185
XPath wildcards, attaching to, 169, 170
xsl, 145
preorder traversal, 151
previousSibling attribute (Node interface), 564
primitive types for schemas, 284, 436
defining, 294
processing instruction nodes
default template for, 156
XPath data model and, 458
processing instructions, 22
parsers and, 322
robots, 143
web browser problems with, 116
xml-stylesheet, 118-120, 148
processing-instruction( ) location step, 169
ProcessingInstruction interface (DOM), 333, 574
attributes for, 574
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy, 501
processing-instruction(,��,��)
node test, 463
PROCESSING_INSTRUCTION_NODE (DOM node type) and nodeType attribute, 561
professions and disciplines, standard DTDs for, 59
programming languages
document transformation programs, 108
Turing complete, 5
XML vs., 5
programming models used with XML, 311
prolog (XML document), 32, 372
properties
controlling parser behavior for SAX, 359, 594
of Cascading Style Sheets, 227
of XSL-FO, 243
proprietary character sets, 78
proprietary schemas, document permanence and, 106
pseudoattributes for xml-stylesheet processing instruction, 118-120, 223
pseudoclass selectors, 225
pseudoelement selectors, 226
PSVI (Post Schema Validation Infoset), 412
public IDs for DTDs, 33
PUBLIC keyword (document type declaration), 33
publicId attribute
DocumentType interface, 525
Entity interface, 550
punctuation characters
allowed in XML names, 18
disallowed in XML names, 19
punctuation characters, Unicode block for, 619
purposes of related resources, 256
Q[ Top ]
qualified names, 64
defining as parameter entity references, 70
XML specification for, 387
" (straight double quotes) entity reference
HTML 4.0 entity sets and, 607
XML specification for, 374
R[ Top ]
Ragget, Dave, 115
Rahtz, Sebastian, 243
ranges in XPointers, 206
RDDL (Resource Directory Description Language), xii, 250
document example, 252
namespace URIs and, 67
namespace URLs and, 250
natures of related resources, 255
purposes of related resources, 256
syntax for, 251
RDF (Resource Description Framework), xiii, 140
binding prefixes to namespace URIs, 65
embedding metadata in a page, 61
encoding Dublin Core in, 142
recode (character-set conversion tool), 80
record-like documents
DTDs for, 57
use of XML, 10
references, 20
to other declarations in DTDs, 32
entity, listing of, 20
external parameter entity, 55
external parsed general entities, 50-52
parameter entity, 54
places for allowed usage, 21
region-start child element (XSL-FO), 237
related resources, 251
natures of, 255
purposes of, 256
relational operators in XPath, 172, 177
relative location paths, 460
building compound location paths, 171
relative units of length (CSS properties), 229
relative XPointers, 208
RELAX NG schema language, 10, 278
remote resources
third-party links, 196
remote resources (extended links), 190
removeAttribute(,��,��) (DOM), 544
removeAttributeNode(,��,��) (DOM), 545
removeAttributeNS(,��,��) (DOM), 545
removeChild(,��,��) (DOM), 571
removeNamedItem(,��,��) (DOM), 555
removeNamedItemNS(,��,��) (DOM), 556
RenderX formatting program (XEP), 243
replaceChild(,��,��) (DOM), 571
replaceData(,��,��) (DOM), 507
#REQUIRED default declaration, 384
value must be assigned to attributes, 397
#REQUIRED default declaration (attributes), 48
Resource Directory Description Language (see RDDL)
resource elements, 195
Rich Site Summary (RSS) DTD, 33
robots processing instruction, 143
role attributes for arcs, 194
root element, 15, 372
comment placement and, 22
determining name of, 392
in narrative-oriented documents, 90
processing instruction placement and, 23
TEI documents, 94
WordprocessingML document, 101
XSLT stylesheets, 145
root interface (DOM Core), 329
root location path, 166, 460
root node
default template rule for, 156
paths beginning with /, 171
root element vs. (XPath), 165
XPath data model and, 457
round( ) function (XPath), 179, 183
RSS (Rich Site Summary) DTD, 33
ruby module (XHTML DTD), 124
S[ Top ]
Safari
XHTML, support of, 117
Sanskrit language, Unicode block for, 618
SAX (Simple API for XML), xii, 10, 349
Attributes interface, 581
AttributesImpl class, 587
ContentHandler interface, 351, 581
controlling parser behavior with URIs, 359, 594
DeclHandler interface, 596, 598
DTDHandler interface, 582
EntityResolver interface, 582
ErrorHandler interface, 583
features of, 359, 594
filters, 361
LexicalHandler interface, 598
Locator interface, 583
methods called by parser in ContentHandler, 352
NamespaceSupport class, 589
nondeterministic code details, 356
org.xml.sax package, 349
org.xml.sax.ext package, 596
org.xml.sax.helpers package, 587
ParserAdapter class, 590
properties of, 359, 596
reference, 580
SAX1 vs. SAX2, 349, 580
SAXException class, 350, 585
SAXNotRecognizedException class, 360, 586, 597
SAXNotSupportedException class, 587
SAXParseException class, 586
vs. DOM, 359
SAXException class, 585
SAXNotRecognizedException class, 360, 586, 597
SAXNotSupportedException class, 587
SAXON (XSLT engine), 122
SAXON (XSLT processor), 147
SAXParseException class (SAX), 586
Scalable Vector Graphics, xiii
schema languages, 5, 10
schemas, xii, 278
abstract attribute, 307
annotation elements in, 282
attribute declarations and, 285
complex types and, 289
constructing, 280
controlling namespace qualification, 288
document organization, 281
document permanence and, 106
document validity and, 4
element content
any type, 300
empty elements, 291
mixed, 298
simple, 293
elements of, 413
enforcing element uniqueness, 307
facets and, 294
final attribute, 307
fixed attribute, 307
keys and references, 308
lists and, 297, 425
namespaces and, 280, 286, 412
reference, 412
simple types, 284, 436
defining, 294
targetNamespace attribute and, 286
unions and, 297, 435
using multiple documents, 301
vs. DTDs, 279
Schematron schema language, 10, 278
annotations and, 284
Validator for Windows, 36
scripting module (XHTML DTD), 124
search methods for the web, 139-143
Dublin Core, 140-143
RDF (Resource Description Framework), 140
robots processing instruction, 143
sections, narrative XML documents, 90
divided into subsections, 91
security
XML documents transmitted over Internet, 11
selectors for Cascading Style Sheets, 224
self axis, 174, 462
sequences in element declarations, 37
enclosed in parentheses, 39
setAttribute(,��,��) (DOM), 546
setAttributeNode(,��,��) (DOM), 547
setAttributeNodeNS(,��,��) (DOM), 548
setAttributeNS(,��,��) (DOM), 546
setFeature(,��,��) (SAX), 360
setNamedItem(,��,��) (DOM), 556
setNamedItemNS(,��,��) (DOM), 557
setProperty(,��,��) (SAX), 361
SGML
DocBook application, 95
HTML and, 9
legacy of, 89
XML and, 8
Shift_JIS encoding, 602
siblings (elements), 14
Simple API for XML (see SAX)
simple content and schemas, 293
simple links, 184
Simple Object Access Protocol (SOAP), 265
simple types for schemas, 284, 436
defining, 294
simple typesfor schemas
facet elements for, 413
SOAP (Simple Object Access Protocol), 265
source code samples and CDATA sections, 21
Spacing Modifier Letters Unicode block, 616
special attributes, 385
special characters entity set, 83, 607, 610
specified attribute (Attr interface), 503
Sperberg-McQueen, C.M., 9
splitText(,��,��) (DOM), 576
standalone attribute, 25
external DTD subsets and, 34
standalone declaration
setting value to no'', 393
XML specification for, 379
standard DTDs, 59
Standard Generalized Markup Language (see SGML)
starting resource, 184
starts-with( ) function (XPath), 179, 181
<$statrange, 184
strict DTD (XHTML), 115
string( ) function (XPath), 181
string-length( ) function (XPath), 182
strings
in XPath, 177, 459
functions for, 180
internalized (SAX feature), 594
structural nodes (DOM Core), 333
style attribute module (XHTML DTD), 124
style rules, 221
stylesheet element (XSLT), 145
stylesheets
attaching to XML documents, 23
descriptive, formatting XML documents, 108
displaying XML in web browsers, 117-122
document permanence and, 107
major stylesheet languages, 118
XHTML DTD stylesheet module, 124
subcodes (languages), 84
substring( ) (XPath method), 181
substring-after( ) function (XPath), 181
substring-before( ) function (XPath), 181
substringData(,��,��) (DOM), 508
suffixes in element declarations, 38
sum( ) function (XPath), 183
SVG (Scalable Vector Graphics), xiii
namespaces
need for, 60
syntax for, 64
prefixes and namespace URIs, 67
source code written in XHTML and enclosed in CDATA section, 21
support by Mozilla, 121
symbols entity set, 83, 607
syntax structures of XML documents, 372
SYNTAX_ERR exception (DOM), 533
systemId attribute
DocumentType interface, 525
Entity interface, 551
T[ Top ]
table elements (CSS display property), 228
tables modules (XHTML DTD), 123
tag sets, 4
tag syntax
case sensitivity, 14
tagName attribute (Element interface), 541
tags
empty-element, 386
matching, 389
overlapping not allowed, 14
start- and end-tags, 13
syntax of, 13
empty elements, 13
target, 22
target attribute (ProcessingInstruction interface), 574
target module (XHTML DTD), 124
TEI (Text Encoding Initiative), 92-95
teiHeader element, 94
template rules, built-in, 147
template rules (XSLT)
built-in, 154-157
for comment and processing instruction nodes, 156
for element and root nodes, 156
for namespace nodes, 156
for text and attribute nodes, 154
template rules (XSLT stylesheets), 144
templates
adding to XSLT stylesheets, 148
XSLT
applying, 151-153
templates, XSLT
applying different with modes, 157-159
attribute value, 159
text
presentation of, 4
text( ) location step, 169
text declarations
external entities and, 51
XML declarations vs., 73
XML specification for, 381
text element (TEI documents), 94
text elements, WordprocessingML, 102
Text Encoding Initiative (TEI), 92
Text interface (DOM), 339, 575
method for, 575
nodeName attribute and, 561
nodeValue attribute and, 562
object hierarchy, 501
text nodes
default template rule for, 154
XPath data model and, 458
text properties
Cascading Style Sheets, 230
XSL-FO, 235
text(,��,��)
node test, 463
text-align property (CSS), 231
text/css MIME type, 118
text-decoration property (CSS), 231
text-indent property (CSS), 230
TEXT_NODE (DOM node type) and nodeType attribute, 561, 564
text-transform property (CSS), 231
textual tools for XML documents, 312
text/xml, 13
text/xml MIME type, 72
text/xsl MIME type (Microsoft IE), 118
Thai characters
ISO-8859-11 encoding, 78, 602
Unicode block for, 618
third-party links, 196
Tibetan Unicode block, 618
Tidy (HTML to XHTML conversion tool), 115
title attributes for arcs, 194
title elements (extended links), 195
title pseudo-attribute, 120
title pseudoattribute, 223
Tobin, Richard, 35
Topologi Schematron Validator for Windows, 36
totalDigits facet element, 296, 435
transform element (XSLT), 145
transformations, XSL, 144
transitional DTD (XHTML), 116
traversal
Node interface and, 332
TrAX (Transformations API for XML), 498
tree models for XML documents, 314
tree structure, XML documents
narrative documents, 90
true( ) function (XPath), 177, 182
Turing, Alan, 29, 57, 164
JPEG image of, 52
XML document describing, 144
Turing complete programming languages, 5
type derivation, 285
type pseudo-attribute, 118
type pseudoattribute, 223
U[ Top ]
UCS-2 encoding, 75, 80
UCS-4 encoding, 81
unabbreviated location paths, 173-176
unassigned code points (Unicode), 604
Unicode
encoding attribute in XML declaration, 25
evolution of, XML and, 11
UCS-2 encoding, 80
UTF-16 or UTF-8 format for all XML documents, 79
UTF-8 encoding, 81
converting Cp1252 encoding to, 80
Unicode blocks, 614
Arabic, 618
Armenian, 617
Arrows, 620
Currency Symbols, 619
Cyrillic, 617
Devanagari, 618
General Punctuation, 619
Geometric Shapes, 621
Greek Extended, 619
Greek/Coptic, 616
Hebrew, 617
IPA Extensions, 615
Latin Extended Additional, 619
Latin Extended-A, 615
Latin Extended-B, 615
Letter-like Symbols, 619
Mathematical Operators, 620
Miscellaneous Symbols, 621
Miscellaneous Technical, 620
Optical Character Recognition, 620
Spacing Modifier Letters, 616
Thai, 618
Tibetan, 618
Unicode character set, 71, 74
character references, 81-84
encodings of, 71
UCS-2 and UTF-16 encodings, 75
UTF-8 encoding, 76
Unicode characters
character references and, 374
character set names, 601
character tables for, 603
units of length (CSS), 228
unparsed entities and notations, 52
unparsed general entities, 323
declaring notations, 399
not allowed in entity references, 391
unqualified names, XML specification for, 387
URIs (Uniform Resource Identifiers)
assigning elements to, 63
binding prefixes to, 64-67
controlling parser behavior for SAX, 359, 594
conventions for using with XML names, 64
identifying starting and ending resources for links, 184
namespace, 67
XSLT processors and, 146
namespace prefixes, mapping to, 60
XPointer fragment identifier syntax, xii
URLs
absolute or relative for DTDs, 32
base module (XHTML DTD), 124
URLs (Uniform Resource Locators) and XPointers, 199
UTF-16 encoding, 76, 601
encoding declaration and, 378
omitting encoding declaration, 73
required handling by XML parsers, 74
UTF-8 encoding, 25, 81, 601
encoding declaration and, 378
internationalization and, 76
omitting encoding declaration, 73
required handling by XML parsers, 74
V[ Top ]
valid XML documents, 4
XHTML document, 113-115
validating parsers, 8, 344
associating schema documents with instance documents, 281
comparing document to its DTD, 28
validation, 8, 29-36
document type declaration, 32
example DTD, 29-32
internal DTD subsets, 33
validating a document, 34
validators
online, 34
validity constraints, 392
validity errors, 8, 28, 35
value attribute (Attr interface), 503
value of an element, 150
VBScript
XHTML DTD scripting module, 124
XML parser accessed from, 120
version attribute, 24
version information attribute, XML specification for, 378
Vietnamese language, Unicode block most useful for, 619
W[ Top ]
W3C
entity sets, 83
recommended form of XHTML, 115
standard XML DTDs, 59
styling XML documents, 9
SVG standard, xiii
templates for
defining the namespace URI and prefix names, 129
importing standard modules, 136
naming attributes and elements, 131
XML Information Set, 11
XML Schema Language, 5, 10
Walsh, Norman, 95
w:body element, 101
web browsers
CSS stylesheets in, 222, 248
direct display of XML in, 117-122
alternative approaches, 122
Internet Explorer, 120
xml-stylesheet processing instruction, 118-120
document validation and, 34
link behavior in, 186
linkbases and, 196
support for XHTML, 116
XLinks support, 184
web search methods, 139-143
Dublin Core, 140-143
RDF (Resource Description Framework), 140
robots processing instruction, 143
web site for this book, xvi
well-formed XML documents, 4
well-formedness, 12
changing HTML to XHTML, 111
checking documents for, 26
constraints recognized by XML 1.0 parsers, 388
errors, 26, 28
HTML document, editing for conversion to XHTML, 111
parsing XML documents for, 7
Richard Tobin's checker and validator, 35
SAX filters, problems with, 363
SAXParseException and, 586
XSLT stylesheets and, 149
whitespace
in element contents, 13
stripping, 488
trimming and normalizing in element content, 182
in XML names, 19
XML specification for, 373
white-space property (CSS), 231
wildcards and location paths, 169
Windows ANSI (see Cp1252 character set)
Windows systems
Latin-1 character set, use of, 80
saving files as Unicode, 79
Topologi Schematron Validator, 36
word processors
saving data in XML, 98
WordprocessingML, 101-105
root element of document, 101
text elements, 102
World Wide Web Consortium (see W3C)
WRONG_DOCUMENT_ERR exception (DOM), 532
w:wordDocument element, 101
X[ Top ]
#x3C3F786D Unicode character, 81
Xalan XSLT processor, 147
Xalan-J (XSLT engine), 122
XEP (RenderX formatting program), 243
Xerces-J parser, 281
#xFEFF Unicode character, 75, 80
#xFFFE Unicode character, 81
XHTML, xii, 110, 111-117
browsers' handling of, 117
converting HTML document to, 111-115
changes for well-formedness, 111
converted HTML document, 113-115
converting HTML docuument to
example HTML document, 112
DTDs for, 115
empty element tags, 13
example of a valid document, 113
Modular XHTML, 110
Modular (XHTML 1.1)
authoring compound documents with, 122-139
moving from HTML to, 111
replacing standard DTD, 136
vs. HTML, 111
web browser support for, 116
XHTML 1.1
creating your own XHTML, 128
list of modules, 122
mixing applications into, 127
mixing modules into applications, 124
Module-based XHTML (W3C specification), 126
XHTML Basic and RDDL, 251
XInclude, xii, 10
XLink
embedding non-XML content in XML documents, 53
extended links, 189-196
arcs, 191-195
locator elements, 190
link behavior
xlink:actuate attribute, 187
link semantics, 188
xlink
role attribute, 188
XLink (Extensible Linking Language), xii, 10
xlink:actuate attribute, 187
xlink:arcrole attribute, 194
xlink:from attribute (arc elements), 191
xlink:href attribute, 186
for locators, 190
xlink:label attribute, 190
local resources and, 195
xlink:role attribute
locator elements, 190
resource elements, 195
XLinks, ??-198
DTDs for, 197
extended links
local resources, 195
link behavior, 186-188
xlink:show attribute, 187
linkbases, 196
simple links, 184
web browsers' support of, 184
XPointers and, 201
xlink:show attribute, 187
xlink:title attribute, 188
arc elements, 194
locator elements, 190
resource elements, 195
xlink:title attributes, 195
xlink:to attribute (arc elements), 191
xlink:type attribute
arc, 190, 191
extended, 189
locator, 190
possible values of, 185
resource, 190, 195
simple, 185
xlink:type attributes
title, 195
XML
benefits of, 3
case sensitivity in, 14
character sets, 74
comments, 22
communications protocols and, 263
data format, 6, 261
default character set for documents, 80
direct display in browsers, 117-122
xml-stylesheet processing instruction, 118-120
direct display in web browsers
alternative approaches, 122
Internet Explorer, 120
documents
associating stylesheets with, 222
counting elements/attributes, 356
grammar for, 400
parsing with DOM, 344
syntax structures, 372
tree structure of, 163
validating, 278
documents (see documents, XML)
elements, 13
enterprise applications and, 262
evolution of, 8-11
files, 13
how it works, 7
invalid documents, 4
name tokens, 44
names, 18-20, 373
namespaces, 60
object serialization and, 267
parsers
event-based, 312
parsers (see parsers)
processing instructions, 22
processing issues, 317
comments, 321
notations, 322
unparsed entities, 323
processing models, 311
abstracting XML, 316
event-based, 312
extensions/standards, 316
text-based, 311
tree-based, 314
schema languages, 5, 10
SGML and, 8
storing in a database, 6
trees, 14
valid documents, 4
version 1.1, xiii, 11, 24
IRI, use for namespaces, 60
namespaces, 66
what it isn't, 5
XML applications, 4
Robin Cover's list of, 59
XML Canonicalization, 317
XML declaration, 24-26
web browser problems with, 116
XML Encryption, 11
XML Information Set, 11
XML Schema Language, 412
XML Signature, 11
XML Validation Form (Brown University), 34
XMLFilter interface (SAX), 584
XMLFilterImpl class (SAX), 362, 591
xml:lang attribute, 84-85
ATTLIST declarations of, 85
language codes, 84
subcodes for regions, 84
xmllint, 27
--valid flag, 35
xmlns attribute, 60, 165
SAX core feature, 594
setting default namespaces with, 67, 387
xmlns:prefix attribute, 165
xmlns:xsl attribute, 145
XMLReader interface (SAX), 584
counting elements/attributes in a document, 358
validating parsers, 359
XMLReaderAdapter class (SAX), 593
XMLReaderFactory class (SAX), 593
xml-stylesheet processing instruction, 23
pseudoattributes used in, 118-120
XSLT stylesheets and, 148
XPath, xii, 163-183
arithmetic operators in, 176
Boolean expressions, 458
calculating string value of an expression, 150
data model, 165, 457
expressions, 151, 176-178, 458
Booleans, 177
numbers, 176
strings, 177
(see also location paths)
functions, 178, 464
Boolean, 182
node-set, 179
numeric, 182
string, 180
location paths, 165-170, 459
child element location steps, 166
compound, 170-172
root, 166
node sets in, 459
numbers in, 459
predicates in location paths, 172, 460, 463
reference, 457
relational operators, 172
strings in, 459
unabbreviated location paths, 173-176
XPointer, xii, 10
XPointers, 199
changing context nodes for, 208
in links, 201
namespaces and, 204
on URLs, 199
points in, 204
ranges in, 206
xs, 282, 283, 284, 285, 293, 295, 297, 304, 432, 440, 452
xsl, 475, 476, 495, 497
XSL (Extensible Stylesheet Language), 9
XSLT and XSL-FO, 144
xsl prefix (XSLT), 145
xsl:apply-templates element, 151-153
mode attribute, 157
xsl:decimal-format element, 181
XSL-FO (XSL Formatting Objects), 4, 9, 218, 233
boxes in, 234
generating, 240
laying out master pages, 237
programs for working with, 242
properties, 243
structure of, 236
vs. CSS, 248
XSLT, xii, 9, 144-162
applying templates with xsl:apply-templates, 151-153
attribute value templates, 159
calculating element value with xsl:value-of, 150
format-number( ) function, 181
functions, 494
input document, example of, 144
Internet Explorer and, 120
modes, applying different templates with, 157-159
namespaces and, 159-161
other features, 161
stylesheet processors, 144, 147
built-in template rules, 147
problems with incorrect namespace URIs, 146
stylesheet using unabbreviated XPath syntax, 174
template rules, built-in, 154-157
for comment and processing instruction nodes, 156
for element and root nodes, 156
for namespace nodes, 156
for text and attribute nodes, 154
templates and template rules, 148
transforming XML documents, 108, 315
type pseudo-atttribute, specifying with, 118
version 1.0, 118
xsl:decimal-format element, 181
XSLT stylesheets, 144
elements, categories of, 472
flowing content into pages, 240
namespaces and, 472
transforming documents into XSL-FO documents, 244
XSL-FO and, 233
xsl:template element, 148
mode attribute, 157
xsl:value-of element, 150, 181
Y[ Top ]
Yiddish language, Unicode block for, 617
Z[ Top ]
zero-width nonbreaking space, UCS-2, 75