CS835 - Data and Document Representation & Processing

Lecture 4 – Hypermedia II – Linking - Xpath, Xmlbase, Xinclude, XLink and XPointer

Reference: The XML Revolution Technologies for the future Web

http://www.brics.dk/~amoeller/XML/index.html

XInclude - combining XML documents

To enhance reuse and modularity, a technique for constructing new XML documents from existing ones is desirable.

XInclude provides a simple inclusion mechanism.

Why yet another specification?

many XML documents and languages can benefit from modularity
as for the namespace solution, a generic approach can be implemented in generic tools

Application conformance: Think of XML as if Namespaces, XInclude, and XML Base were parts of the basic XML specification. (Caveat: the latter two are not widely implemented yet.)

An XInclude example

A document containing:

<foo xmlns:xi="http://www.w3.org/2001/XInclude">
<xi:include href="somewhere.xml"/>
</foo>

where somewhere.xml contains:

<bar>...</bar>

is equivalent to:

<foo xmlns:xi="http://www.w3.org/2001/XInclude">
<bar>...</bar>
</foo>

http://www.w3.org/2001/XInclude is the official XInclude namespace
the include element name in that namespace is an inclusion directive
right after parsing and before other processing, an XInclude processor performs the inclusion (tree substitution)
the original and the resulting document should be considered equivalent
it is an error to have cyclic includes

XInclude details

How is the included resource denoted?

with XPointer (described later...) - an extension of URLs that can address document nodes, node sets, or character data ranges

Other issues:

with parse="text" and encoding="..." attributes, a resource can be transformed into a character data node before inclusion
XInclude processors may need to create namespace declaration attributes to ensure equivalence

Many XInclude processors support only whole-document URIs, not full XPointer.

XML Base

A URI identifies a resource:

http://somewhere/somefile.xml is an absolute URI
somefile.xml is a relative URI

Inspired by the <base href="..."> mechanism in HTML, XML Base provides a uniform way of resolving relative URIs.

In the following example:

<... xml:base="http://www.daimi.au.dk/">
<... href="~mis/mn/index.html" .../>
</...>

the value of href attribute can be interpreted as the absolute URI http://www.daimi.au.dk/~mis/mn/index.html.

the xml namespace prefix is hardwired by the Namespace specification
xml:base has lexical scope (as namespace declarations)
the URI used to access the document is used as default URI base

Examples of applications:

XLink (requires XML Base support)
XHTML (will use XML Base)
Namespaces (does not conform to XML Base, but it ought to...)
your future XML language

Future XML parsers will support Namespaces, XInclude, and XML Base.

Three layers:

XLink

a generalization of the HTML link concept
higher abstraction level (intended for general XML - not just hypertext)
more expressive power (multiple destinations, special behaviors, linkbases, ...)
uses XPointer to locate resources

XPointer

an extension of XPath suited for linking
specifies connection between XPath expressions and URIs

XPath

a declarative language for locating nodes and fragments in XML trees
used in both XPointer (for addressing), XSL (for pattern matching), XML Schema (for uniqueness and scope descriptions), and XQuery (for selection and iteration)

These technologies are standardized but not all widely implemented yet.

Problems with HTML links

The HTML link model:

Construction of a hyperlink:

<a name="here"> is placed at the destination
<a href="URL#here"> is placed at the source

Problems when using the HTML model for general XML:

Link recognition:

in HTML, links are recognized by element names (a, img, ..)
- we want a generic XML solution
the "semantics" of a link is defined in the HTML specification
- we want to identify abstract semantic features, e.g. link actuation

Limitations:

an anchor must be placed at every link destination (problem with read-only documents)
- we want to express relative locations (XPointer!)
the link definition must be at the same location as the link source (outbound)
- we want inbound and third-party links
only individual nodes can be linked to
- we want links to whole tree fragments
a link always has one source and one destination
- we want links with multiple sources and destinations

The usual point: generic solutions allow generic tools!

The XLink linking model

Basic XLink terminology:

Link: explicit relationship between two or more resources.
Linking element: an XML element that asserts the existence and describes the characteristics of a link.
Locator: an identification of a remote resource that is participating in the link.

One linking element defines a set of traversable arcs between some resources.

A local resource comes from the linking element's own content.

Outbound: the source is a local resource
Inbound: the destination is a local resource
Third-party: none of the resources are local

Third-party links can be used to construct shared link bases for browsers.

An example

A linking element defining a third-party "extended" link involving two remote resources:

  <mylink xmlns:xlink="http://www.w3.org/1999/xlink" xlink:type="extended">

    <myresource xlink:type="locator"

                xlink:href="students.xml#Fred" xlink:label="student"/>

    <myresource xlink:type="locator"

                xlink:href="teachers.xml#Joe" xlink:label="teacher"/>

    <myarc xlink:type="arc"

           xlink:from="student" xlink:to="teacher"/>

  </mylink>

the namespace http://www.w3.org/1999/xlink is used to recognize XLink information in general XML documents

the namespace often (but not necessarily) uses namespace prefix xlink
host language: elements and attributes not belonging to this namespace are ignored by XLink processors
all XLink information is defined in attributes (in host language elements)

xlink:type="extended" indicates a linking element
xlink:type="locator" locates a remote resource
xlink:type="arc" defines traversal rules

A powerful example application of general XLinks:

Using third-party links and a smart browser, a group of people can annotate Web pages with "post-it notes" for discussion - without having write access to the pages. They simply need to agree on a set of URIs to XLink link bases defining the annotations. The smart XLink-aware browser lets them select parts of the Web pages (as XPointer ranges), comment the parts by creating XLinks to a small XHTML documents, view each other's comments, place comments on comments, and perhaps also aid in structuring the comments.

Linking elements

- defining links

All elements with XLink information contain an xlink:type attribute.

a general linking element is defined using an xlink:type="extended" attribute; this element can contain the following:
a local resource is defined with xlink:type="resource"
a remote resource is defined with xlink:type="locator" and with an xlink:href attribute (an XPointer expression locating the resource)
arcs (traversal rules) are defined with xlink:type="arc":

both "resource" and "locator" elements can have xlink:label attributes
an arc element has an xlink:from and an xlink:to attribute
the "arc" element defines a set of arcs: from each resource having the from label to each resource having the to label

(Note the confusing terminology: a resource is defined either by a "resource" element or by a "locator" element.)

XPointer is described later - just think of XPointer expression as URIs for now...

Behavior

- link semantics

Arcs can be annotated with abstract behavior information using the following attributes:

xlink:show - what happens when the link is activated?

Possible values:

`embed`	insert the presentation of the target resource (the one at the end of the arc) in place of the source resource (the one at the beginning of the arc, where traversal was initiated) (example: as images in HTML)
`new`	display the target resource some other place without affecting the presentation of the source resource (example: as `target="_blank"` in an HTML link)
`replace`	replace the presentation of the resource containing the source with a presentation of the destination (example: as normal HTML links)
`other`	behavior specified elsewhere
`none`	no behavior is specified

xlink:actuate - when is the link activated?

Possible values:

`onLoad`	traverse the link immediately when recognized (example: as HTML images)
`onRequest`	traverse when explicitly requested (example: as normal HTML links)
`other`	behavior specified elsewhere
`none`	no behavior is specified

Note: these notions of link behavior are rather abstract and do not make sense for all applications.

Semantic attributes: describe the meaning of link resources and arcs

xlink:title

provide human readable descriptions (also available as xlink:type="title" to allow markup)

xlink:role and xlink:arcrole

URI references to descriptions

Simple vs. Extended links

- for compatibility and simplicity

Two kinds of links:

extended - the general ones we have seen so far
simple - a restricted version of extended links: only for two-ended outbound links (enough for HTML-style links)

Convenient shorthand notation for simple links:

  <mylink xlink:type="simple" xlink:href="..." xlink:show="..." .../>

is equivalent to:

  <mylink xlink:type="extended">

    <myresource xlink:type="resource"

                xlink:role="local"/>

    <myresource xlink:type="locator"

                xlink:role="remote" xlink:href="..."/>

    <myarc xlink:type="arc"

           xlink:from="local" xlink:to="remote" xlink:show="..." .../>

  </mylink>

Many XLink properties (e.g. xlink:type and xlink:show) can conveniently be specified as defaults in the schema definition!

When should I use XLink? Tim Berners-Lee: only for hypertext linking (Not everybody agree...)

HLink vs. XLink

Why is XHTML not using XLink?

The problem:

we want a general mechanism for identifying links
...but we want full control when designing the syntax of the host languages

When integrating XLink in a host language, the use of the XLink namespace makes a mess.

HLink: a recent alternative to XLink

same underlying ideas
different syntax

Example HLink: Definition of the link semantics of <a href="..."> elements in XHTML.

<hlink namespace="http://www.w3.org/1999/xhtml"

       element="a"

       locator="@href"

       effect="replace"

       actuate="onRequest"

       replacement="@target"/>

13 September 2002: W3C's HTML Working Group publishes HLink draft for intended use in XHTML 2.0

24 September 2002: W3C's Technical Architecture Group rejects HLink in favor of XLink for the design of XHTML 2.0

XPointer: Why, what, and how?

an extension of XPath which is used by XLink to locate remote link resources
relative addressing: allows links to places with no anchors
flexible and robust: XPointer/XPath expressions often survive changes in the target document
can point to substrings in character data and to whole tree fragments

Example of an XPointer:

URI

   -----------------------------------------------------------------

  /                                                                 \

  http://www.foo.org/bar.xml#xpointer(article/section[position()<=5])

                            |         \                            /|

                            |          ---------------------------- |

                             \              XPointer expression    /

                              \                                   /

                               -----------------------------------

                                  XPointer fragment identifier

(points to the first five section elements in the article root element.)

In HTML, fragment identifiers may denote anchor IDs - XPointer generalizes that.

XPointer vs. XPath

XPointer is based upon XPath:

an XPointer expression is basically the same as an XPath expression
XPath says nothing about URIs; XPointer specifies that connection
an XPath expression is evaluated wrt. a context; XPointer specifies this context
XPointer adds some features not available in XPath

XPointer fragment identifiers

An XPointer fragment identifier (the substring to the right of # in the URI) is either

the value of some ID attribute in the document (ID attributes are specified by the schema),
a sequence of element numbers denoting the path from the root to an element (e.g. /1/27/3), or
a sequence of the form

xpointer(...) xpointer(...) ...

containing a list (typically of length 1) of XPointer expressions.

Each expression is evaluated in turn, and the first where evaluation succeeds is used. (This allows alternative pointers to be specified thereby increasing robustness.)

Recently, the XPointer spec has been split into four (tiny) parts:

Next: We will now look into XPath and then later describe what additional features XPointer adds to XPath...

XPath: Location paths

XPath is a declarative language for:

addressing (used in XLink/XPointer and in XSLT)
pattern matching (used in XSLT and in XQuery)

The central construct is the location path, which is a sequence of location steps separated by /, e.g.:

  child::section[position()<6] / descendant::cite / attribute::href

selects all href attributes in cite elements in the first 5 sections of an article document.

a location step is evaluated wrt. some context resulting in a set of nodes
a location path is evaluated compositionally, left-to-right, starting with some initial context

location paths resemble operating system directory paths
each node resulting from evaluation of one step is used as context for evaluation of the next, and the results are unioned together

A context consists of:

a context node
a context position and size (two integers)
variable bindings, a function library, and a set of namespace declarations

Initial context: defined externally (e.g. by XPointer, XSLT, or XQuery).
Location paths can be prefixed with / to use the document root as initial context node!

Note: in the XPath data model, the XML document tree has a special root node above the root element.

There is a strong analogy to directory paths (in UNIX). As an example, the directory path /*/d/*.txt selects a set of files, and the location path /*/d/*[@ext="txt"] select a set of XML elements

Location steps

A single location step has the form

axis :: node-test [ predicate ]

The axis selects a rough set of candidate nodes (e.g. the child nodes of the context node).
The node-test performs an initial filtration of the candidates based on their

types (chardata node, processing instruction, etc.), or
names (e.g. element name).

The predicates (zero or more) cause a further, potentially more complex, filtration.
Only candidates for which the predicates evaluate to true are kept.

The candidates that survive the filtration constitute the result.

This structure of location steps makes implementation rather easy and efficient, since the complex predicates are only evaluated on relatively few nodes.

The example from before:

  child::section[position()<6] / descendant::cite / attribute::href

selects all href attributes in cite elements in the first 5 sections of an article document.

Axes

Available axes:

`child`	the children of the context node
`descendant`	all descendants (children, childrens children, ...)
`parent`	the parent (empty if at the root)
`ancestor`	all ancestors from the parent to the root
`following-sibling`	siblings to the right
`preceding-sibling`	siblings to the left
`following`	all following nodes in the document, excluding descendants
`preceding`	all preceding nodes in the document, excluding ancestors
`attribute`	the attributes of the context node
`namespace`	namespace declarations in the context node
`self`	the context node itself
`descendant-or-self`	the union of `descendant` and `self`
`ancestor-or-self`	the union of `ancestor` and `self`

Note that attributes and namespace declarations are considered a special kind of nodes here.

Some of these axes assume a document ordering of the tree nodes. The ordering is the left-to-right preorder traversal of the document tree - which is the same as the order in the textual representation.

The resulting sets are ordered intuitively, either forward (in document order) or reverse (reverse document order).
For instance, following is a forward axis, and ancestor is a reverse axis.

(Frustratingly, each technology uses a slightly different tree model...)

Node tests

Testing by node type:

`text()`	chardata nodes
`comment()`	comment nodes
`processing-instruction()`	processing instruction nodes
`node()`	all nodes (not including attributes and namespace declarations)

Testing by node name:

`name`		nodes with that name
`*`		any node

Warning: There is a bug in the XPath 1.0 spec! Default namespaces are required to be handled incorrectly, so, if using Namespaces together with XPath (or XSLT), all elements must have an explicit prefix.

Predicates

- expressions coerced to type boolean

A predicate filters a node-set by evaluating the predicate expression on each node in the set with

that node as the context node,
the size of the node-set as the context size, and
the position of the node in the node-set wrt. the axis ordering as the context position.

Example:

  child::section[position()<6] / descendant::cite[attribute::href="there"]

selects all cite elements with href="there" attributes in the first 5 sections of an article document.

(Compare with the earlier example.)

The XPath predicate language is very large, but these are the essential ones to know

[attribute::name="flour"]: test equality of an attribute
[attribute::name!="flour"]: test inequality of an attribute
[attribute::amount='0.5' and attribute::unit='cup']: test two things at once (also or)
[position()=2]: test position among siblings
[attribute::amount<'0.5']: a syntax error
[attribute::amount<'0.5']: a useless test of lexicographical order
[number(attribute::amount)<number('0.5')]: what you meant to write instead!

An entire location path may be used as a predicate

start at the current node
the predicate is true if the location path hits some result positions
it is false otherwise

This is very useful to look ahead:

[attribute::amount]: the node has an amount attribute
[descendant::ingredient]: the node has a nested ingredient

Expressions

Available types:

node-set (set of nodes)
boolean (true or false)
number (floating point)
string (Unicode text)

An expression can be:

a constant, e.g. "..."
a variable: $variable
a function call: function( arguments )
a boolean expression: or, and, =, !=, <, >, <=, >= (standard precedence, all left associative)
a numerical expression: +, -, *, div, mod
a node-set expression (using location paths!): | (set union)

Coercion may occur at function arguments and when expressions are used as predicates.

Variables and functions are evaluated using the context.

Core function library

Node-set functions:

last()	returns the context size
position()	returns the context position
count(node-set)	number of nodes in node-set
name(node-set)	string representation of first node in node-set
...	...

String functions:

string(value)		type cast to string
concat(string, string, ...)		string concatenation
...		...

Boolean functions:

boolean(value)		type cast to boolean
not(boolean)		boolean negation
...		...

Number functions:

number(value)		type cast to number
sum(node-set)		sum of number value of each node in node-set
...		...

- see the XPath specification for the complete list.

Abbreviations

Syntactic sugar: convenient notation for common situations

Normal syntax	Abbreviation
`child::`	nothing (so `child` is the default axis)
`attribute::`	`@`
`/descendant-or-self::node()/`	`//`
`self::node()`	`.` (useful because location paths starting with `/` begin evaluation at the root)
`parent::node()`	`..`

Example:

  .//@href

selects all href attributes in descendants of the context node.

Furthermore, the coercion rules often allow compact notation, e.g.

  foo[3]

refers to the third foo child element of the context node (because 3 is coerced to position()=3).

XPath visualization

Using Explorer 6 (or an updated version of Explorer 5) it is easy to experiment with XPath expressions.

The XPath Visualizer provides an interactive XPath evaluator that additionally visualizes the resulting node set (online installation).

This tool is implemented as an ordinary HTML page that makes heavy use of XSLT and JavaScript.

XPath examples

The following XPath expressions point to sets of nodes in the recipe collection:

"The amounts of flour being used":

//ingredient[@name="flour"]/@amount

0.5

0.25

"The ingredients of which half a cup are used":

//ingredient[@amount='0.5' and @unit='cup']/@name

grated Parmesan cheese

shredded mozzarella cheese

shortening

flour

orange juice

"The second step in preparing stock for Cailles en Sarcophages":

//ingredient[@name="stock"]/preparation/step[position()=2]/text()

When the liquid is relatively clear, add the carrots, celery, whole onion,

bay leaf, parsley, peppercorns and salt. Reduce the heat, cover and let

simmer at least 2 hours to make a hearty stock.

XPath 2.0

- currently a Working Draft, developed to capture the common subset of XSLT 2.0 and XQuery 1.0

Major changes from 1.0:

now using XML Schema primitive types instead of the four in 1.0

new type operators: cast, treat, assert, instance of

now using sequences instead of node-sets

also allow non-node types
new operators: for, if, some, every, intersect, except

many(!) new functions

regular expression match/replace/tokenize
date formats
...

XPointer, Part II - how XPointer uses XPath

XPointer: Context initialization

An XPointer is basically an XPath expression occurring in a URI.

When evaluated, the initial context is defined as follows:

the context node is the root node of the document
the context position and size are both 1 (because the root has no siblings)
the variable bindings are empty (variables are not used by XPointer)
the function library consists of the core XPath functions + a few extra functions
the namespace declarations are set as follows:

xmlns(myprefix=http://mynamespace.org) xpointer(...)

Warning: several levels of character escaping occur when using XPointer in XML documents

in XPointer, unbalanced parentheses must be escaped, e.g. ^)
in URIs, many characters must be escaped, e.g. %20
in XML attribute values, quotes, ampersand, etc. must be escaped, e.g. <

Extra XPointer features

XPointer provides a more fine-grained addressing than XPath.

Instead of just nodes, XPointers address locations, which can be nodes, points, or ranges.
A point can represent the location preceding or following any individual character in e.g. chardata nodes.
The special node test
point()
selects the set of points of a node.
A range consists of two points in the same document, and is specified using a special range-to location step construct.
XPointer provides some extra functions:

`here()`	get location of element containing current XPointer
`origin()`	get location where user initiated link traversal
`start-point(location-set)`	get start point of location set
`string-range(...)`	find matching substrings
`...`

Example:

  /descendant::text()/point()[position()=0]

selects the locations right before the first character of all character data nodes in the document.

Example:

  /section[1] / range-to(/section[3])

selects everything from the beginning of the first section to the end of the third.

Selected links: