Reasoning Strategies

In what follows we provide an oversimplified introduction to some basic choices related to the implementation of semantic repositories with inference functionality.

Forward- vs. Backward-Chaining

Two main reasoning strategies can be outlined:

The outlined strategies have different advantages and drawbacks well studied throughout the history of knowledge representation and expert systems. Hybrid strategies (involving partial forward- and backward-chaining) are also possible and in many contexts proven to be efficient.

Here, the term inferred closure is defined as follows: the extension of a KB (or, in the case of RDF/OWL semantic repository, the extension of a graph of RDF triples) with all the implicit facts (triples), that could be inferred from it, using the enforced semantics.

Materialization

Consider a repository capable to reason by means of total forward-chaining; after each update to the KB made, the repository assures that the inferred closure is computed or updated and made available for query evaluation or retrieval. Generally, this strategy is known as materialization. In order to avoid confusion with various partial materialization approaches, let us call total materialization such an inference strategy, when applied under monotonic semantics. The main advantages and disadvantages of such a schema can be summarized as follows:

Probably, the most important advantage of inductive systems, based on total materialization, is that they can easily benefit from RDBMS-like query optimization techniques, as long as all the data is available at query time. The availability of all the data enables that the query evaluation engine use statistics and other means to make "educated" guesses about the cost of evaluation of a particular constraint and the cardinality of the result. Such optimizations are much more complex in case of deductive query evaluation.

As a reasoning strategy, total materialization is adapted in a number of the popular Semantic Web repositories, including some of the standard configurations of Sesame and Jena. Experiments, like the performance tests of OWLIM, suggest that an efficient implementation of total materialization often can outperform systems based on more complicated reasoning strategies.

Monotonic Logics

Under a monotonic logic, the addition of new explicit facts (or statements) to the knowledge base (or repository) has the effect that new implicit facts may extend the inferred closure, while at the same time the removal of facts, which were part of the inferred closure, is disallowed. In other words, the addition of new facts can only extend the inferred closure monotonically.

An interesting discussion on the relevance and the utility of monotonic logics for the Semantic Web between Ian Horrocks, Pat Hayes, and Seth Russell can be found in the W3C archives. A broader overview on the subject is provided by Seth Russell here. The semantics of RDFS is monotonic.