
FactForge (formerly LDSR) represents a reason-able view to the web of data. It aims to allow users to find resources and facts based on the semantics of the data, much as web search engines index WWW pages and facilitate their usage.
FactForge enables users to easily identify resources in the Linking Open Data (LOD) cloud. It provides an efficient mechanism to query data from multiple datasets and sources, taking their semantics into account. FactForge is also designed as a use case for large-scale reasoning and data integration.
FactForge includes several of the most central datasets of LOD. The OWLIM semantic repository is used to load the data and "materialize" the facts that can be inferred from it. It is probably the largest and most heterogeneous body of general factual knowledge ever used for logical inference. Read more: presentation, brochure, presentation.
FactForge has the following characteristics:
The data is accessible through a web user interface at http://factforge.net, which allows:
<http://dbpedia.org/resource/Madrid> or dbpedia:Madrid
(...add prefixes here...)
SELECT * WHERE {
    ?Person dbp-ont:birthPlace [geo-ont:parentFeature dbpedia:Germany] ;
            rdf:type dbp-ont:Politicians ;
            om:hasRDFRank ?RR .
} ORDER BY DESC(?RR)
This is an example of a structured query, the evaluation of which involves data from 4 datasets and interpretation of the semantics of several schemata (i.e. reasoning). Within a few seconds it returns results ranked by PageRank (in the RDF graph).
(...add prefixes here...)
SELECT * WHERE {
    ?Person pext:birthPlace [ptop:subRegionOf dbpedia:Germany] ;
            pext:hasProfession pext:Politician ;
            om:hasRDFRank ?RR .
    ?BirthPlace
} ORDER BY DESC(?RR)
Note that the conceptualization of Politician in this model is a profession, whereas in the first query, Politician is defined as a person. Additionally, executing the second query retrieves 35% more results over the entire FactForge dataset.
A public SPARQL end-point is available at http://factforge.net/sparql, allowing FactForge to be used as a query evaluation web service.
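As a rough illustration of using the endpoint as a web service, the sketch below builds a query request with the Python standard library. It assumes the endpoint follows the standard SPARQL Protocol (a `query` parameter on a GET request) and can return results in the SPARQL JSON results format; neither is confirmed by this page. The function names are hypothetical, and only the `dbpedia:` prefix is declared because it is the only one spelled out in the text above. Any other prefix has to be supplied by the caller.

```python
import json
import urllib.parse
import urllib.request

# Endpoint URL as quoted in the text.
ENDPOINT = "http://factforge.net/sparql"

# Only the dbpedia: prefix is given in the text; other prefixes are not assumed here.
PREFIXES = {"dbpedia": "http://dbpedia.org/resource/"}


def with_prefixes(query, prefixes):
    """Prepend PREFIX declarations to a SPARQL query body."""
    header = "\n".join(f"PREFIX {name}: <{uri}>" for name, uri in prefixes.items())
    return f"{header}\n{query}"


def build_request(endpoint, query):
    """Build a GET request in the SPARQL Protocol style, asking for JSON results.

    Assumption: the service honours the Accept header below.
    """
    url = endpoint + "?" + urllib.parse.urlencode({"query": query})
    return urllib.request.Request(
        url, headers={"Accept": "application/sparql-results+json"}
    )


def bindings(results_json):
    """Turn a SPARQL JSON results document into a list of {variable: value} dicts."""
    doc = json.loads(results_json)
    return [
        {var: cell["value"] for var, cell in row.items()}
        for row in doc["results"]["bindings"]
    ]


def run_query(query, endpoint=ENDPOINT, prefixes=PREFIXES, timeout=30):
    """Send the query over the network; the demonstration service may be unavailable."""
    request = build_request(endpoint, with_prefixes(query, prefixes))
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return bindings(response.read().decode("utf-8"))
```

A call such as `run_query("SELECT * WHERE { dbpedia:Madrid ?p ?o } LIMIT 5")` would then return one dictionary per result row, provided the service is reachable and behaves as assumed.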
"Linked data" represents a set of principles for publishing of structured data they can be explored and navigated in a manner analogous to the HTML WWW. The linked data concept is an enabling factor for the realization of the Semantic Web as a global web of structured data around the Linking Open Data initiative.
FactForge was initially developed as an evaluation case in the European research project LarKC. It has since been extended, improved, and built into the data layer infrastructure of the RENDER FP7 European research project. The development of OWLIM, as well as other relevant technology and know-how, has been supported by several projects within the FP5, FP6, and FP7 programmes of the European Commission: RASCALLI, TAO, TripCom, SEKT, On-To-Knowledge.
The Linked Life Data (LLD) service is similar to FactForge. It represents a reason-able view towards the life science part of LOD, including UniProt, GeneOntology, and more than 20 other datasets. FactForge and LLD are based on the same technology: the Forest semantic web front-ends and the OWLIM semantic repository. With its 5 billion explicit statements, LLD is probably the largest body of non-synthetic knowledge that has been used for inference.
FactForge is an experimental project from Ontotext. Access to this demonstration service is free of charge. Ontotext does not provide any guarantees of quality, availability, or fitness for a particular purpose. FactForge is far from perfect. Here are a few comments on known shortcomings and development plans: