The Lehigh University Benchmark (LUBM) is probably the most popular benchmarking framework for semantic repositories. It uses a relatively simple ontology describing a university organization structure with synthetically generated datasets. The data generated for each university include a number of departments and related individuals together with relevant descriptions and existing relations between them. The largest 'standard' dataset, LUBM(50), includes 50 universities, described in 1000 OWL files with a total size of 583MB and almost 7 million explicit statements.
The LUBM benchmark consists of sample ontologies (and data), Java benchmark programs, and 'adapter' interfaces, making it easy to adapt to different repositories. The OWLIM 'adapter' for LUBM, which is provided together with the distribution of each OWLIM version, enables users to reproduce the results in their own environment. The framework also contains a data generator, so that one can easily generate a dataset with the desired size.
Last but not least, LUBM comes with 14 queries, the correct evaluation of which requires inference with respect to both the ontology and the data. The semantics of the ontology and the datasets is fully covered by the OWL Horst language fragment. The complexity of the inference process is relatively low because the generated datasets are loosely interconnected.
Note: The results presented below have been achieved by generating data for 90,000 universities, which is well above any of the standard LUBM data set sizes.
Highlights:
| Run | OWLIM version | benchmark | explicit sts. | implicit sts. | total sts. | inference | loading time | loading speed (expl. st/sec) |
test environment |
| b31lu90kH | BigOWLIM 3.1 | LUBM(90k) | 12,03B | 8.43B | 20.46B | owl-horst | 290h | 11,543 | ontonap-s |
| b30lu50kN | BigOWLIM 3.0 | LUBM(50k) | 6,7B | - | 6.7B | none | 42h | 43,914 | ontonap |
| b31lu45kH | BigOWLIM 3.1 | LUBM(45k) | 6,01B | 4.20B | 10.2B | owl-horst | 135h | 12,403 | ontonap-s |
| b30lu25kH | BigOWLIM 3.0 | LUBM(25k) | 3,3B | 2.2B | 5.3B | owl-horst | 84h | 11,042 | ontonap |
| b31lu8kH | BigOWLIM 3.1 | LUBM(8k) | 1,07B | 786M | 1.85B | owl-horst | 14.4h | 20,631 | ontosol |
| b31lu8kN | BigOWLIM 3.1 | LUBM(8k) | 1,1B | - | 1,1B | none | 4.5h | 66,196 | ontosol |
| b33lu8kH | BigOWLIM 3.3 | LUBM(8k) | 1,1B | 786M | 1,1B | owl-horst | 14.11h | 21,014 | ontosol |
| b33lu8kN | BigOWLIM 3.3 | LUBM(8k) | 1,1B | - | 1,1B | none | 3.8h | 77,323 | ontosol |
The table presents query evaluation time and number of results per query for different runs
| Run | OWLIM version | Q1 | Q2 | Q3 | Q4 | Q5 | Q6 | Q7 | Q8 | Q9 | Q10 | Q11 | Q12 | Q13 | Q14 | Total time (sec.) | |
| b31lu8kH | BigOWLIM 3.1 | Time (msec.) | 52 | 873s | 25 | 28 | 29 | 174s | 71 | 132 | 1,460s | 24 | 23 | 33 | 121.9s | 157s | 46m |
| # of results | 4 | 2,528 | 6 | 34 | 719 | 83.6M | 67 | 7,790 | 2.2M | 4 | 224 | 15 | 37,118 | 63.4M | 149.2M |
Query evaluation statistics for LUBM(8000) on a desktop machine (run b31lu8kH):
Query evaluation statistics for the run on LUBM(25000):