GraphDB Users Ask: What is the Most Important Hardware Attribute for Optimizing GraphDB Performance?

TESTED ON: GraphDB 10.2

April 27, 2023 4 mins. read GraphDB Q&As

ONTOTEXT ANSWER:

The question of optimizing any database is often a critical one. Even with the rise of cloud computing and how easy it is to obtain hardware nowadays, computing power does not come cheap. So, what is most optimal for performance and how to save costs?

There are essentially three attributes that heavily impact GraphDB’s performance. Those are memory, processor and storage.

Random access memory

Random access memory is probably the easiest to understand when it comes to optimizing GraphDB’s performance. The more memory you can give to the database, the bigger part of your knowledge graph can be kept in it. Accessing data from RAM is much faster than reading it from disk, so a RAM increase would result in better performance. This increase is limited by two factors:

  • GraphDB is not designed to load all of its data in memory. New triples always need to be stored on disk. Also, the sheer size of the database would be a limitation. Billions of triples can lead to data sizes over a terabyte and using so much memory would be prohibitively expensive.
  • RAM is not used for computations. So, it would always be blocked, at least partially, by the CPU.

Beside those limitations, some operations are performed only in-memory. This means that, for each database, there is a minimum RAM size. This is noted in our documentation.

While RAM frequencies and other technical attributes also have an impact on performance, it is negligible.

Processing

As a multi-threaded application, GraphDB can benefit from more cores. The number of cores you can use is tied to your license. More cores lead to more concurrent queries and faster inference. This means that in general, CPU cores have a strong impact on your performance. However, there is always single-threaded computations. For example, writing into storage is always a single-threaded operation as we can’t risk multiple threads overwriting the same data. The same applies for some queries, which also have a single-threaded part. Therefore, the individual clock speed of CPU cores also has a large impact on your overall performance.

CPU caches have a small effect on performance as well.

Storage

GraphDB is a database that persists the data to disk. Just like RAM, you need sufficient storage to actually store all your data. And, just like RAM, the actual speed of storage has a large impact. We always recommend using SSDs with GraphDB. In particular, more recent technologies like NVME are a huge boon to the database’s performance.

On the other hand, technologies such as RAID or NAS, while good for redundancy, usually have a negative impact on performance. This is particularly evident with cloud providers. AWS, Azure, Google cloud all have quotas for IOPS and bandwidth on their disks. This can be a serious problem, as when you hit that quota, your requests are throttled. So, you need to keep a close eye on monitoring your disk usage.

To sum it up, the question of hardware optimization of GraphDB is a complex one. There’s two barriers to entry, related to the database size – RAM and Storage size. Actual performance depends heavily on CPU speed, but it’s important to watch out for throttling from IO operations. Concurrent operations, on the other hand, benefit greatly from an increased number of cores. In such an interconnected environment, very often, it’s not enough to increase just one parameter. This is why we recommend keeping a close watch on GraphDB’s resource utilization and boosting your hardware as needed.

You can check our benchmark page, which reflects some of those concerns. Get in touch with us for recommendations for cloud hardware to make the most out of your GraphDB.

New call-to-action

 

Article's content

Ontotext answers questions from our GraphDB users. You can also check out the frequently asked questions on general topics about GraphDB. Or you can get quick answers on technical questions from the community as well as Ontotext experts using the graphdb tag on stack overflow.

GraphDB Users Ask: Where Can We Deploy GraphDB And What Are Some Best Practices?

In this blog, we answer questions from our GraphDB users. This question is about where can one deploy GraphDB and what are some best practices

GraphDB Users Ask: What Isolation Levels Does GraphDB Support?

In this blog, we answer questions from our GraphDB users. This question is about the the isolation levels GraphDB supports..

GraphDB Users Ask: What is the Most Important Hardware Attribute for Optimizing GraphDB Performance?

In this blog, we answer questions from our GraphDB users. This question is about the most important hardware attribute for optimizing GraphDB performance.

GraphDB Users Ask: What is the Best Way to Store the Triples’ History in the Database?

In this blog, we answer questions from our GraphDB users. This question is about the best way to store the triples’ history in the database

GraphDB Users Ask: Can I Use Nested Repositories to Introduce Logical Separation to GraphDB?

In this blog, we answer questions from our GraphDB users. This question is about using nested repositories to introduce logical separation to GraphDB

GraphDB Users Ask: Can I Fine-tune Security on Some of the Endpoints in GraphDB?

In this blog, we answer questions from our GraphDB users. This question is about fine-tuning securing on a GraphDB endpoint.

GraphDB Users Ask: What Are the Different Ways to Deploy GraphDB?

In this blog, we answer questions from our GraphDB users. This question is about the different ways to deploy GraphDB.

GraphDB Users Ask: What is the best way to integrate JSON data in GraphDB?

In this blog, we answer questions from our GraphDB users. This question is about the best ways to integrate JSON data in GraphDB.

GraphDB Users Ask: How Does GraphDB’s Security Work, Especially for Automated APIs?

In this feature, we answer questions from our GraphDB users. This question is about how about GraphDB security workds, especially for Automated APIs

GraphDB Users Ask: Is Kafka Only Used for Exporting Data, or for Importing, or Can We Do Both?

In this feature, we answer questions from our GraphDB users. This question is about if Kafka is used only for exporting or importing data or we can use for both

GraphDB Users Ask: How Do I Change the Configuration of an Existing Connector?

In this feature, we answer questions from our GraphDB users. Today’s question is about how to change the configuration of connector if you’ve made a mistake when creating it

GraphDB Users Ask: Are There Any Administration Differences to Operating a Cluster on GraphDB 10?

In this feature, we answer questions from our GraphDB users. Today’s question is about whether there are administration differences to operating a cluster in GraphDB 10

GraphDB Users Ask: Can I Scale GraphDB?

In this feature, we answer questions from our GraphDB users. Today’s question is if one can scale GraphDB.

GraphDB Users Ask: Can I Change My Inference At Runtime?

In this feature, we answer questions from our GraphDB users. Today’s question is if one can change inference at runtime.

GraphDB Users Ask: How To Mark Statements In A Query As Explicit Or Implicit?

In this feature, we answer questions from our GraphDB users. Today’s question is about how to mark statements in a query as explicit or implicit.

GraphDB Users Ask: Can I Use the Standard Ontop Configurations?

In this feature, we answer questions from our GraphDB users. Today’s question is if one can use the standard Onotp configurations.

GraphDB Users Ask: Should I Use a SPARQL Repository or a HTTP Repository?

In this feature, we answer questions from our GraphDB users. Today’s question us whether to use a SPARQL Repository or a HTTP Repository.

GraphDB Users Ask: Do You Have Any Advice on the Log4j Vulnerability for Different Versions of GraphDB?

In this feature, we answer questions from our GraphDB users. Today’s question is about the Log4j vulnerability for different versions of GraphDB.

GraphDB Users Ask 12 Very Short Questions

In this feature, we answer questions from our GraphDB users. Today, we answer 12 very short question from GraphDB users.

GraphDB Users Ask: Which of the GraphDB Logs Do I Need to Monitor for Problems?

In this feature, we answer questions from our GraphDB users. Today’s question is about GraphDB logs and how to monitor for problems.

GraphDB Users Ask: Can You Help Me Optimize My Queries?

In this feature, we answer questions from our GraphDB users. Today’s question is about how users can optimize their queries.

GraphDB Users Ask: What’s the Difference Between SPARQL and FedX Federation?

In this feature, we answer questions from our GraphDB users. Today’s question is about the difference between SPARQL and FedX federation.

GraphDB Users Ask: What Does The “Insufficient Free Heap Memory” Error Mean?

In this feature, we answer questions from our GraphDB users. Today’s question is about what the “Insufficient Free Heap memory” error means.

GraphDB Users Ask: How To Optimize My Inference?

In this feature, we answer questions from our GraphDB users. Today’s question is about how to optimize inference.

GraphDB Users Ask: Is RDF-Star The Best Choice For Reification?

In this feature, we answer questions from our GraphDB users. Today’s question is about whether RDF-star is the best choice for reification.

GraphDB Users Ask: Can GraphDB Infer Data Based on Values From a Virtualized Repository?

In this feature, we answer questions from our GraphDB users. Today’s question is about if GraphDB’s inference works with virtualized repositories.

GraphDB Users Ask: How Does SHACL Work on GraphDB?

In this feature, we answer questions from our GraphDB users. Today’s question is about how SHACL works on GraphDB.

GraphDB Users Ask: Does GraphDB Support ABAC?

In this feature, we answer questions from our GraphDB users. Today’s question is about if GraphDB supports ABAC.

GraphDB Users Ask: Why Do I Get Errors About GraphDB Being “Unable to Find Valid Certification Path to Requested Target”?

In this feature, we answer questions from our GraphDB users. Today’s question is about getting errors about GraphDB being “unable to find valid certification path to requested target”.

GraphDB Users Ask: How Can I Break Up My Data to Control Access To It?

In this feature on our blog, we answer questions from our GraphDB users. Today’s question is about GraphDB security and access control.

GraphDB Users Ask: Why does My Import Start Really Fast But Then Starts Losing Speed After a While?

In this feature on our blog, we answer questions from our GraphDB users. Today’s question is about GraphDB import speed.

GraphDB Users Ask: Can You Help Me Understand The Built-in GraphDB Security?

In this feature on our blog, we answer questions from our GraphDB users. Today’s question is about GraphDB security.

GraphDB Users Ask: How Many Repositories Can I Have in GraphDB and How Can I Unite the Disparate Data Between Them?

In this feature, we answer questions from our GraphDB users. Today’s question is about the number of repos in GraphDB and accessing the data.