• Blog
  • Informational

Here’s a Graph, Go Figure! Coupling Text Analytics with a Knowledge Graph

June 22, 2018 8 mins. read Teodora Petkova

These days, there’s hardly a company or an individual for whom textual information isn’t playing a key role in what they do and how they do it. We are all surrounded by gigabytes of textual data to put to use: from the mundane archive folder of our emails, we often lose ourselves in, to the large bodies of legal documents a lawyer needs to go through to ensure a company’s actions comply with the relevant laws, policies and regulations.

The truth is, we can’t possibly manipulate these gigantic quantities of text streams in a lifetime. The best we could try is to teach our machines to do that for us, efficiently and accurately.

New call-to-action

 

Autumn Leaves and Text Analytics

Like autumn leaves carried away, digital words and numbers dash through our private and public lives. They swirl across so many different channels in so many different contexts that our traditional methods and tools for coping with the written word are becoming increasingly inadequate. In a digital swirl of texts, new methods find their places in our approaches towards taming the written word. One such method is Text Analytics.

Text analytics processes, or simply put, the techniques used to make the meaning of textual resources machine-readable, processable and further ready for analysis and action, are different and vary according to the needs and the scales of the solutions required. Their challenges, however, are similar: work out a means for our language – an elaborate construct, ambiguous and context-dependent in its nature – to be formalized and presented in an unambiguous way to a computer system.

Autumn Leaves and Text Analytics

Let’s get back to “autumn leaves” and see what all of the above means in practical terms. For those of us who love jazz, “autumn leaves” can directly bring to mind a song by Nat King Cole. But what about a computer? How is a system supposed to figure out whether this is the name of a song, or a sentence about the end of the autumn season, or a tag about a bunch of autumn leaves?

Computationally, the tasks of disambiguating words, detecting a reference to a song and understanding the sentence’s syntax are extremely complex. They require a system to be trained to understand the definitions and the relationships between the words used together with the context they are in. This is where Text Analytics comes into play.

The main goal of Text Analytics is to bring textual content to a point where it is represented as data and thus accessible for machines to process and make sense of. Click To Tweet

This might involve syntax, token and sentence parsing, keywords and named entities detection and classification, as well as text summarization and trustworthiness and veracity analytics. It is through this “weaving” of data into text, which is the result of Text Analytics, that:

  • texts become readable for machines;
  • information retrieval is improved;
  • connectivity on multiple levels is achieved;
  • knowledge is turned into highly-manageable chunks.

Coupling Text Analytics with a Knowledge Graph

Understanding a text, or let’s be more accurate and say processing a text as to extract certain meaning out of it, presents many challenges to a machine: from identifying where the words start and end, through detecting phrases and sentences, all the way to determining what the entire text is about, based on the people, things, events and places, mentioned in it.

The first two are what Text Analytics deals with. The latter is the hardest for it often requires referring to knowledge that is outside the given text. Such knowledge, available for machines to access, is what a Knowledge Graph is all about.

Here’s a Graph, Go Figure!

If Text Analytics allows for a machine to have basic reading skills, Text Analytics coupled with a Knowledge Graph makes for a machine capable of recognizing people, organizations, locations and general encyclopaedic concepts in a text.

The Knowledge Graph represents a collection of interlinked descriptions of entities – real world objects, events, situations or abstract concepts. When a critical mass of concept descriptions are linked together in a big Knowledge Graph, they allow computers to interpret them and derive context and awareness, similar to the one that people develop in specific domains by means of education – formal or informal.

Knowledge Graphs serve machines to connect the dots. For example, a Knowledge Graph could contain the explicit statement that “Autumn Leaves” is a song by the artist Nat King Cole, recorded by other artists such as Eva Cassidy, Edith Piaf, Erik Clapton, etc.

Case in point, make a Google search for “Autumn Leaves” and you will see a direct answer, featuring all of the above-listed connections. What’s behind this result is Google’s implementation of a Knowledge Graph technology – a graph database that provides structured and detailed information about a given entity and the connections it enters with other entities.

Here is a Graph

As Atanas Kiryakov explained in a webinar touching on the workings behind a Knowledge Graph (see Graph Analytics on Company Data and News), a big Knowledge Graph can provide all flavors of context, which computers need to be able to recognize an entity in a text. Such richness is the result of a complex dynamics of relationships between machine-readable context and Text Analytics and involves technologies related to:

  • Classification – grouping entities into classes, categories, etc. they belong to. Knowing that Paris Hilton is a Person helps us disambiguate a reference to her in a context like “Paris Hilton was arrested” so that it is clear that this is not about the Hilton hotel in Paris.
  • Related entities – if an article mentions multiple cities in Texas, this is evidence that “Paris” refers to Paris, TX and not the capital of France.
  • Differentiating features and similar nodes – where similarity acts as an implicit relationship.
  • Importance and popularity – if Einstein is mentioned in a text, without much context suggesting alternatives, the machine should assume it is a reference to the most popular entity with this name – Albert Einstein, the genius physicist.
  • Co-occurrence – when entities are typically mentioned together.

The Knowledge Graph and the Enterprise

Within an enterprise context, a Knowledge Graph can be used to enhance Text Analytics in several ways. For example, a machine’s reading skills can be significantly improved when the system is fine-tuned to read Wikipedia, the news, various databases and sets of documents.

This can happen with the creation and, further, with the extension of an underlying Knowledge Graph with specialized datasets that will fine-tune the system’s reading skills with respect to specific types of text and the sort of facts that should be extracted from them. For instance, once built, a Knowledge Graph can further be augmented with proprietary databases, documents and other data about products, employees, clients and suppliers over time.

At Ontotext, the Text Analytics processes coupled with the creation and maintenance of a Knowledge Graph are proven to be of immense help in the fields of:

  • Publishing and Media – a system can process terminologies in various domains such as engineering, technology, etc.
  • Financial services – a system can process macro- and microeconomic concepts, financial instruments, money amounts, etc..
  • Life science and Healthcare – a system can be fed with knowledge about genomics (genes, mutations, assay), drug products (vendors, brand names, ingredients), etc.

The Meaningful Way Out of the Textual Heaps

Despite being overwhelmed by texts flying around us at electric speed, we can still get better results faster and search our content more easily with minimum effort and maximum clarity and meaning. We only need to conceive of Text Analytics in the broader context of teaching machines not only to read for us but to read for us better, that is, with more awareness of the context around the text itself.

Knowledge Graphs put information in context and allow its better interlinking, interpretation, analytics and reuse. Click To Tweet

Tons of diverse documents online can be easily automatically retrieved, with the needed information extracted from them, combined and made sense of. And this is how, in concert with having information collected from different sources about a place or a person, Text Analytics makes the power of the billions of words and numbers swirling across our digital spaces work for us in a meaningful way.

For wouldn’t it be wonderful to have a machine read all the latest laws or sift through a thousand of documents to find an answer to a question of ours?

It would be! And it is.

Want to learn more about knowledge graphs and their application in the enterprise?

New call-to-action

Article's content

Marketing Expert at Ontotext

Teodora is a philologist fascinated by the metamorphoses of text on the Web. Curious about our networked lives, she explores how the Semantic Web vision unfolds, transforming the possibilities of the written word. From 2022 on, Teodora helps with the creation and curation of the Ontotext knowledge graph to foster information ecology out of marketing content that will enable relevant user experiences across Ontotext's universe.

GraphDB in Action: Smells Like Semantics Spirit

Read about a project called Odeuropa and a number of exciting applications delivered by it with our RDF graph database humming behind them

KGF 2023: Bikes To The Moon, Datastrophies, Abstract Art And A Knowledge Graph Forum To Embrace Them All

Read about this year’s Knowledge Graph Forum – a space where Ontotext and partners presented smart and linked ways to tame and thrive on complexity, rather than be drowned by it

Do Large Language Models Dream of Knowledge Graphs – Impressions from Day 2 At SEMANTiCS 2023

Read our report from Day 2 of SEMANTiCS 2023 to find out if ChatGPT is the killer app for the Semantic Web, how do we tame the genie of LLMs for Healthcare and more

Can LLMs Become Knowledgeable – Impressions from Day 1 At SEMANTiCS 2023

Read about the interplay between LLMs & KGs and how business and academia tackle them in our report from Day 1 at SEMANTiCS 2023

It’s Time We Give Each Other More Data Spaces: Impressions from the Pre-conference Day at SEMANTiCS 2023

Read about SEMANTiCS pre-conference day, which covered the topics of interoperability, ESG data, knowledge engineering, scholarly communication, and academia & industry collaboration.

GraphDB in Action: Navigating Knowledge About Living Spaces, Cyber-physical Environments and Skies 

Read about three inspiring GraphDB-powered use cases of connecting data in a meaningful way to enable smart buildings, interoperable design engineering and ontology-based air-traffic control

Your Knowledge Graph Journey In Three Simple Steps

A bird’s eye view on where to start in building a knowledge graph solution to help your business excel in a data-driven market

GraphDB in Action: Putting the Most Reliable RDF Database to Work for Better Human-machine Interaction

Read about the world of academia research projects that use GraphDB to meet the challenges of heterogeneous data across various domains

Knowledge Graphs for Retail – Connecting People, Products and Platforms

Read about how knowledge graphs can serve the retail industry’s growing need to connect, manage and utilize data efficiently, aligning it in a collaborative data ecosystem

Data Wants To Be Truly Sovereign: Designing Data Spaces with Linked Data Principles In Mind

Read about what data spaces are and how semantic technologies and Linked Data can make them a stronger and safer mechanism for commercial data exchange

GraphDB in Action: Powering State-of-the-Art Research

Read about how academia research projects use GraphDB to power innovative solutions to challenges in the fields of Accounting, Healthcare and Cultural Heritage

KGF22: Knowledge Graphs and The Not So Quiet Cognitive Revolution

Read about Ontotext’s KGF22 days dedicated to stories about knowledge graphs in the domains of Industry, Healthcare & Life Sciences and Financial Services

KGF22: Wittgenstein, Developers Empathy and Other Semantic Data Considerations

Read about our event report from Ontotext’s Knowledge Graph Forum 2022, highlighting expert insight on building knowledge graphs and designing enterprise-grade solutions with semantic technologies.

A Little SEMANTiCS Goes A Long Way

Take a sneak peek at some of the keynote speeches and tutorials throughout SEMANTiCS 2022

It Takes A Village To Raise An Enterprise Knowledge Graph

Read about the design processes behind crafting knowledge-graph enabled solutions and explore some of the stories of our partners.

Smart Buildings Are Built of Smart Data: Knowledge Graphs for Building Automation Systems

Read about how knowledge graphs offer a sustainable solution for harnessing and making sense of heterogeneous data in the building automation industry.

Metadata Moves: Knowledge Graph Technology for Logistics

Read about how the world of metadata humming behind the logistics and other supply chain processes can benefit from using knowledge graph technology.

Electrical Standards, Smart Grids and Your Air Conditioner

Read about how applying Linked Data principles and semantic technology to electricity data can make for a more efficient, reliable and sustainable electricity market.

The Semantic Web: 20 Years And a Handful of Enterprise Knowledge Graphs Later

Read about how the Semantic Web vision reincarnated in thousands of Linked Open Data datasets and millions of Schema.org tagged webpages. And how it enables knowledge graphs to smarten up enterprises data.

Metadata is Like Packaging: Seeing Beyond the Library Card Metaphor

Read about what metadata is, why it is important and how it can enhance the ways information flows across the enterprise.

From Fragmented Data to a Comprehensive Knowledge Graph: The Case for Building an R&D Repository

Read about how enterprise knowledge graphs can unlock meaning and thus create a smart future-proof living repository of scientific data and its relationships.

Texts Without Pages: Advancing Text Analytics with Content Enrichment

Read about how text analytics can be brought forward with content enrichment processes for better text authoring, delivery and navigation.

A Shield Built of Connected Data: Knowledge Graphs Meet Cybersecurity

Read about how a knowledge graph can help organizations stay vigilant of the increasing number of cyber threats, keeping malicious attacks at bay with the help of semantics.

Digital Twins: If It Sounds Like Cyberpunk, It’s Because It Is

Read about what digital twins are, what makes them attractive to companies and how digital twins relate to semantic technology and enable organizations to design, simulate and validate various scenarios virtually.

Eating the Knowledge Soup, Literally

Read about the fluid essence of knowledge and the capability of knowledge graphs to power an information-rich platform of diverse facts about anything, a broccoli soup included.

If Curiosity Cabinets Were Knowledge Graphs

Read about why and how knowledge graph technology can help build networks of interwoven digital objects in the world of cultural heritage.

On the Hunt for Patterns: from Hippocrates to Supercomputers

Read about the ExaMode project that will help medical professional use the power of supercomputers and knowledge graphs for more efficient patient care through data-driven diagnoses.

Crafting a Knowledge Graph: The Semantic Data Modeling Way

Read about how to build a knowledge graph the semantic data modeling way in 10 steps, provided by our knowledge graph technology experts.

A Graphful of Investment Opportunities

Read about the story of an algorithm that mines data to narrow down opportunities for investing.

Okay, You Got a Knowledge Graph Built with Semantic Technology… And Now What?

Read about how knowledge management can be made smarter using a knowledge graph built with semantic technology.

If Johnny Mnemonic Smuggled Linked Data

Read about how semantic technology and Linked Data can help enterprises benefit from smart data management and retrieval practices.

Data, Databases and Deeds: A SPARQL Query to the Rescue

Read about why and how SPARQL queries make for a better search in diverse datasets across an organization in an integrated way.

Semantic Technology and the Way We See the World

Read about how semantic technology can help you set the wheels for many processes related to еfficient data management and governance.

Telling Stories with an RDF Database

Read about the opportunities for authoring and publishing workflows opened by an RDF triplestore.

The Power of URI or Why Odysseus Called Himself Nobody

Read about URI and its power to enable the sharing and reuse of machine-readable data with minimum integration costs.

From Cultivating Nature to Cultivating Data: Semantic Technology and Viticulture

Learn how the potential that Big Data streams bring to grape and wine production can be harnessed with the right kind of technology.

The Knowledge Graph and the Enterprise

Read about the knowledge graph and about how many enterprises are already embracing the idea of benefiting from it.

It Don’t Mean a Thing If It Ain’t Got Semantics

Learn how you can turn data pieces into actionable knowledge and data-driven decisions with an RDF database.

The Bounties of Semantic Data Integration for the Enterprise

Learn about the potential semantic data integration carries for piecing massive amounts of data together.

Here’s a Graph, Go Figure! Coupling Text Analytics with a Knowledge Graph

Learn why and how a Knowledge Graph boosts significantly Text Analytics processes and practices and makes text work for us in a more meaningful way.

Cognitive Computing: Let’s Play an Awareness Game

Read about the new breed of computing is emerging before our eyes – cognitive computing and join us in our Awareness Game.

Machine Learning and Our (Insatiable) Penchant for Making Things Smarter

Read about how machines can be of great help with many tasks where fast and error-free computation over big amounts of data are required.

Staying In the Vanguard of Digital Transformation with Open Data

Learn about Open Data and its potential to be part of smart solutions to data problems and innovative products and services.

Whose Meaning? Which Ontology?

Read about how ontologies open up opportunities for a new class of tools to power information consumption and knowledge management.

Shiny Happy Data: A Praise for RDF

Learn how to choose the right solution for working with your data the conceptual framework of “happy connected people”.

Enterprise Metadata Matters: From Having Data to Acting Upon Them

Learn more about the importance of being metadata-driven today in our latest SlideShare presentation.

Data Daiquiri: The Power of Mixing Data

Learn how companies can tap into the power of the data coming their way by integrating the huge data flows with their proprietary data.

Migrating to GraphDB: Your Why and How in 20 slides

Learn about the steps you need to migrate your data to GraphDB to use it as a smart brain on top of your legacy systems.

Got meaning? Or Why an RDF Graph Database Is Good for Making Sense of Your Data

Read about how you can create systems capable of discovering relationships and detecting patterns within all kinds of data.

Brains Armored with Smart Data

Read our thoughts rising from questions such as “Will Giant Brains Rule the World?” and “Can a mechanical brain replace you?”

One Step Closer to Intertwingularity: Semantic Metadata

Learn about how semantic metadata allows us to add granularity to an object, interlink it to other objects and make it easy to search.

Exceptional User Experiences with Meaningful Content NOW

Content enrichment and semantic web technologies are key to efficient content management. Learn why and see these technologies in action.

Semantic Information Extraction: From Data Bits to Knowledge Bytes

Learn about semantic information extraction and how it pulls out meaningful data from textual sources, ready to be leveraged for insights, decisions and actions.

Weaving Data Into Texts: The Value of Semantic Annotation

Read about how semantic annotation links certain words to context and references that can be processed by an algorithm.

Exploring Linked Open Data with FactForge

Learn about FactForge and how you can turn the opportunities that data flows on the web can pour into our business into a real experience.

What is GraphDB and how can it help you run a smart data-driven business?

Learn about GraphDB in a simple and easy to understand way and see what Ontotext’s semantic graph database has to do with pasta making.

Linked Data for Libraries: Our New Librarians

Learn how semantic technologies can bring audiences back to libraries and make library archives and collections visible and accessible.

Working with Data Just Got Easier: Converting Tabular Data into RDF Within GraphDB

Read about OntoRefine – a new tool that allows you to do many ETL (extract, transform and load) tasks over tabular data.

GraphDB: Answers for Kids of All Ages

Read about how GraphDB can help you clean up messes of data (just like your room) – whether you are a kid or not.

The Knowledge Discovery Quest

Learn how by joining the dots, semantic search enhances the way we look for clues and compare correlations on our knowledge discovery quest.

Connectivity, Open Data and A Bag of Chips

Learn how LOD’s connectivity allows data to be shared seamlessly, used and reused freely. As simple as a bag of chips.

Data Integration: Joining the Data Pieces of Your Business Puzzle

Learn how to use information interconnectedness to integrate, interpret and ultimately make sense of data.

Cooking Up the Semantic Web

Read about the Semantic Web and what it takes to reach its potential and evolve from a Web of Documents to a Web of Data.

Semantic Search: The Paradigm Shift from Results to Relationships

Read about semantic search and how it takes information retrieval to the next level and puts information at our fingertips.

A Web of People and Machines: W3C Semantic Web Standards

Learn how and why Semantic Web Standards are to serve the Web of Data for better collaboration between people through computers.

Thinking Outside the Table

Learn how to manage highly connected data, working with complex queries and having readily available relationships, without the need to express them explicitly.

Our Networked Lives, Publishing and Semantic Technologies

Read about how semantic technology helps publishing handle data in an interconnected way, attaching machine-processable and readable meaning to them.

Why Graph Databases Make a Better Home for Interconnected Data Than the Relational Databases?

Read about how you can turn data into a resource, easily accessed and effectively used across the organization with a graph database.

Text, Data and the Roman Roads: Semantic Enrichment

Read about semantic enrichment and the unique opportunity it offers for interconnecting objects to facilitate knowledge discovery.

4 Things NOW Lets You Do With Content

Go beyond conventional publishing with Ontotext’s News On the Web and get the feel of how you can discover and consume content with semantic technology.