Ontotext

Clients by Industry

Ontotext has completed a transition from a purely research organization to a supplier of commercial products and services and has amassed a dazzling array of top-class clients, spanning the globe and a number of industries.

A sampling of the diversity of our clients is shown below, with a short description of the supplied products and services. We have also included links to some of our most important business success stories. (See Collaborations for long-standing specific collaborations and Partners for information about our technology partners.)

Media, Publishing

Global media organizations have to face increasing volumes of topics and news, a distributed writer workforce, and ever-shortening publication deadlines. Instead of writing monolithic articles about one main topic, it is more efficient to create small journalistic assets (eg one photo or paragraph) and dispatch them automatically to all relevant topics. As a result, Semantic Media Publishing has emerged as an innovative IT area.

Probably the highest profile application of semantic technology to date, BBC’s 2010 World Cup web site is delivered using the OWLIM Enterprise semantic repository. A famous "call to action" by John O'Donovan (Chief Technical Architect, BBC) has spurred a flurry of semantic technology activity in world-known media companies, particularly in the UK and US.

See more details in the BBC: Dynamic Semantic Publishing success story

The producer of 40% of UK news, Press Association chose Ontotext to enrich all news assets with information on mentioned entities and facts - a service ready to be delivered to their clients. The confirmation they had made the right choice followed soon afterwards when Press Association and our technology were selected to handle the official news for the London 2012 Olympics, enriching them with athletes, disciplines and competing countries.

Now, Press Association is looking into sharing this richness with their enterprise clients, such as AOL, Yahoo!, BBC and MSN.

In an unprecedented act of joint involvement in semantic technologies, ALL daily publishers from the Netherlands commissioned Ontotext, and our partners Dayon, to create the next generation publishing platform. Though competitors in printed daily newspapers, Dutch publishers have decided to make a commercial alliance in order to address the needs of tomorrow.

To realise their idea, Ontotext had the task to create a semantic tagging platform, by sourcing the proper data sets and engineering the text analytics, necessary to build the semantic indices that would drive Newz and 3rd party apps. After this success, more opportunities and more news to come soon.

For the last year and a half Siemens and their partner Publicis have been investing in building a semantic tagging framework, commissioned to Ontotext. The game changer that both organisations had in mind was related to the authoring process - how a journalist or a copywriter creates a wider coverage content.

With our technologies, it turned out to be pretty trivial. Every paragraph submitted to the system in the WYSIWYG editor fired a query returning relevant articles/content from the Siemens' content silos. Adopting this technology would enable big medias to produce more analytical stories instead of the common digest-type ones. Take a moment to figure out which one brings more value.

A leading international business-to-business publisher, focused primarily on international finance, chose Ontotext to develop their semantic publishing platform, targeting macroeconomics, IPOs, bond issuance, M&A deals and other investment-related events.

undisclosed

One of the largest US media companies is currently engaged with Ontotext in a Proof Of Concept (POC) pilot involving application of semantic technologies to the political domain

NetInfo is the leading Bulgarian internet media holding, reaching 80% of all BG internet users. AdWise is a targeted advertisement management platform. Ontotext has developed for AdWise a component to evaluate the relevance of an advertisement to a piece of content (e-mail or web-page), so the platform can show only ads that fit the user's context. This uses content similarity metrics, taking into account specifics of the language, various possible word forms, and the varying significance of words

Life Sciences: Pharmaceutical, Gene Research, Health

Medical research, pharmaceutical and other life sciences organizations have to deal with huge and ever increasing volumes of information that range from structured (distributed departmental databases) to semi-structured (excel tables without a common unifying format) to unstructured (free-text documents) and span both corporate servers and the global medical community. The improved discovery and enriched searchability of data relevant to a particular clinical study or other medical research endeavor has direct financial benefits, by reducing the amount of specific research that is required.

AstraZeneca is a top-5 global pharmaceutical company. Ontotext has executed a number of successful projects for AstraZeneca, such as data integration of clinical studies, analysis and retrieval of clinical trial reports as well as integration of biomedical databases for identification of drug targets and interactions

AZ is also a long-term research partner of Ontotext in the EU research project LarKC. Learn more what Ontotext does for AZ in the field of causality mining here.

UCB is a global biopharmaceutical company focused on the discovery and development of innovative medicines and solutions to transform the lives of people living with severe diseases of the immune system or of the central nervous system. UCB selected Ontotext and the Linked Life Data (LLD) service to provide support for the linked data cloud and leverage its capabilities to help support research questions faced in the drug discovery process. UCB is using LLD to integrate dozens of public biological, chemical and medical data sources into its internal research stream for drug discovery. The knowledge base is a valuable source of research data that may be used to generate and validate complex hypothesis and does this while offloading the data integration cost from the UCB researchers.

The mission of Database Center for Life Science is to integrate and improve biomedical databases and create a user-friendly portal website of all life science datasets in Japan.
National Institute of Biomedical Innovation uses OWLIM for web application that displays information about RNA microarray experiments and associated data

undisclosed

A growing consultancy and solutions oriented business where Ontotext provides automatic information extraction of meta-data from clinical documents and semantic tagging; advanced semantic search and medical documents linking.

South London and Maudsley is the largest provider of secondary mental healthcare in Europe. Ontotext introduces new text analysis functionality in the Biomedical Research Center (BRC) Case Register Interactive Search (CRIS), which is primary used to generate in-depth secondary analysis by investigating a combination of patient summary meta-data. See more details in the SLAM: Electronic Patient Records success story.

Government Institutions

Many governments are starting to adopt semantic web technologies for Open Government Data publishing, as an important enabler to fulfill their openness and transparency mandates.

Ontotext's OWLIM database is used in an indexing and search application at the UK Parliament where the records of both Houses, as contained in the official report, called Hansard, is combined with several other data sources, including a feed from legislation.gov.uk (see below). Semantic annotations are made to the content by a team of indexers and made available for search, both semantic search driven by SPARQL queries over OWLIM, and a full-text search via SOLR, which is populated from OWLIM.

legislation.gov.uk carries most types of legislation and their accompanying explanatory documents. It has about 2 million visitors monthly. The Stationery Office (TSO) developed, and now hosts and manages, the legislation.gov.uk site for The National Archives (TNA). The programme to implement the site required content migration from various sources and resulted into a massive 6.5 million documents that are available on the website and this is growing daily. For the first time the government can demonstrate precisely how an Act would look if an Amending Bill is passed. OWLIM is being used to support a new, sophisticated, editorial and workflow management system that underpins legislation.gov.uk.OWLIM was chosen because of its scalability, support for transactional processes and SPARQL 1.1 support.

Food and Agriculture Organization of the United Nations (FAO) chose OWLIM as an RDF Database for their portal data.fao.org. The portal is an innovative web-based platform that brings together statistics, maps and pictures (and soon documents) on nutrition, food and agriculture from throughout FAO, providing easy access, a powerful search engine and data visualizations all from one convenient location.

Read more about it in the official FAO blog here and here

The Canadian Government Department - Natural Resources Canada (NRCan) seeks to enhance the responsible development and use of Canada’s natural resources. Through the use of OWLIM NRCan are planning to move their RDBMS static file based mediation database, which allows querying heterogeneous databases (schematic and semantic), to a RDF store that provides reasoning. Speed is the main element to consider in the project, due to the fact that the translation involves a significant amount of vocabulary conversion.

Telecoms, Interactive Television

Telecommunication and television deal with huge amounts of data and offer rich areas of application of semantic technologies, eg for mining of client and billing data, making recommendations, etc

“KT provides semantic VOD search service for IPTV and various smart mobile devices using OWLIM as semantic repository. We found that BigOWLIM is suitable for our service, as it has shown good performance for its price compared with other products in the actual simulation tests. Best of all, OWLIM could be easily tested, installed and used thanks to open library, detailed guidelines, and lots of samples etc. By and large, our experience is that the OWLIM is very powerful and reliable high-performance semantic repository. ” – Joo Won Sung, Senior Researcher, Central R&D Laboratory, KT. See the KT: Interactive TV success story for details.

I.S.D.D. plus is a Slovak software company providing complex services in the field of information technologies, specializing in telecom and banking industries. They chose OWLIM to take advantage specifically of its Geo-Spatial Extensions.

Archives, Libraries, Cultural Heritage

The UK Government contracted Ontotext to implement a Semantic Knowledge Base for the Government Web Archive. The project brings together publicly available linked data and open-source text mining technology for semantic indexing and search of over 150M documents. See The National Archives: Semantic Knowledge Base success story.

An Ontotext-led consortium won The British Museum tender for development of the ResearchSpace project funded by the Andrew W. Mellon Foundation that aims to support collaborative web-based research, information sharing and web publishing for the cultural heritage scholarly community.

The system will be based on the CIDOC CRM ontology, which supports very generic description of cultural artifacts, related events (e.g. creation, acquisition, curation, conservation), annotations, research discourse. See more

ConservationSpace is another project funded by the Andrew W. Mellon Foundation that is run by the National Gallery of Art, Washington D.C. and 7 other institutional partners. The project will develop an open-source system to address a core need of the conservation community for a shared solution for documentation management.
Sirma ITT and Ontotext won the international tender for the Build phase of the project, making this the second Mellon project being developed by Ontotext.

The British Museum (BM) and Yale Center for British Art (YCBA) have selected OWLIM for their semantic repository, because of its high performance for both data loading and querying, strict standards compliance, OWL reasoning and rule inferencing that are useful for CRM, especially regarding complex search. An added benefit is that OWLIM supports the new W3C standard SPARQL 1.1 Federated Query that allows a user to interrogate several semantic repositories at once. Federation can help research communities and projects to collaborate on a certain topic, even when the relevant works and data are scattered in different collections. See details.

In September 2012 Europeana published its API for free access to all Europeana data  amounting to more than 20 million cultural heritage objects from the entire Europe.  A month later the RDF dataset in EDM (Europeana Data Model) format was made available to the public. Ontotext was invited to host the Europeana SPARQL endpoint. Europeana SPARQL end point is a reason-able view of the web of data, loaded in OWLIM semantic repository with OWL-Horst inference and using Forest framework as a front end. It comprises 993 million explicit statements, and close to 4 billion retrievable statements.  It is accessible at http://europeana.ontotext.com, and via http://data.europeana.eu.

Europeana Creative is an European CIP PSP project that started in Feb 2013. It will allow Europeana to facilitate the creative re-use of cultural heritage metadata and content. It will create five pilot applications in the thematic areas of History Education, Natural History Education, Tourism, Social Networks, and Design.
Ontotext will work on the Content reuse framework (core backend component of the architecture) and on Geo-tagging of cultural heritage objects.

The consortium Dayon (NL) - Ontotext (BG) is one of 3 that won Lot2 of an ambitious cultural heritage (CH) aggregation project of the Dutch Public Library http://bibliotheek.nl (BNL). The project aims to create an Open Search Platform to search across a variety of national and regional CH data sources, including libraries, audio collections, manuscripts, etc.
The goal of Lot2 is to aggregate data from approximately 150 sources, for a total of 40M heritage objects (more than twice the current number in Europeana). Ontotext will deal with ontology modeling, architecture, conversion pipelines, XML to RDF conversion, etc.

Sofia University is the oldest Bulgarian university and hosts the recently established BG-KR IT Cooperation Center (ITCC).
Part of ITCC's research focus includes semantic technology applications to Cultural Heritage, in particular semantic publishing of Bulgarian cultural heritage to Europeana. Ontotext established Bulgariana.eu as a Bulgarian aggregator for Europeana, a networking group for cultural heritage in Bulgaria, and organized 2 conferences.

FP7 CHARISMA (Cultural Heritage Advanced Research Infrastructures: Synergy for a Multidisciplinary Approach to Conservation/Restoration) is an EU-funded Integrated Project carried out in the FP7 Capacities specific programme "Research Infrastructures". The project provides transnational access to most advanced scientific instrumentation and knowledge; allowing scientists, conservators-restorers and curators to enhance their research. CHARISMA uses OWLIM to power a portal that provides metadata from 6 major European cultural institutions: Centre de Recherche et des Restauration, Louvre (FR), The National Gallery London, The British Museum, Opificio delle Pietre Dure (IT), ICN Cultural Heritage Agency of the Netherlands, Museo Nacional del Prado (ES)

LODAC (Linked Open Data in Academia) is created by Japan's National Institute of Informatics and aggregates various information across multiple Japanese resources as LOD.
Many Japanese museums have digitized their museum collections. This data is scraped by LODAC and mapped to RDF in CIDOC CRM. Associated artists and artworks from different museum collections are associated and integrated data views are provided. The result is LODAC-Museum, including web presentation, natural language search and SPARQL endpoint. See more

The Polish Digital National Museum aggregates artifacts from cultural institutions in the Digital Libraries Federation PIONIER Network: over 70 contributing institutions including universities, libraries, museums, archives, research. The Poznan Supercomputing and Networking Center transforms all provided data to RDF using common ontologies such as CIDOC CRM, and relevant vocabularies related to Europeana. The aggregated collection contains 681 thousand objects and is published with specially developed software (dMuseion). OWLIM is used as the RDF repository for this project.

The Gothenburg City Museum provided close to 9K museum objects from two collections to build a use case within the MOLTO FP7 project for a knowledge representation infrastructure that allows querying RDF and presenting RDF results in natural language. The knowledge representation infrastructure is based on Ontotext's approach "reason-able views of the web of data". Museum data is modelled according to CIDOC CRM, integrated with the DBpedia and GeoNames datasets, and upper-level ontologies such as PROTON that facilitate the integration. The museum reason-able view contains 305M triples, and is accessible via a SPARQL endpoint here.

Domain-specific Semantics and Search

DATALAN is a leading Slovak provider of innovative business solutions and IT services. They chose OWLIM as a foundation for their new business venture that will be announced soon.
Undisclosed Ontotext created semantic patent search software for a provider of integrated patent management solutions. A demo can be seen in KIM Showcases.

RonsMap is an innovative provider of car offer information in the US.
  • It collects information from inventory of US car dealers, used car offers.
  • Presents all data in a unified semantic format.
  • Allows intelligent searching and subscription-based notifications

Ontotext is developing a semantic knowledge base of foods, recipes, nutrients, cooking etc. Recipes are crawled from numerous sites; processed to extract knowledge about ingredients, cooking times, nutrition information; and presented through various dynamic and intelligent interfaces that allow sorting and filtering by a number of preferences. See more details in the EDAMAM Food Knowledge Base success story

Defense, Security, Financial Intelligence

Defense and Homeland security have been early adopters of innovative technologies related to inference and artificial intelligence. Semantic technologies offer the best way to integrate data from numerous disparate sources (both public and intelligence-related) and facts obtained through text mining.

LMI is a non-profit management consulting firm serving U.S. Government departments and agencies. LMI has partnered with Ontotext to combine Ontotext’s semantic annotation and search capabilities with OWLIM to create an advanced document search. Open Policy offers end-users facetted searching capabilities alongside semantic keywords and full text search in an easy-to-use browser interface. The combined search capabilities vastly improve access to key regulatory information for government users at all levels- from field operations to policy specialists.

Open Policy is deployed at the Office of Deputy Assistant Secretary of Defense, Supply Chain Integration (DASD - SCI) and in the Centers for Medicaid and Medicare Support (CMS). 

Ontotext created the Asset Recovery Intelligence System (ARIS) Platform prototype for the International Center for Asset Recovery (ICAR) of Basel Institute on Governance. ARIS helps financial investigators, analysts and Financial Intelligence Units in developing countries with the tracking of assets stolen by departing dictators (kleptocrats). See more details in the Basel: Uncovering Financial Crime success story

One of the top-5 defense contractors in the US uses OWLIM for semantic technology projects related to defense

Academic Institutions

Ontotext has long-standing collaborations in the semantic technologies area with leading European universities, such as Sheffield University (GATE group). Below are listed some other academic clients:

Ontotext provided product licenses and training services to Notthingham Trent to kick-off their semantic technologies projects

The Pohang University of Science and Technology (POSTECH) is one of Korea's top universities dedicated to science and engineering. POSTECH uses OWLIM for semantic technology research and industrial projects

National Institute of Informatics is Japan's general academic research institution seeking to create future value in informatics. NII seeks to advance integrated research and development activities in information-related fields, including networking, software, and content. NII uses OWLIM for research related to semantic technologies

Innovation, Consulting

Various consulting and innovation companies have started using semantic technologies in their endeavors.

Innovaro provides clients a comprehensive portfolio of solutions to navigate the complexities of the entire innovation ecosystem. It has selected semantic technologies as a prominent area of innovation and licenses OWLIM for semantic-related projects

Knowledge Integration specializes in the development of high quality open source products and components based around open standards and specifications. It has selected OWLIM as the RDF database of choice for its endeavors

Volz Innovation supports organizations in process and product innovation with new technologies. Volz has created new products and technologies for the Web, mobile navigation, enterprise software and cloud computing. Volz uses OWLIM for its semantic technology projects