AnnoMarket: Annotation Resource Marketplace in the Cloud

  • Completed
  • Programme: FP7
  • Call: ICT-2011.4.1 - SME initiative on Digital Content and Languages
  • Start date: 01.06.2012
  • End date: 31.05.2014

AnnoMarket (Annotation Resource Marketplace in the Cloud) aims to revolutionize the text annotation market, by delivering – an affordable, open marketplace for pay-as-you-go, cloud-based extraction resources and services, in multiple languages. The project is driven by a commercially-dominated consortium, from 3 EU countries and with 41% of the budget assigned to SMEs.

Contact: Marin Dimitrov

Project Overview

The key differentiating feature of AnnoMarket is its open marketplace concept. In addition, the Software-as-a-Service (SaaS) model reduces the complexity of deployment, maintenance, customization and sharing of text processing resources and services, making them affordable to SMEs – both end users and resource providers.

The main beneficiaries will be the SME providers of text analysis resources and services, who will be able to deploy their custom components/applications and receive revenue via the AnnoMarket marketplace. There will be a mixture of paid-for proprietary resources and services and free open-source ones, in different languages.

AnnoMarket will also promote customization and re-targeting to new vertical domains and languages. The open-source nature of the underlying infrastructure will encourage the participation of an already existing strong developer community and enable easy deployment on private and public cloud infrastructures. Pricing will be transparent (based on data volumes and API calls) and the business model self-sustainable.

Ontotext’s Role

Ontotext is responsible for the AnnoMarket cloud services, including: marketplace, cloud deployment of NLP pipelines and tools, integrated authentication, security and encryption, pricing, Amazon cloud deployment, logging, reporting, billing.

Ontotext also developed a browser extension for semantic annotation, comparable to OpenCalais and Alchemy. It also performed system benchmarking: throughput (annotated documents / Megabytes per second), backend and frontend load (CPU and I/O), average and max latency.

Ontotext Newsletter