• Blog
  • Informational

Open Data Play in Sports Journalism And EURO 2016

June 23, 2016 6 mins. read Milena Yankova

Open Data Journalism

Sir Arthur Conan Doyle was The Daily Mail’s star reporter at the marathon race in the 1908 London Olympics. His reports of the dramatic competition contributed to creating the marathon myth and making Italian long-distance runner Dorando Pietri a legend. Pietri collapsed just a few feet before the finish line and was assisted through it by officials, which later led to his disqualification.

Conan Doyle – who wrote “the Italian’s great performance can never be effaced from our records of sport, be the decision of the judges what it may” – led a campaign for the Italian runner and raised £309, which Pietri donated to charity.

The Modern-Day Sherlock

In today’s sports and reporting, journalists are a kind of Conan Doyle’s most famous fictional detective in literature, Sherlock Holmes.

Sports journalists have to deal with a lot of facts and statistics in order to support their articles with information and increase trust, to investigate the connection between money, sports and sponsorship, and to pursue transparency and accountability from governments and authorities.

Still, what is a great detective without his signature hat, a pipe and a magnifying glass? In sports reporting, the magnifying glass is the statistics that make the backbone of a match report, an exclusive interview or an insightful original story behind the numbers.

Sports journalists Open Data

Open Data Opportunities

Data is the primary source and the easier it is to obtain, the easier the job of the writer would be. Open Data is freely accessible for anyone to use and share so it does not need freedom of information requests. Having open datasets of statistics about sports or an athlete helps sports writers have the basics of their stories or add another layer to their articles.

Yet, huge raw datasets from various sources may turn out to be even more difficult to analyze than getting to sit down with Leo Messi for an exclusive interview.

The Semantic Technology Magnifying Glass

So our sports journalist Sherlock may need a little help from a Semantic Technology ‘magnifying glass’ to see the whole content organized and interlinked. One such technology, the semantic graph database (also known as RDF triplestore), for example, creates a rich Knowledge Graph structure with links to open datasets like DBpedia, GeoNames, etc.

The ability to show links between concepts and to infer new knowledge out of existing facts distinguishes the semantic graph database from relational databases or any spreadsheet-like set of top goal scorers or league standings. Click To Tweet

A semantic graph database like Ontotext’s GraphDB stores the statement ‘Wayne Rooney plays for England’, for example, in the form of a subject-predicate-object statement, also called a triple. ‘Wayne Rooney’ is the subject, ‘plays for’ is the predicate, and ‘England’ is the object, with the predicate showing the relationship between the subject and object.

Inference shows that Rooney currently plays in Group B of the UEAFA Euro 2016, England’s group in the tournament, which generates more content to be further reused, repurposed or repackaged.

The BBC Sport website is the poster boy for a semantic news publishing technology. The BBC first started using Dynamic Semantic Publishing for the 2010 World Cup and scaled it up to a Linked Data platform for its online content.


Magnifying Glass Semantic Technology

Open Data for Sports Reporting & Fans

Open Data surely contributes to better content creation. It can provide the raw numbers for reporters to back their investigative stories with. Thus, sports journalists can focus their efforts on writing what the data actually reveals and have enough time for the exclusive reporting, investigative analyses or interviews.

Furthermore, Open Data from smart cities or government agencies help both journalists and fans with information on venues, stadium capacity, security, routes to the venues, traffic.

For example, Rio de Janeiro and research groups have created transport mobility apps for the upcoming 2016 Summer Olympics, which Rio is hosting this August. The apps, based on Open Data, will be offering travel options to facilitate the arrival of spectators to competition venues.

The UK’s data.gov.uk government data features datasets such as Sport Pitches Playing Fields and Statistics on football banning orders. The publisher of the latter set, the Home Office, has included in the data the number of arrests and banning orders issued during the season, shown by club and by offense.

Open Data for Transparency and Accountability

Governments in the UK and the US have already opened many public datasets, including such on government spending.

At the same time, sport and its implications on the global business, sponsorship and bidding to host Olympic Games or World Cups have become front-page news and quite worthy of investigative reporting, not only by sports writers. Unfortunately, data on these are not open-source and open-access, and may well not be such soon.

In the huge bribery scandal at FIFA last year, the US indicted 14 FIFA officials over “rampant, systemic, and deep-rooted” corruption, especially in accepting bribes for securing that certain nations would host World Cups. This is why our sports and investigative Sherlock Holmes should push for more transparency and accountability via data openness.

In June 2015, Jack Hardinges from the Open Data Institute (ODI) wrote:

Adopting an open data policy could act as a turning point for FIFA. It could be the way to restore faith and trust in it as the global face of football amongst fans, sponsors and wider global community.

It’s not only FIFA that needs opening up data, though. The use of banned performance-enhancing substances, as well as the TV and sponsorship deals in all sports, are also worthy of reporting and investigating. Open data, if and when it becomes available, would do a huge favor not only to journalism but also to the general public.

Meanwhile, our sports reporter Sherlock is looking through a Semantic Technology magnifying glass to unlock the value of Open Data, Linked Data and Big Data in sports reporting in order to engage more audience and seek truth and transparency.

In a football analogy, the late Johan Cruyff put it like this:

You play football with your head, and your legs are there to help you.


Master Publishing Brochure
          New call-to-action

Article's content

A bright lady with a PhD in Computer Science, Milena's path started in the role of a developer, passed through project and quickly led her to product management. For her a constant source of miracles is how technology supports and alters our behaviour, engagement and social connections.

Linked Data Solutions for Empowering Analytics in Fintech

Read about how FinTech can use the power of Linked Data to put data into context and expose various links between these concepts.

Semantic Technology: Creating Smarter Content for Publishers

Learn how semantic technology helps publishers create better content publishing workflows and improved content consumption for readers.

The 5 Key Drivers Of Why Graph Databases Are Gaining Popularity

Read about the 5 key characteristics of graph databases – speed, meaning, answers, relationships, and transformation.

GraphDB Migration Service: The 10-Step Pathway from Data to Insights

Welcome to our GraphDB Migration Service that helps you prepare for migrating your data to GraphDB, walks you through the setup and monitors performance.

Fighting Fake News: Ontotext’s Role in EU-Funded Pheme Project

Read about the EU-funded project PHEME aiming to create a computational framework for automatic discovery and verification of information at scale and fast.

Semantic Technology: The Future of Independent Investment Research

Learn how independent research firms use cutting-edge technologies to add value to research pieces and monetize the content they offer.

Top 5 Semantic Technology Trends to Look for in 2017

Read about the top 5 trends in which Semantic Technology enables enterprises to make sense of their data and fine-tune their offerings to customers.

Ontotext’s 2016: Our Top 7 Webinars Of The Year

Data shows that in 2016 we had a total of 22 webinars that attracted over 7 000 people – here are the 7 best webinars!

Ontotext’s 2016: What Did You Liked The Most On The Blog

Nearly 10 000 people read our blog in 2016 and the following 5 posts gathered most interest.

Linked Data in Regtech: Boosting Compliance and Performance

Learn how regulatory technology, coupled with semantic technology, can help enterprises and financial institutions reduce exposure to risk.

How Data Integration Joined the Music Hit Charts

Learn how today it is the Internet, data integration, and tailored recommendations that stage the music scene for the new Bob Dylans.

Open Data Innovation? Open Your Data And See It Happen

Learn how open data trend-setting governments and local authorities are opening up data sets and actively encouraging innovation.

Linked Data Innovation – A Key To Foster Business Growth

Learn how freely available and machine-readable Linked Open Data enriches organizations’ data and helps them discover new links and insights.

Linked Data Approach to Smart Insurance Analytics

Read about how Linked Data and semantic technology can enrich data and pave the way to advanced analytics.

Linked Data Paths To A Smart Tourism Journey

Read about how the tourism industry can benefit from Linked Data and big data analytics for wiser investments and higher profits.

Linked Data Pathways To Wisdom

Learn about the linked data pathways to wisdom through ‘who’, ‘what’, ‘when’, ‘where’, ‘why’, ‘how to’ and, finally, ‘what is best’.

Taking Semantic Web to its Next Level with Cognitive Computing

Learn about the new age of cognitive computing and integrating its concepts into two decades of semantic web growth.

Open Data Play in Sports Journalism And EURO 2016

Read about how open data gives those modern-day Sherlocks the bases of their stories.

Open Data Sources for Empowering Smart Analytics

Learn how Open Data and how more businesses use data analytics to gain insights, predict trends and make data-driven decisions.

Journalism in the Age of Open Data

Learn how governments and authorities can start relying more on journalism to promote the use of open data and its social and economic value.

Building Linked Data Bridges To Fish In Data Lakes

Learn how enterprises can build bridges to extracting more powerful and more relevant insights from their Big Data analytics.

Open Data Use Cases In Five Cities

Learn how London, Chicago, New York, Amsterdam and Sofia deal with open data and extract social and business value from databases.

ODI Summit Take Out: Open Data To Be Considered Infrastructure

Learn about The ODI’s second Summit with prominent speakers such as Sir Tim Berners-Lee, Martha Lane Fox and Sir Nigel Shadbolt.

Highlights from the “Mining Electronic Health Records for Insights” Webinar

Read some of the Q&As from our webinar “Mining Electronic Health Records for Insights”.

Highlights from ISWC 2015 – Day Three

The 14th International SemanticWeb Conference started three days ago and Ontotext has been its most prominent sponsor for 13 years in a row.

Highlights from ISWC 2015 – Day Two

The 14th International SemanticWeb Conference started three days ago and Ontotext has been its most prominent sponsor for 13 years in a row.

Overcoming the Next Hurdle in the Digital Healthcare Revolution: EHR Semantic Interoperability

Learn how NLP techniques can process large volumes of clinical text while automatically encoding clinical information in a structured form.

Highlights from ISWC 2015 – Day One

The 14th International SemanticWeb Conference started three days ago and Ontotext has been its most prominent sponsor for 13 years in a row.

Text Mining to Triplestores – The Full Semantic Circle

Read about the unique blend of technology offered by Ontotext – coupling text mining and RDF triplestores.

Text Mining & Graph Databases – Two Technologies that Work Well Together

Learn how connecting text mining to a graph database like GraphDB can help you improve your decision making.

Semantic Publishing – Relevant Recommendations Create a Unique User Experience

Learn how semantic publishing can personalize user experience by delivering contextual content based on NLP, search history, user profiles and semantically enriched data.

Why are graph databases hot? Because they tell a story…

Learn how graph databases like GraphDB allow you to connect the dots and to tell a story.