27th Oct 2016


EU bill on data mining lacks ambition

  • "If the Commission is serious about building a Digital Single Market, then it should introduce a rule change that applies to everybody, not just academics."

European researchers have become frustrated in recent years by the restrictions European copyright laws put on their freedom to use text and data mining - two automated techniques for analysing data - on resources they can legally access and analyse with non-automated means.

As part of its recent proposals to reform copyright laws, the European Commission has recommended lifting these restrictions, but only for academics. This is a good first step, but the EU should also allow everyone to take advantage of these more efficient and effective data-driven research methods.

The commission’s proposal is good for European researchers in a wide range of disciplines, from bioinformatics to digital humanities. For scholars and scientists, access to the rigorously scrutinised work of their peers, such as academic journals and databases, has always been a vital resource.

Researchers who subscribe to these sources can explore them using traditional keyword searches and meta-tags predefined by publishers, but that has serious limitations.

Manually reviewing all of these sources is a slow and tedious process, the results of which are often inaccurate and incomplete.

Text and data mining is a powerful tool that allows researchers to plough into texts and datasets and interpret minute details.

Data mining gives researchers the ability to not only find a needle in a haystack, but to quickly find and categorise all manner of small objects hidden in many hundreds or thousands of haystacks.

For example, medical researchers can use technologies like natural language processing to quickly analyse the outcomes of thousands of clinical trials.

This type of analysis supports efforts to develop data-driven precision medicine initiatives that use the latest evidence to deliver personalised treatments.

Data mining cannot provide all of the insights gained from human experts closely studying texts, but it does allow researchers to use rapidly developing tools to draw on a much larger pool of literature and data to support their work.

The use of data mining on copyrighted material often falls foul of existing intellectual property laws because the technical process involves extracting data from its original source and copying it into another database for analysis.

The proposed exemption is reasonable because it creates a special dispensation for data mining and does not alter other laws that prohibit the unauthorised extraction or reproduction of copyrighted works.

After all, there is nothing illegal about “mining” databases manually; this technology only automates the process.

Old methods

A researcher could legally sift through many thousands of published works, note their findings with pen and paper, and then analyse the assembled notes. This is why an exemption for academics is not enough: This method should be legal for anyone.

Copyright law should allow publishers to set the subscription fees for access to their content, prohibit unauthorised reproductions of their content, and receive appropriate compensation. But it should not require people with lawful access to content, such as paid subscribers, to seek approval from publishers for using automated research methods.

Some member states - such as the United Kingdom - have already implemented similar (and similarly inadequate) exceptions. But national legislation is insufficient; the issue should be tackled at the EU level, because research is often cross-border.

Researchers and sources are spread across different countries. Unless the same rule applies throughout Europe, this work is very difficult. For example, is an online repository of books, films, art, and other materials that have been digitised in various member states.

A researcher could legally mine this archive from the UK while a colleague elsewhere could not - or the former could inadvertently commit a crime by mining a resource in the latter’s country.

Not far enough

It does not go far enough, but the commission’s proposal does address this problem for the academic community.

If the commission is serious about building a Digital Single Market, then it should introduce a rule change that applies to everybody, not just academics.

If everyone had this freedom, Europe would enjoy far greater opportunities for data-driven innovation in several sectors. Nevertheless, this exemption could be the first step towards that, so the Council and the Parliament should support it.

Nick Wallace is a Brussels-based analyst for the Centre for Data Innovation, a think tank that focuses on data policy


Europe ready to tackle Greek debt relief

The Greek government has built and broadened alliances in EU institutions and member-states that acknowledge the need to restructure the debt and deliver another economic model for the eurozone.

Stakeholders' Highlights

  1. EU-China ForumDebating the Future of the EU-China Relations on 28 November in Prague
  2. COMECEMigrants: From Fear to Compassion
  3. Birdlife EuropeBusiness as Usual - Juncker Snubs Environment and Protects Broken CAP
  4. EFADraft Bill for a 2nd Scottish Independence Referendum
  5. UNICEFCalls on European Council to Address Plight of Refugee and Migrant Children
  6. ECTAJoin us on 9-10 November in Brussels and Discover the new EU Digital Landscape
  7. Access NowCan you Hear me now? Verizon’s Opportunity to Stand for Global Users
  8. Belgrade Security ForumMeaningful Dialogue Missing Not Only in the Balkans, but Throughout Europe
  9. EuropecheEU Fishing Sector Celebrates Sustainably Sourced Seafood in EU Parliament
  10. World VisionWomen and Girls Urge EU Leadership to Help end Gender-based Violence
  11. Belgrade Security ForumGet the Latest News and Updates on the Belgrade Security Forum @BelSecForum
  12. Crowdsourcing Week EuropeMaster Crowdsourcing, Crowdfunding and Innovation! Conference 21 November - 10% Discount Code CSWEU16