Research Data Network

Research Data Network

Welcome to the Research Data Network. Jisc set up this site for you to exchange information on research data management and related topics within the UK research information management sector.

We have now produced a research data management toolkit https://rdmtoolkit.jisc.ac.uk/

Go there to have your questions about RDM answered and leave us some feedback if you think part of the kit needs updating, or additional resources. You can find hot topics, links from the research data lifecycle and even access your local RDM pages from our toolkit: https://rdmtoolkit.jisc.ac.uk/rdm-for-researchers/

We still want to collect a body of useful resources, help people share up to minute developments, practices and develop new ideas, but the toolkit will be the place to do that. This network site will be closed by end 2018. Most of the material has already been migrated into the toolkit, but our working group will review what's here before closure.

Also, if you are interested, you could get involved with our working group, just get in touch using the free text in the comment form: https://rdmtoolkit.jisc.ac.uk/suggest-an-update/

Get Started    

Case study titles and abstracts

Research Data Management: Benefits and Data reuse case studies

Introduction

Research Data Management (RDM) is an overarching term encompassing the organisation, storage, and documentation of data generated during research projects. RDM deals with the organisation and curation of active research data, with its day-to-day management and use, and with its long-term preservation.
RDM is an important practice for both institutions and individual researchers. Data supporting results should be made available and preserved so as to allow its reuse and the verification of published research. Several other benefits can arise from the implementation of RDM, including increased citations, increased research collaborations, or increased visibility. Today, data has to be managed not only for preservation purposes, but also to fulfil the requirements of most research funders.
Although RDM has been around for a while, the above benefits are usually described qualitatively and the lack of a solid body of evidence makes advocacy difficult. We have sought to fill this gap by using case studies to present a rich and varied picture of the impacts underpinned by RDM.

Spread of disciplines

The case studies assembled here come from a wide range of research fields. Due to inherent differences between disciplines, the benefits of RDM become apparent in different ways. They are more tangible in certain fields, and more abstract in others. Nonetheless, the case studies demonstrate that RDM is a worthwhile activity for all institutions and researchers.
The examples below mostly involve large data management initiatives, as these are more likely to show the wide reach of the benefits of RDM. However, we would like to stress that even smaller data management efforts can have an impact. Unfortunately, this can be very difficult to track, as an individual researcher reusing data from other individual researchers is often lost in a sea of information. Similarly, impact sometimes cannot be traced to a specific source: in some studies, clear evidence of the impact of RDM is available, but they point to a whole repository rather than to a single study or dataset.

Enablers of impact from RDM

The effective implementation of RDM requires both cultural change and specific data skills. This makes its dissemination and practical realisation difficult and is the main obstacle to the above-mentioned benefits. It is, therefore, desirable to examine the RDM environment to investigate its enablers and what has worked historically to encourage future data curation and reuse.
Our research into the benefits or RDM led us to discovering some of the circumstances and situations that facilitate it, along with some of the reasons why this practice should be pursued. A summary of our findings is as follows:

  • Open licensing (e.g., in the case of computer code and algorithms) is essential to allow crowd-sourced improvements.
  • Data repositories and infrastructures are among the most significant enablers of impact: without them, very few of the impact case studies below would have been possible.
  • Collaborations between international bodies or organisations strongly promote data re-use, especially in fields where it was not possible for a single player to take charge. These collaborations create the right environment for sharing and re-use of research data: cultural change is encouraged along with the use of joint infrastructures at a national or international level.
  • The impact of RDM is normally seen after a long time, when, i.e., after has been produced, curated, maintained, and reused. Thus, there is a need for sustained investment in this field, as benefits cannot be seen immediately.
  • Aggregation of data and digitisation of documents are key to encouraging the development of digital humanities. These initiatives often arise from the collaboration between museums, research libraries, and universities. When data that was spread between several sources (e.g., many different books/articles) or held in obsolete formats was organised and analysed through sound RDM, hidden findings could be uncovered.

    Presentation format

    The case studies are grouped into Long Form—more detailed with abstracts etc.— and Short Form. In all cases key information is presented in a table after the text.

Long-form case studies

Title

1: Neurodegeneration and Dementia Research

Subtitle

Leveraging data from cohort studies to accelerate medical progress

Abstract

Dementia and neurodegenerative diseases affect both individuals and society as a whole. However, neither cures nor treatments are available at the moment. The Dementias Platforms UK collaboration aims to turn dementia research into treatments as quickly as possible, to both improve people's lives and decrease the socioeconomic cost related to these types of diseases

Keywords

dementia; Alzheimer; cohort studies

Research subject area

MEDICAL AND HEALTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)

Summary Impact Type

Impact on health and wellbeing

Facts and figures

850,000 people with Dementia, £53 million project (DPUK), £6.9 million project financed using DPUK

Maturity of the initiative/data source

Recent (<5 years)

Year (e.g., first data release, first output, year of impact)

2014

Title

2: Biological Data Mining

Subtitle

Difficulties in data handling in the field of biology

Abstract

Biologists often need to deal with heterogeneous data sources, which makes their work difficult and time-consuming. The InterMine system provides an easy-to-use data warehouse solution that biologists can exploit for their studies with little programming knowledge.

Keywords

biology; data warehouse; data mining

Research subject area

BIOLOGICAL SCIENCES

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)

Facts and figures

574 citation

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2012

Title

3: Gene Expression Omnibus Data to Fight Cancer

Subtitle

Systems biology and enhanced cancer diagnosis

Abstract

Sharing genomic data on the Gene Expression Omnibus (GEO) repository has a significant impact on medical research and improves the efficiency of the publishing environment. New approaches to cancer research were developed, including a novel method of diagnosis based on the study of exosomes.

Keywords

medical research; cancer; repository; publishing

Research subject area

BIOLOGICAL SCIENCES

Type of RDM impact/benefit

Methodological impact (e.g., new approaches developed); Efficiency in research and data re-use (e.g., reduce duplication of effort)

Summary Impact Type

Facts and figures

65,000 results from Google Scholar

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2001

Title

4: The World vs E.Coli

Subtitle

Researchers joining forces to sequence bacterial genomes

Abstract

During the 2011 E.coli outbreak, the release of the bacterial genome in the public domain allowed researchers to reach conclusions quickly and effectively. The strain was tracked to a seed shipment from 2009 and the crow-sourced analysis received high media attention.

Keywords

infection; e.coli; outbreak

Research subject area

MEDICAL AND HEALTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact

Summary Impact Type

Impact on health and wellbeing

Facts and figures

17 authors working in a crowdsourcing effort;
12 academic publications;
8 media outlets covering the initiative;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2011

Title

5: Data-Powered Patents

Subtitle

Citation patterns in granted patents in the field of biology

Abstract

The analysis of data citation patterns in the field of biology showed how over 8,000 patents were based on publicly available data. While this proves the usefulness of repositories as a whole, it also shows how the evaluation of researchers should consider data citations and alternative sources, too, as these are often key to uncover the broader industrial and societal value of academic research.

Keywords

citations; patents; text mining

Research subject area

BIOLOGICAL SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed); Socio/Economic impact

Summary Impact Type

Impact on economy and business

Facts and figures

8,000 patents thanks to open data

Maturity of the initiative/data source

Recent (<5 years)

Year (e.g., first data release, first output, year of impact)

2014

Title

6: Sharing Research Data and Infrastructure to Study Proteins

Subtitle

A research group's impact on the study of circular dichroism

Abstract

The DICHROWEB and PCDDB platforms are widely used for the study of proteins. Since their release, hundreds of thousands of users accessed them, from both academia and the private sector. In academia, the platforms fuel research and teaching, while they led to several advances in industry, including the development of 11 patents.

Keywords

protein; repository; infrastructure

Research subject area

MEDICAL AND HEALTH SCIENCES

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)

Summary Impact Type

Impact on health and wellbeing

Facts and figures

375,000 analyses;
3,600 registered users;
over 1,000 citations;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2001

Title

7: Using Flies to Understand The Human Brain

Subtitle

Harvesting published data to power neurobiology research

Abstract

Studying the brain of fruit flies is helping researchers uncover how our brains work at the molecular level. Data repositories such as the Virtual Fly Brain help them curate, share, and re-use data in a structured way.

Keywords

neurobiology; medicine; fly; brain;

Research subject area

BIOLOGICAL SCIENCES

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort)

Summary Impact Type

Facts and figures

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2012

Title

8: A Data-Based Approach to Preventing Curable Eye Diseases

Subtitle

Leveraging data to inform health policies

Abstract

The Vision Loss Expert Group (VLEG) gathered and released data on vision loss all around the world. The data is localised, which means that every country can tackle local issues to reduce the burden of eye loss. The findings of the VLEG are far-reaching and were picked up by large organisations such as PwC, the World Bank, and the World Economic Forum, shaping policies, debates, and educations programmes worldwide.

Keywords

ophthalmology; eye health; VLEG; GBD

Research subject area

MEDICAL AND HEALTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)

Summary Impact Type

Impact on health and wellbeing; Impact on public services; Impact on politics and governance

Facts and figures

15,000 articles harvested;
$2.2 per capita to eliminate avoidable blindness in developing countries by 2020;
collaboration between 79 scientists;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

1990

Link

Title

9: Self Compacting Concrete

Subtitle

Advancements in engineering led by data aggregation

Abstract

Data from over 250 academic sources was aggregated in the form of a database and used to inform future design of self-compacting concrete. This large-scale study allowed researchers to precisely describe the differences between self-compacting concrete and "traditional" vibrated concrete.

Keywords

concrete; database; self-compacting

Research subject area

ENGINEERING

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort)

Summary Impact Type

Facts and figures

250 articles;
1500 concrete mixtures analysed;

Maturity of the initiative/data source

Recent (<5 years)

Year (e.g., first data release, first output, year of impact)

2014

Link

Title

10: Preventing Drug Interactions On Your Mobile

Subtitle

Novel tools for patients and healthcare professionals

Abstract

The iChart apps developed by the University of Liverpool help clinicians and patients with HIV or hepatitis C better deal with drug interactions. The apps allow clear and ubiquitous access to research data that has been arranged for maximum effectiveness and dissemination, thus, improving patient response and reducing the side effects experienced. In addition, clinicians can save time, as all the information they need is now available directly on their smartphones.

Keywords

hiv; hepatitis; drug interactions

Research subject area

MEDICAL AND HEALTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)

Summary Impact Type

Impact on health and wellbeing

Facts and figures

128 countries;
17,000 downloads;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2010

Link

Title

11: Meeting Sustainability Objectives

Subtitle

Reducing carbon emissions from agricultural production

Abstract

Meeting sustainability objectives is becoming increasingly important to reduce the impact of global warming. The field of agriculture has been deemed responsible for a third of our greenhouse emissions, thus, resources such as the Cool Farm Tool are essential to help people in the sector understand how they can reduce their environmental impact.

Keywords

agriculture; carbon footprint;

Research subject area

ENVIRONMENTAL SCIENCES

Type of RDM impact/benefit

Methodological impact (e.g., new approaches developed); Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact

Summary Impact Type

Environmental impact

Facts and figures

800 global datasets for N2O;
100 global datasets for soil carbon sequestrations;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2011

Link

Title

12: Finding Offshore Hydrocarbons

Subtitle

Leveraging satellite data to improve the efficiency of exploration

Abstract

Satellite data has been re-used to produced a more up-to-date and precise dataset helping with offshore exploration. The improved gravity data prepared by the researchers has shown a very high potential and is estimated to be 10% more accurate than previously-available work. The dataset has been used by major oil companies to drive decision-making and improve the safety of their exploration efforts.

Keywords

oil; exploration; gravity

Research subject area

EARTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)

Summary Impact Type

Impact on economy and business; Environmental impact

Facts and figures

$2.5 million per project
£1.2 million received
10% improvement compared to previous data

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

1998

Link

Title

13: Geological Data Made Easy

Subtitle

The British Geological Survey and the OpenGeoscience portal

Abstract

Geological data was released by the British Geological Survey to align their resources to the principles of open science. Their efforts took the form of the OpenGeoscience portal, where data is shared through an Open Government License. Users of OpenGeoscience resources are encouraged to reshape the data to develop new products, called mash-ups, and more than 20 of these are available on the BGS website.

Keywords

geology; survey; NERC

Research subject area

EARTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact

Summary Impact Type

Impact on economy and business; Environmental impact

Facts and figures

Over 20 mash-ups (projects derived from OpenGeoscience data)

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2010

Link

Title

14: Protecting The Oceans By Coordinating Data Sharing Efforts

Subtitle

UNESCO’s efforts to preserve marine environments

Abstract

UNESCO's efforts to protect marine environments materialised with the creation of IODE in 1961. The programme aims to improve data management in the field and to guide and coordinate the data gathering work by a large number of countries. Such a high-level initiative allows data to be shared very effectively, as no country could possibly gather so much information on its own. In addition,

Keywords

marine; ocean; coordination; unesco

Research subject area

EARTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)

Summary Impact Type

Environmental impact; Impact on politics and governance; Impact on economy and business

Facts and figures

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

1990

Link

Title

15: Corruption in Public Sector Procurement

Subtitle

Digital whistleblowing to quantify the cost of corruption

Abstract

The DIGIWHIST project gathered and elaborated information on public procurement and accountability of public officials within the EU and in neighbouring countries. This was picked up by the European Commission, which released a study showing how corruption may cost Europe up to €990 billion per year.

Keywords

corruption; european commission; transparency; public sector

Research subject area

STUDIES IN HUMAN SOCIETY

Type of RDM impact/benefit

Socio/Economic impact; Efficiency in research and data re-use (e.g., reduce duplication of effort)

Summary Impact Type

Impact on politics and governance; Impact on public services; Impact on economy and business

Facts and figures

up to €990 billion lost to corruption yearly

Maturity of the initiative/data source

Recent (<5 years)

Year (e.g., first data release, first output, year of impact)

2015

Link

Title

16: Drones for Research

Subtitle

Using unmanned aerial vehicles to power new approaches to scientific investigations

Abstract

Drones are becoming a constant presence in technology news and media, thus, it is not surprising that they also caught the attention of the research community. Drones allow researchers to capture aerial images easily and at a low cost, however, the data they gather needs to be properly curated to allow any applications. In the field of agriculture, drone-captured datasets are being used to spot plant diseases and help farmers better protect their yields. In addition, drone time-stamped drone image sets have been used to study how to best protect crops and land from floods.

Keywords

agriculture; drones

Research subject area

AGRICULTURAL AND VETERINARY SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)

Summary Impact Type

Impact on economy and business; Economic; Environmental

Facts and figures

Maturity of the initiative/data source

Recent (<5 years)

Year (e.g., first data release, first output, year of impact)

2016

Link

Title

17: Data-Enhanced Archaeology

Subtitle

Using modern tools to study ancient times

Abstract

In the field of archaeology, data is scarce and difficult to find. This is simply because it usually comes from excavations or physical operations on artefacts, which are normally expensive and can be performed only in certain conditions. The Archaeology Data Service (ADS) aims to fill the gap by freely providing more than 1.3 million metadata records on archaeological data and by driving developments in research data management in the field. The service enables increased efficiency thanks to data reuse and this is valued at at least £13 million per annum.

Keywords

archaeological data; repository

Research subject area

HISTORY AND ARCHAEOLOGY

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)

Summary Impact Type

Facts and figures

£13 million per annum savings due to increased efficiency;
2-fold to 8-fold return on investment;
44% of interviewed stakeholders could not have carried out their work without the ADS;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

1996

Link

Title

18: Supporting Science and Industry by Sharing Computer Code

Subtitle

Sharing software as a form of research data

Abstract

Sharing software is not as common as sharing other types of research data, however, it is sometimes very impactful. The Statistical Parametric Mapping (SPM) software in the field of neuroimaging is an example of how sharing computer code can lead to far-reaching effects. In this case, making the code public allowed the creation of a new field of study and led the software to become the leader in the sector. In addition, thanks to the licensing chosen, some companies were able to exploit the code to create derivative products, which are generating substantial income, while major pharmaceutical companies are using it in the field of drug research.

Keywords

software; code; algorithms

Research subject area

MEDICAL AND HEALTH SCIENCES

Type of RDM impact/benefit

Socio/Economic impact; Methodological impact (e.g., new approaches developed)

Summary Impact Type

Economic; Technological; Health

Facts and figures

64% of users in the field use the software;
€5,000 for each license (software derived from sharing);

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

1991

Link

Title

19: Citizen Science at Zooniverse

Subtitle

Sharing data and efforts via an online platform

Abstract

It is not always easy to deal with large amounts of data. At times, algorithms can help researchers make sense of their large datasets, however, sometimes the human mind cannot be replaced. In these cases, platforms like Zooniverse come to the researchers' help, allowing them to have citizen volunteers analyse scientific data and enable new scientific discoveries. More than 130 articles were published thanks to citizen science, showing how wise research data management can allow the crowd-sourcing of scientific research.

Keywords

crowd-sourced research; citizen science

Research subject area

INFORMATION AND COMPUTING SCIENCES

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)

Summary Impact Type

Facts and figures

More than 130 articles published;
Over 1.5 million registered volunteers;
At least 58 web-based citizen science projects;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2009

Link

Title

20: History and Data

Subtitle

The link between disciplines that seem diametrically opposed

Abstract

Access to ancient books, manuscripts, and artefacts is often limited due to their fragility and importance. In addition, they are spread between several locations, which makes the work of historians difficult. The digitisation of heritage data by the British History Online digital library bridges the gap by making material available to researchers from all over the world. The application of research data management in this field led to changes in the researchers' workflow and earned the library a large number of citations in the academic literature and mentions in the news.

Keywords

British history; digitisation; library

Research subject area

HISTORY AND ARCHAEOLOGY

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)

Summary Impact Type

Facts and figures

330,000 unique visitors a month;
1,410 Google Scholar results;
9 mentions in news;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2003

Link

Title

21: Saving the Earth from Mankind

Subtitle

How can we preserve Earth and develop sustainably?

Abstract

Our planet needs to be protected, as human development tends to ignore sustainability and the effect business has on the environment and on biodiversity. Luckily, things are slowly changing, and better decisions and policies supported by research data can now be made. The Global Biodiversity Information Facility (GBIF) plays a critical role in enabling sustainable development by hosting evidence on more than 1.6 million species and its data is featured in more than 1,400 research papers. GBIF data is used by scientists, policymakers, and journalists alike.

Keywords

biodiversity; environment; sustainable development

Research subject area

ENVIRONMENTAL SCIENCES

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)

Summary Impact Type

Political; Economic; Environmental

Facts and figures

Evidence on more than 1.6 million species;
1,400 peer-reviewed articles citing GBIF data;
81 countries;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

1999

Link

Title

22: Understanding Mobile Users and Evaluating Vulnerabilities

Subtitle

User data as an investigation tool

Abstract

The Device Analyzer project gathers data on smartphone usage and curates it for re-use by companies, universities, and research institutes. The project's data led to the development of important statistics on the vulnerability of smartphones using the Android operating system. The researchers found that this is related to the slow pace of system updates and only manufacturers have the tools to address the problem.

Keywords

Android; smartphone; mobile phones

Research subject area

INFORMATION AND COMPUTING SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)

Summary Impact Type

Facts and figures

30,978 contributors; 87.7% of Android devices are exposed to at least one of 11 known critical vulnerabilities

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2012

Link

Title

23: Understanding War

Subtitle

Data-powered insights into the motives and consequences of war

Abstract

In the world, there are currently 58 ongoing conflicts. These cause tens of thousands of fatalities each year and are related to reasons that are obscure, complex, and often difficult to understand. Researchers and political scientists have been trying to uncover the reasons for war for a long time. Today, they can leverage data to explain conflicts, and geotagged datasets can be organised to build visualisations that greatly facilitate the understanding of contexts and actors in a war. The use of research data management is instrumental in helping us fully grasp the reasons for conflicts and, hopefully, preventing future ones.

Keywords

War; conflicts

Research subject area

STUDIES IN HUMAN SOCIETY

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact

Summary Impact Type

Political; Societal

Facts and figures

4,000 citations: Correlates of War project
3,800 citations: Uppsala Conflict Data Program

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2012

Link

Title

24: Surfing Gravitational Waves

Subtitle

Understanding gravity and Einstein’s General Theory of Relativity

Abstract

Since 2002, the LIGO experiment has been running and collecting data in the domain of astrophysics. This very large facility created over 1 petabyte of information, which required management for several years. This effort has allowed researchers to confirm the existence of gravitational waves, ripples in space time hypothesised by Einstein. In addition, data from the experiment is being used to produce peer-reviewed publications and additional information on past detections, such as visuals, audio, and other media.

Keywords

gravitational waves; relativity

Research subject area

PHYSICAL SCIENCES

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort)

Summary Impact Type

Facts and figures

111,000 news mentions;
Over 1 petabyte of data

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

1994

Link

Title

25: Art From The Comfort Of Your Chair

Subtitle

Aggregating arts metadata to power a digital museum

Abstract

Educational institutions, museums, and other organisations hold a wealth of information on paintings, sculpture, and artefacts. This has historically been kept private or indexed locally, however, the Europeana project aims to aggregate metadata in the field of arts and make it publicly accessible. Over 3,000 organisation contribute to the portal, which hosts more than 54 million records from across Europe. The Europeana project has been using its material to promote creative thinking in a number of disciplines and led to the creation of a web exhibition platform for institutions holding art. In addition, Europeana tracks research arising from its data through the Europeana Treasures blogpost series.

Keywords

cultural heritage; digitisation

Research subject area

LANGUAGE, COMMUNICATION AND CULTURE

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact

Summary Impact Type

Technological; Cultural

Facts and figures

Collaboration of 3,000 institutions;
54,364,816 artworks, artefacts, books, videos and sounds from across Europe;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2008

Link

Short-form case studies

Title

1: A Performance Artwork Based on Datasets and Partnerships

Keywords

music; sound; software

Research subject area

EARTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)

Summary Impact Type

Facts and figures

430GB of data created

Maturity of the initiative/data source

Recent (<5 years)

Year (e.g., first data release, first output, year of impact)

2014

Link

Title

2: Open Science Underpins Collaboration: The Structural Genomics Consortium (SGC)

Keywords

protein structure

Research subject area

BIOLOGICAL SCIENCES

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact

Summary Impact Type

Economic

Facts and figures

over 1,500 protein structures and 75 kinase structures shared;
Protein Data Bank is worth $12 billion;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2003

Link

Title

3: Testing Doubts About the Reliability of Science: The Reproducibility Project

Keywords

reproducibility; open science

Research subject area

PSYCHOLOGY AND COGNITIVE SCIENCES

Type of RDM impact/benefit

Reproducibility

Summary Impact Type

Facts and figures

Only 40% of findings could be replicated

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2011

Link

Title

4: Large Volumes of Data Engage Large Communities: Sloan Digital Sky Survey (SDSS))

Keywords

space; citizen science

Research subject area

EARTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)

Summary Impact Type

Facts and figures

Classification of nearly 900,000 galaxies

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

1998

Link

Title

5: Scholarly Communication is About Combining Effort: Polymath Project

Keywords

collaboration; problem solving

Research subject area

MATHEMATICAL SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)

Summary Impact Type

Facts and figures

3 out of 12 mathematical problems solved;
6 published articles;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2009

Link

Title

6: Is Citizen Generated Data Suitable for Academic Purposes? Volunteered Geographic Information (VGI)

Keywords

maps; volunteered data

Research subject area

EARTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)

Summary Impact Type

Facts and figures

Only 12% of photographs considered unusable

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2004

Link

Title

7: A Specialist Research Data Archive: Crawdad

Keywords

wireless; networks

Research subject area

INFORMATION AND COMPUTING SCIENCES

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort)

Summary Impact Type

Facts and figures

2378 papers using CRAWDAD datasets or mentioning CRAWDAD;
1217 downloads for the most popular datasets and 566 citations;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2004

Link

Title

8: A Large-Scale Research Data Service: The European Bioinformatics Institute

Keywords

genome information

Research subject area

BIOLOGICAL SCIENCES

Type of RDM impact/benefit

Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact

Summary Impact Type

Economic

Facts and figures

Efficiency gains worth £1 billion per annum;
Future research impact worth £335 million annually or £2.5 billion over 30 years

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

1992

Link

Title

9: Combining Data & Influencing Government Sustainability Policies

Keywords

travelling; sustainability

Research subject area

MEDICAL AND HEALTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact

Summary Impact Type

Political; Societal

Facts and figures

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2008

Link

Title

10: Objective Measures & Self-Reported Data Underpin Policy On Obesity: Health Survey for England (HSE)

Keywords

health survey; obesity

Research subject area

MEDICAL AND HEALTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)

Summary Impact Type

Facts and figures

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

1991

Link

Title

11: Archival Acceptance as an "Indicator of Quality": Foot & Mouth Disease (FMD) Testimonial Data

Keywords

crisis; diaries

Research subject area

MEDICAL AND HEALTH SCIENCES

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact

Summary Impact Type

Societal

Facts and figures

3,200 weekly diaries leading to the understanding of how people react to abnormal events

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2001

Link

Title

12: Improving Policy by Providing Data: Live Music Exchange (LMX)

Keywords

live music

Research subject area

LANGUAGE, COMMUNICATION AND CULTURE

Type of RDM impact/benefit

Socio/Economic impact

Summary Impact Type

Political

Facts and figures

Maturity of the initiative/data source

Recent (<5 years)

Year (e.g., first data release, first output, year of impact)

2015

Link

Title

13: From Hard Copy Primary Sources to An Open Online Resource: Reading Experience Database, (RED)

Keywords

reading

Research subject area

LANGUAGE, COMMUNICATION AND CULTURE

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)

Summary Impact Type

Facts and figures

contributions of >120 volunteers, who created over 6000 entries;

1800 users per month from >135 countries;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

1995

Link

Title

14: Qualitative Data in Many Formats, Archived Online: Tate Encounters

Keywords

qualitative data; museum

Research subject area

STUDIES IN HUMAN SOCIETY

Type of RDM impact/benefit

Socio/Economic impact

Summary Impact Type

Societal; Cultural

Facts and figures

300 student questionnaires;
200 student essays on Tate Modern and Tate Britain;
12 student workshops;
12 in depth student research projects;
5 extended participant family edited ethnographic films;
38 Tate staff interviews and interviews with 72 participants through the Research in Process events at Tate Britain;

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2010

Link

Title

15: Data Combination and Self Re-Use: Understanding Pauper Lives in Georgian London

Keywords

pauper lives; London

Research subject area

HISTORY AND ARCHAEOLOGY

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort)

Summary Impact Type

Over 300,000 records about over 50,000 poor individuals in the West End of London

Facts and figures

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2004

Link

Title

16: Ireland-Bristol Trade in the Sixteenth Century

Keywords

trade; Ireland; England

Research subject area

HISTORY AND ARCHAEOLOGY

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort)

Summary Impact Type

Facts and figures

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2006

Link

Title

17: Film and an Ethnographic Approach: Buddhist Cosmology in Food

Keywords

buddhism; food

Research subject area

LANGUAGE, COMMUNICATION AND CULTURE

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort)

Summary Impact Type

Facts and figures

6 videos created as a project output

Maturity of the initiative/data source

Recent (<5 years)

Year (e.g., first data release, first output, year of impact)

2014

Link

Title

18: Data About Data Archiving: ICPSR’s Data Sharing in the Social Sciences

Keywords

meta-analysis

Research subject area

STUDIES IN HUMAN SOCIETY

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort)

Summary Impact Type

Facts and figures

Maturity of the initiative/data source

Long-standing (5+ years)

Year (e.g., first data release, first output, year of impact)

2010

Link

Title

19: Research Data Supports Restoration: Mackintosh Architecture

Keywords

restoration; building

Research subject area

BUILT ENVIRONMENT AND DESIGN

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact

Summary Impact Type

Societal; Cultural

Facts and figures

350 architectural projects shared

Maturity of the initiative/data source

Recent (<5 years

Year (e.g., first data release, first output, year of impact)

2014

Link

Title

20: Evidencing Value of Artistic, Cultural or Sporting Activities: UK Subjective Well-Being Data (SWB)

Keywords

understanding society; sport; culture

Research subject area

STUDIES IN HUMAN SOCIETY

Type of RDM impact/benefit

Efficiency in research and data re-use (e.g., reduce duplication of effort)

Summary Impact Type

Facts and figures

Maturity of the initiative/data source

Recent (<5 years)

Year (e.g., first data release, first output, year of impact)

2016

Link

Updated 2 years ago

Case study titles and abstracts


Suggested Edits are limited on API Reference Pages

You can only suggest edits to Markdown body content, but not to the API spec.