Case study titles and abstracts

Research Data Management: Benefits and Data reuse case studies

Introduction

Research Data Management (RDM) is an overarching term encompassing the organisation, storage, and documentation of data generated during research projects. RDM deals with the organisation and curation of active research data, with its day-to-day management and use, and with its long-term preservation.
RDM is an important practice for both institutions and individual researchers. Data supporting results should be made available and preserved so as to allow its reuse and the verification of published research. Several other benefits can arise from the implementation of RDM, including increased citations, increased research collaborations, or increased visibility. Today, data has to be managed not only for preservation purposes, but also to fulfil the requirements of most research funders.
Although RDM has been around for a while, the above benefits are usually described qualitatively and the lack of a solid body of evidence makes advocacy difficult. We have sought to fill this gap by using case studies to present a rich and varied picture of the impacts underpinned by RDM.
###Spread of disciplines
The case studies assembled here come from a wide range of research fields. Due to inherent differences between disciplines, the benefits of RDM become apparent in different ways. They are more tangible in certain fields, and more abstract in others. Nonetheless, the case studies demonstrate that RDM is a worthwhile activity for all institutions and researchers.
The examples below mostly involve large data management initiatives, as these are more likely to show the wide reach of the benefits of RDM. However, we would like to stress that even smaller data management efforts can have an impact. Unfortunately, this can be very difficult to track, as an individual researcher reusing data from other individual researchers is often lost in a sea of information. Similarly, impact sometimes cannot be traced to a specific source: in some studies, clear evidence of the impact of RDM is available, but they point to a whole repository rather than to a single study or dataset.
###Enablers of impact from RDM
The effective implementation of RDM requires both cultural change and specific data skills. This makes its dissemination and practical realisation difficult and is the main obstacle to the above-mentioned benefits. It is, therefore, desirable to examine the RDM environment to investigate its enablers and what has worked historically to encourage future data curation and reuse.
Our research into the benefits or RDM led us to discovering some of the circumstances and situations that facilitate it, along with some of the reasons why this practice should be pursued. A summary of our findings is as follows:

  • Open licensing (e.g., in the case of computer code and algorithms) is essential to allow crowd-sourced improvements.
  • Data repositories and infrastructures are among the most significant enablers of impact: without them, very few of the impact case studies below would have been possible.
  • Collaborations between international bodies or organisations strongly promote data re-use, especially in fields where it was not possible for a single player to take charge. These collaborations create the right environment for sharing and re-use of research data: cultural change is encouraged along with the use of joint infrastructures at a national or international level.
  • The impact of RDM is normally seen after a long time, when, i.e., after has been produced, curated, maintained, and reused. Thus, there is a need for sustained investment in this field, as benefits cannot be seen immediately.
  • Aggregation of data and digitisation of documents are key to encouraging the development of digital humanities. These initiatives often arise from the collaboration between museums, research libraries, and universities. When data that was spread between several sources (e.g., many different books/articles) or held in obsolete formats was organised and analysed through sound RDM, hidden findings could be uncovered.
    ###Presentation format
    The case studies are grouped into Long Form—more detailed with abstracts etc.— and Short Form. In all cases key information is presented in a table after the text.

Long-form case studies

Title 1: Neurodegeneration and Dementia Research
Subtitle Leveraging data from cohort studies to accelerate medical progress
Abstract Dementia and neurodegenerative diseases affect both individuals and society as a whole. However, neither cures nor treatments are available at the moment. The Dementias Platforms UK collaboration aims to turn dementia research into treatments as quickly as possible, to both improve people's lives and decrease the socioeconomic cost related to these types of diseases
Keywords dementia; Alzheimer; cohort studies
Research subject area MEDICAL AND HEALTH SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)
Summary Impact Type Impact on health and wellbeing
Facts and figures 850,000 people with Dementia, £53 million project (DPUK), £6.9 million project financed using DPUK
Maturity of the initiative/data source Recent (<5 years)
Year (e.g., first data release, first output, year of impact)2014
Linkhttps://doi.org/10.5281/zenodo.818367

Top of Long-form Case Studies

Title 2: Biological Data Mining
Subtitle Difficulties in data handling in the field of biology
Abstract Biologists often need to deal with heterogeneous data sources, which makes their work difficult and time-consuming. The InterMine system provides an easy-to-use data warehouse solution that biologists can exploit for their studies with little programming knowledge.
Keywords biology; data warehouse; data mining
Research subject area BIOLOGICAL SCIENCES
Type of RDM impact/benefit Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)
Summary Impact Type https://dash.readme.io/project/research-data-network/v2.01/docs/case-study-abstracts
Facts and figures 574 citation
Maturity of the initiative/data source Long-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2012
Linkhttps://doi.org/10.5281/zenodo.818423

Top of Long-form Case Studies

Title 3: Gene Expression Omnibus Data to Fight Cancer
Subtitle Systems biology and enhanced cancer diagnosis
Abstract Sharing genomic data on the Gene Expression Omnibus (GEO) repository has a significant impact on medical research and improves the efficiency of the publishing environment. New approaches to cancer research were developed, including a novel method of diagnosis based on the study of exosomes.
Keywords medical research; cancer; repository; publishing
Research subject area BIOLOGICAL SCIENCES
Type of RDM impact/benefit Methodological impact (e.g., new approaches developed); Efficiency in research and data re-use (e.g., reduce duplication of effort)
Summary Impact Type
Facts and figures 65,000 results from Google Scholar
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2001
Linkhttps://doi.org/10.5281/zenodo.818443

Top of Long-form Case Studies

Title 4: The World vs E.Coli
Subtitle Researchers joining forces to sequence bacterial genomes
Abstract During the 2011 E.coli outbreak, the release of the bacterial genome in the public domain allowed researchers to reach conclusions quickly and effectively. The strain was tracked to a seed shipment from 2009 and the crow-sourced analysis received high media attention.
Keywords infection; e.coli; outbreak
Research subject area MEDICAL AND HEALTH SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact
Summary Impact Type Impact on health and wellbeing
Facts and figures 17 authors working in a crowdsourcing effort;
12 academic publications;
8 media outlets covering the initiative;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2011
Linkhttps://doi.org/10.5281/zenodo.818489

Top of Long-form Case Studies

Title 5: Data-Powered Patents
Subtitle Citation patterns in granted patents in the field of biology
Abstract The analysis of data citation patterns in the field of biology showed how over 8,000 patents were based on publicly available data. While this proves the usefulness of repositories as a whole, it also shows how the evaluation of researchers should consider data citations and alternative sources, too, as these are often key to uncover the broader industrial and societal value of academic research.
Keywords citations; patents; text mining
Research subject area BIOLOGICAL SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed); Socio/Economic impact
Summary Impact Type Impact on economy and business
Facts and figures 8,000 patents thanks to open data
Maturity of the initiative/data sourceRecent (<5 years)
Year (e.g., first data release, first output, year of impact)2014
Linkhttps://doi.org/10.5281/zenodo.818511

Top of Long-form Case Studies

Title 6: Sharing Research Data and Infrastructure to Study Proteins
Subtitle A research group's impact on the study of circular dichroism
Abstract The DICHROWEB and PCDDB platforms are widely used for the study of proteins. Since their release, hundreds of thousands of users accessed them, from both academia and the private sector. In academia, the platforms fuel research and teaching, while they led to several advances in industry, including the development of 11 patents.
Keywords protein; repository; infrastructure
Research subject area MEDICAL AND HEALTH SCIENCES
Type of RDM impact/benefit Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)
Summary Impact Type Impact on health and wellbeing
Facts and figures 375,000 analyses;
3,600 registered users;
over 1,000 citations;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2001
Linkhttps://doi.org/10.5281/zenodo.818545

Top of Long-form Case Studies

Title 7: Using Flies to Understand The Human Brain
Subtitle Harvesting published data to power neurobiology research
Abstract Studying the brain of fruit flies is helping researchers uncover how our brains work at the molecular level. Data repositories such as the Virtual Fly Brain help them curate, share, and re-use data in a structured way.
Keywords neurobiology; medicine; fly; brain;
Research subject area BIOLOGICAL SCIENCES
Type of RDM impact/benefit Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort)
Summary Impact Type
Facts and figures
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2012
Linkhttps://doi.org/10.5281/zenodo.818551

Top of Long-form Case Studies

Title 8: A Data-Based Approach to Preventing Curable Eye Diseases
Subtitle Leveraging data to inform health policies
Abstract The Vision Loss Expert Group (VLEG) gathered and released data on vision loss all around the world. The data is localised, which means that every country can tackle local issues to reduce the burden of eye loss. The findings of the VLEG are far-reaching and were picked up by large organisations such as PwC, the World Bank, and the World Economic Forum, shaping policies, debates, and educations programmes worldwide.
Keywords ophthalmology; eye health; VLEG; GBD
Research subject area MEDICAL AND HEALTH SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)
Summary Impact Type Impact on health and wellbeing; Impact on public services; Impact on politics and governance
Facts and figures 15,000 articles harvested;
$2.2 per capita to eliminate avoidable blindness in developing countries by 2020;
collaboration between 79 scientists;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)1990
Link

Top of Long-form Case Studies

Title 9: Self Compacting Concrete
Subtitle Advancements in engineering led by data aggregation
Abstract Data from over 250 academic sources was aggregated in the form of a database and used to inform future design of self-compacting concrete. This large-scale study allowed researchers to precisely describe the differences between self-compacting concrete and "traditional" vibrated concrete.
Keywords concrete; database; self-compacting
Research subject area ENGINEERING
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort)
Summary Impact Type
Facts and figures 250 articles;
1500 concrete mixtures analysed;
Maturity of the initiative/data sourceRecent (<5 years)
Year (e.g., first data release, first output, year of impact)2014
Link

Top of Long-form Case Studies

Title 10: Preventing Drug Interactions On Your Mobile
Subtitle Novel tools for patients and healthcare professionals
Abstract The iChart apps developed by the University of Liverpool help clinicians and patients with HIV or hepatitis C better deal with drug interactions. The apps allow clear and ubiquitous access to research data that has been arranged for maximum effectiveness and dissemination, thus, improving patient response and reducing the side effects experienced. In addition, clinicians can save time, as all the information they need is now available directly on their smartphones.
Keywords hiv; hepatitis; drug interactions
Research subject area MEDICAL AND HEALTH SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)
Summary Impact Type Impact on health and wellbeing
Facts and figures 128 countries;
17,000 downloads;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2010
Link

Top of Long-form Case Studies

Title 11: Meeting Sustainability Objectives
Subtitle Reducing carbon emissions from agricultural production
Abstract Meeting sustainability objectives is becoming increasingly important to reduce the impact of global warming. The field of agriculture has been deemed responsible for a third of our greenhouse emissions, thus, resources such as the Cool Farm Tool are essential to help people in the sector understand how they can reduce their environmental impact.
Keywords agriculture; carbon footprint;
Research subject area ENVIRONMENTAL SCIENCES
Type of RDM impact/benefit Methodological impact (e.g., new approaches developed); Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact
Summary Impact Type Environmental impact
Facts and figures 800 global datasets for N2O;
100 global datasets for soil carbon sequestrations;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2011
Link

Top of Long-form Case Studies

Title 12: Finding Offshore Hydrocarbons
Subtitle Leveraging satellite data to improve the efficiency of exploration
Abstract Satellite data has been re-used to produced a more up-to-date and precise dataset helping with offshore exploration. The improved gravity data prepared by the researchers has shown a very high potential and is estimated to be 10% more accurate than previously-available work. The dataset has been used by major oil companies to drive decision-making and improve the safety of their exploration efforts.
Keywords oil; exploration; gravity
Research subject area EARTH SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)
Summary Impact Type Impact on economy and business; Environmental impact
Facts and figures $2.5 million per project
£1.2 million received
10% improvement compared to previous data
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)1998
Link

Top of Long-form Case Studies

Title 13: Geological Data Made Easy
Subtitle The British Geological Survey and the OpenGeoscience portal
Abstract Geological data was released by the British Geological Survey to align their resources to the principles of open science. Their efforts took the form of the OpenGeoscience portal, where data is shared through an Open Government License. Users of OpenGeoscience resources are encouraged to reshape the data to develop new products, called mash-ups, and more than 20 of these are available on the BGS website.
Keywords geology; survey; NERC
Research subject area EARTH SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact
Summary Impact Type Impact on economy and business; Environmental impact
Facts and figures Over 20 mash-ups (projects derived from OpenGeoscience data)
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2010
Link

Top of Long-form Case Studies

Title 14: Protecting The Oceans By Coordinating Data Sharing Efforts
Subtitle UNESCO’s efforts to preserve marine environments
Abstract UNESCO's efforts to protect marine environments materialised with the creation of IODE in 1961. The programme aims to improve data management in the field and to guide and coordinate the data gathering work by a large number of countries. Such a high-level initiative allows data to be shared very effectively, as no country could possibly gather so much information on its own. In addition,
Keywords marine; ocean; coordination; unesco
Research subject area EARTH SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)
Summary Impact Type Environmental impact; Impact on politics and governance; Impact on economy and business
Facts and figures
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)1990
Link

Top of Long-form Case Studies

Title 15: Corruption in Public Sector Procurement
Subtitle Digital whistleblowing to quantify the cost of corruption
Abstract The DIGIWHIST project gathered and elaborated information on public procurement and accountability of public officials within the EU and in neighbouring countries. This was picked up by the European Commission, which released a study showing how corruption may cost Europe up to €990 billion per year.
Keywords corruption; european commission; transparency; public sector
Research subject area STUDIES IN HUMAN SOCIETY
Type of RDM impact/benefit Socio/Economic impact; Efficiency in research and data re-use (e.g., reduce duplication of effort)
Summary Impact Type Impact on politics and governance; Impact on public services; Impact on economy and business
Facts and figures up to €990 billion lost to corruption yearly
Maturity of the initiative/data sourceRecent (<5 years)
Year (e.g., first data release, first output, year of impact)2015
Link

Top of Long-form Case Studies

Title 16: Drones for Research
Subtitle Using unmanned aerial vehicles to power new approaches to scientific investigations
Abstract Drones are becoming a constant presence in technology news and media, thus, it is not surprising that they also caught the attention of the research community. Drones allow researchers to capture aerial images easily and at a low cost, however, the data they gather needs to be properly curated to allow any applications. In the field of agriculture, drone-captured datasets are being used to spot plant diseases and help farmers better protect their yields. In addition, drone time-stamped drone image sets have been used to study how to best protect crops and land from floods.
Keywords agriculture; drones
Research subject area AGRICULTURAL AND VETERINARY SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)
Summary Impact Type Impact on economy and business; Economic; Environmental
Facts and figures
Maturity of the initiative/data sourceRecent (<5 years)
Year (e.g., first data release, first output, year of impact)2016
Link

Top of Long-form Case Studies

Title 17: Data-Enhanced Archaeology
Subtitle Using modern tools to study ancient times
Abstract In the field of archaeology, data is scarce and difficult to find. This is simply because it usually comes from excavations or physical operations on artefacts, which are normally expensive and can be performed only in certain conditions. The Archaeology Data Service (ADS) aims to fill the gap by freely providing more than 1.3 million metadata records on archaeological data and by driving developments in research data management in the field. The service enables increased efficiency thanks to data reuse and this is valued at at least £13 million per annum.
Keywords archaeological data; repository
Research subject area HISTORY AND ARCHAEOLOGY
Type of RDM impact/benefit Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)
Summary Impact Type
Facts and figures £13 million per annum savings due to increased efficiency;
2-fold to 8-fold return on investment;
44% of interviewed stakeholders could not have carried out their work without the ADS;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)1996
Link

Top of Long-form Case Studies

Title 18: Supporting Science and Industry by Sharing Computer Code
Subtitle Sharing software as a form of research data
Abstract Sharing software is not as common as sharing other types of research data, however, it is sometimes very impactful. The Statistical Parametric Mapping (SPM) software in the field of neuroimaging is an example of how sharing computer code can lead to far-reaching effects. In this case, making the code public allowed the creation of a new field of study and led the software to become the leader in the sector. In addition, thanks to the licensing chosen, some companies were able to exploit the code to create derivative products, which are generating substantial income, while major pharmaceutical companies are using it in the field of drug research.
Keywords software; code; algorithms
Research subject area MEDICAL AND HEALTH SCIENCES
Type of RDM impact/benefit Socio/Economic impact; Methodological impact (e.g., new approaches developed)
Summary Impact Type Economic; Technological; Health
Facts and figures 64% of users in the field use the software;
€5,000 for each license (software derived from sharing);
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)1991
Link

Top of Long-form Case Studies

Title 19: Citizen Science at Zooniverse
Subtitle Sharing data and efforts via an online platform
Abstract It is not always easy to deal with large amounts of data. At times, algorithms can help researchers make sense of their large datasets, however, sometimes the human mind cannot be replaced. In these cases, platforms like Zooniverse come to the researchers' help, allowing them to have citizen volunteers analyse scientific data and enable new scientific discoveries. More than 130 articles were published thanks to citizen science, showing how wise research data management can allow the crowd-sourcing of scientific research.
Keywords crowd-sourced research; citizen science
Research subject area INFORMATION AND COMPUTING SCIENCES
Type of RDM impact/benefit Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)
Summary Impact Type
Facts and figures More than 130 articles published;
Over 1.5 million registered volunteers;
At least 58 web-based citizen science projects;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2009
Link

Top of Long-form Case Studies

Title 20: History and Data
Subtitle The link between disciplines that seem diametrically opposed
Abstract Access to ancient books, manuscripts, and artefacts is often limited due to their fragility and importance. In addition, they are spread between several locations, which makes the work of historians difficult. The digitisation of heritage data by the British History Online digital library bridges the gap by making material available to researchers from all over the world. The application of research data management in this field led to changes in the researchers' workflow and earned the library a large number of citations in the academic literature and mentions in the news.
Keywords British history; digitisation; library
Research subject area HISTORY AND ARCHAEOLOGY
Type of RDM impact/benefit Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)
Summary Impact Type
Facts and figures 330,000 unique visitors a month;
1,410 Google Scholar results;
9 mentions in news;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2003
Link

Top of Long-form Case Studies

Title 21: Saving the Earth from Mankind
Subtitle How can we preserve Earth and develop sustainably?
Abstract Our planet needs to be protected, as human development tends to ignore sustainability and the effect business has on the environment and on biodiversity. Luckily, things are slowly changing, and better decisions and policies supported by research data can now be made. The Global Biodiversity Information Facility (GBIF) plays a critical role in enabling sustainable development by hosting evidence on more than 1.6 million species and its data is featured in more than 1,400 research papers. GBIF data is used by scientists, policymakers, and journalists alike.
Keywords biodiversity; environment; sustainable development
Research subject area ENVIRONMENTAL SCIENCES
Type of RDM impact/benefit Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact; Methodological impact (e.g., new approaches developed)
Summary Impact Type Political; Economic; Environmental
Facts and figures Evidence on more than 1.6 million species;
1,400 peer-reviewed articles citing GBIF data;
81 countries;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)1999
Link

Top of Long-form Case Studies

Title 22: Understanding Mobile Users and Evaluating Vulnerabilities
Subtitle User data as an investigation tool
Abstract The Device Analyzer project gathers data on smartphone usage and curates it for re-use by companies, universities, and research institutes. The project's data led to the development of important statistics on the vulnerability of smartphones using the Android operating system. The researchers found that this is related to the slow pace of system updates and only manufacturers have the tools to address the problem.
Keywords Android; smartphone; mobile phones
Research subject area INFORMATION AND COMPUTING SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)
Summary Impact Type
Facts and figures 30,978 contributors; 87.7% of Android devices are exposed to at least one of 11 known critical vulnerabilities
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2012
Link

Top of Long-form Case Studies

Title 23: Understanding War
Subtitle Data-powered insights into the motives and consequences of war
Abstract In the world, there are currently 58 ongoing conflicts. These cause tens of thousands of fatalities each year and are related to reasons that are obscure, complex, and often difficult to understand. Researchers and political scientists have been trying to uncover the reasons for war for a long time. Today, they can leverage data to explain conflicts, and geotagged datasets can be organised to build visualisations that greatly facilitate the understanding of contexts and actors in a war. The use of research data management is instrumental in helping us fully grasp the reasons for conflicts and, hopefully, preventing future ones.
Keywords War; conflicts
Research subject area STUDIES IN HUMAN SOCIETY
Type of RDM impact/benefit Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact
Summary Impact Type Political; Societal
Facts and figures 4,000 citations: Correlates of War project
3,800 citations: Uppsala Conflict Data Program
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2012
Link

Top of Long-form Case Studies

Title 24: Surfing Gravitational Waves
Subtitle Understanding gravity and Einstein’s General Theory of Relativity
Abstract Since 2002, the LIGO experiment has been running and collecting data in the domain of astrophysics. This very large facility created over 1 petabyte of information, which required management for several years. This effort has allowed researchers to confirm the existence of gravitational waves, ripples in space time hypothesised by Einstein. In addition, data from the experiment is being used to produce peer-reviewed publications and additional information on past detections, such as visuals, audio, and other media.
Keywords gravitational waves; relativity
Research subject area PHYSICAL SCIENCES
Type of RDM impact/benefit Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort)
Summary Impact Type
Facts and figures 111,000 news mentions;
Over 1 petabyte of data
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)1994
Link

Top of Long-form Case Studies

Title 25: Art From The Comfort Of Your Chair
Subtitle Aggregating arts metadata to power a digital museum
Abstract Educational institutions, museums, and other organisations hold a wealth of information on paintings, sculpture, and artefacts. This has historically been kept private or indexed locally, however, the Europeana project aims to aggregate metadata in the field of arts and make it publicly accessible. Over 3,000 organisation contribute to the portal, which hosts more than 54 million records from across Europe. The Europeana project has been using its material to promote creative thinking in a number of disciplines and led to the creation of a web exhibition platform for institutions holding art. In addition, Europeana tracks research arising from its data through the Europeana Treasures blogpost series.
Keywords cultural heritage; digitisation
Research subject area LANGUAGE, COMMUNICATION AND CULTURE
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact
Summary Impact Type Technological; Cultural
Facts and figures Collaboration of 3,000 institutions;
54,364,816 artworks, artefacts, books, videos and sounds from across Europe;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2008
Link

Top of Long-form Case Studies

Short-form case studies

Title 1: A Performance Artwork Based on Datasets and Partnerships
Keywordsmusic; sound; software
Research subject areaEARTH SCIENCES
Type of RDM impact/benefitEfficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)
Summary Impact Type
Facts and figures>430GB of data created
Maturity of the initiative/data sourceRecent (<5 years)
Year (e.g., first data release, first output, year of impact)2014
Link

Top of Short-form Case Studies

Title 2: Open Science Underpins Collaboration: The Structural Genomics Consortium (SGC)
Keywordsprotein structure
Research subject areaBIOLOGICAL SCIENCES
Type of RDM impact/benefitReproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact
Summary Impact TypeEconomic
Facts and figuresover 1,500 protein structures and 75 kinase structures shared;
Protein Data Bank is worth $12 billion;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2003
Link

Top of Short-form Case Studies

Title 3: Testing Doubts About the Reliability of Science: The Reproducibility Project
Keywordsreproducibility; open science
Research subject areaPSYCHOLOGY AND COGNITIVE SCIENCES
Type of RDM impact/benefitReproducibility
Summary Impact Type
Facts and figuresOnly 40% of findings could be replicated
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2011
Link

Top of Short-form Case Studies

Title 4: Large Volumes of Data Engage Large Communities: Sloan Digital Sky Survey (SDSS))
Keywords space; citizen science
Research subject area EARTH SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)
Summary Impact Type
Facts and figures Classification of nearly 900,000 galaxies
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)1998
Link

Top of Short-form Case Studies

Title 5: Scholarly Communication is About Combining Effort: Polymath Project
Keywords collaboration; problem solving
Research subject area MATHEMATICAL SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)
Summary Impact Type
Facts and figures 3 out of 12 mathematical problems solved;
6 published articles;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact) 2009
Link

Top of Short-form Case Studies

Title 6: Is Citizen Generated Data Suitable for Academic Purposes? Volunteered Geographic Information (VGI)
Keywords maps; volunteered data
Research subject area EARTH SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)
Summary Impact Type
Facts and figures Only 12% of photographs considered unusable
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact) 2004
Link

Top of Short-form Case Studies

Title 7: A Specialist Research Data Archive: Crawdad
Keywords wireless; networks
Research subject area INFORMATION AND COMPUTING SCIENCES
Type of RDM impact/benefit Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort)
Summary Impact Type
Facts and figures 2378 papers using CRAWDAD datasets or mentioning CRAWDAD;
1217 downloads for the most popular datasets and 566 citations;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact) 2004
Link

Top of Short-form Case Studies

Title 8: A Large-Scale Research Data Service: The European Bioinformatics Institute
Keywords genome information
Research subject area BIOLOGICAL SCIENCES
Type of RDM impact/benefit Reproducibility; Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact
Summary Impact Type Economic
Facts and figures Efficiency gains worth £1 billion per annum;
Future research impact worth £335 million annually or £2.5 billion over 30 years
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact) 1992
Link

Top of Short-form Case Studies

Title 9: Combining Data & Influencing Government Sustainability Policies
Keywords travelling; sustainability
Research subject area MEDICAL AND HEALTH SCIENCES
Type of RDM impact/benefit Efficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact
Summary Impact Type Political; Societal
Facts and figures
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact) 2008
Link

Top of Short-form Case Studies

Title10: Objective Measures & Self-Reported Data Underpin Policy On Obesity: Health Survey for England (HSE)
Keywordshealth survey; obesity
Research subject areaMEDICAL AND HEALTH SCIENCES
Type of RDM impact/benefitEfficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)
Summary Impact Type
Facts and figures
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)1991
Link

Top of Short-form Case Studies

Title11: Archival Acceptance as an "Indicator of Quality": Foot & Mouth Disease (FMD) Testimonial Data
Keywordscrisis; diaries
Research subject areaMEDICAL AND HEALTH SCIENCES
Type of RDM impact/benefitEfficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact
Summary Impact TypeSocietal
Facts and figures3,200 weekly diaries leading to the understanding of how people react to abnormal events
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2001
Link

Top of Short-form Case Studies

Title12: Improving Policy by Providing Data: Live Music Exchange (LMX)
Keywordslive music
Research subject areaLANGUAGE, COMMUNICATION AND CULTURE
Type of RDM impact/benefitSocio/Economic impact
Summary Impact TypePolitical
Facts and figures
Maturity of the initiative/data sourceRecent (<5 years)
Year (e.g., first data release, first output, year of impact)2015
Link

Top of Short-form Case Studies

Title13: From Hard Copy Primary Sources to An Open Online Resource: Reading Experience Database, (RED)
Keywordsreading
Research subject areaLANGUAGE, COMMUNICATION AND CULTURE
Type of RDM impact/benefitEfficiency in research and data re-use (e.g., reduce duplication of effort); Methodological impact (e.g., new approaches developed)
Summary Impact Type
Facts and figurescontributions of >120 volunteers, who created over 6000 entries;
>1800 users per month from >135 countries;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)1995
Link

Top of Short-form Case Studies

Title14: Qualitative Data in Many Formats, Archived Online: Tate Encounters
Keywordsqualitative data; museum
Research subject areaSTUDIES IN HUMAN SOCIETY
Type of RDM impact/benefitSocio/Economic impact
Summary Impact TypeSocietal; Cultural
Facts and figures300 student questionnaires;
200 student essays on Tate Modern and Tate Britain;
12 student workshops;
12 in depth student research projects;
5 extended participant family edited ethnographic films;
38 Tate staff interviews and interviews with 72 participants through the Research in Process events at Tate Britain;
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2010
Link

Top of Short-form Case Studies

Title15: Data Combination and Self Re-Use: Understanding Pauper Lives in Georgian London
Keywordspauper lives; London
Research subject areaHISTORY AND ARCHAEOLOGY
Type of RDM impact/benefitEfficiency in research and data re-use (e.g., reduce duplication of effort)
Summary Impact TypeOver 300,000 records about over 50,000 poor individuals in the West End of London
Facts and figures
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2004
Link

Top of Short-form Case Studies

Title16: Ireland-Bristol Trade in the Sixteenth Century
Keywordstrade; Ireland; England
Research subject areaHISTORY AND ARCHAEOLOGY
Type of RDM impact/benefitEfficiency in research and data re-use (e.g., reduce duplication of effort)
Summary Impact Type
Facts and figures
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2006
Link

Top of Short-form Case Studies

Title17: Film and an Ethnographic Approach: Buddhist Cosmology in Food
Keywordsbuddhism; food
Research subject areaLANGUAGE, COMMUNICATION AND CULTURE
Type of RDM impact/benefitEfficiency in research and data re-use (e.g., reduce duplication of effort)
Summary Impact Type
Facts and figures6 videos created as a project output
Maturity of the initiative/data sourceRecent (<5 years)
Year (e.g., first data release, first output, year of impact)2014
Link

Top of Short-form Case Studies

Title18: Data About Data Archiving: ICPSR’s Data Sharing in the Social Sciences
Keywordsmeta-analysis
Research subject areaSTUDIES IN HUMAN SOCIETY
Type of RDM impact/benefitEfficiency in research and data re-use (e.g., reduce duplication of effort)
Summary Impact Type
Facts and figures
Maturity of the initiative/data sourceLong-standing (5+ years)
Year (e.g., first data release, first output, year of impact)2010
Link

Top of Short-form Case Studies

Title19: Research Data Supports Restoration: Mackintosh Architecture
Keywordsrestoration; building
Research subject areaBUILT ENVIRONMENT AND DESIGN
Type of RDM impact/benefitEfficiency in research and data re-use (e.g., reduce duplication of effort); Socio/Economic impact
Summary Impact TypeSocietal; Cultural
Facts and figures350 architectural projects shared
Maturity of the initiative/data sourceRecent (<5 years
Year (e.g., first data release, first output, year of impact)2014
Link

Top of Short-form Case Studies

Title20: Evidencing Value of Artistic, Cultural or Sporting Activities: UK Subjective Well-Being Data (SWB)
Keywordsunderstanding society; sport; culture
Research subject areaSTUDIES IN HUMAN SOCIETY
Type of RDM impact/benefitEfficiency in research and data re-use (e.g., reduce duplication of effort)
Summary Impact Type
Facts and figures
Maturity of the initiative/data sourceRecent (<5 years)
Year (e.g., first data release, first output, year of impact)2016
Link

Top of Short-form Case Studies