Category: Data repositories
The Library currently has the Thomson Reuters Data Citation Index on trial, to provide access to research datasets for UQ staff and researchers. This new index connects researchers to quality data and data sets, across a range of disciplines, from around the world by providing links to data repositories. This makes data more discoverable and accessible by UQ researchers, and can potentially 'speed up the research process'.
Data Citation Index fully indexes a significant number of the world's leading data repositories of critical interest to the research community, including over two million data studies and datasets. The records for the datasets, which include authors, institutions, keywords, citations and other metadata, are connected to related peer-reviewed literature indexed in the Web of KnowledgeSM. Watch the video to find out more.
The purpose of the trial is to evaluate the Index and we are seeking feedback from researchers around UQ on the Data Citation Index. We encourage you to explore this new tool and send your feedback to email@example.com
Ross Wilkinson (ANDS) states, this issue of share celebrates and explains the changes that have taken place at Australian institutions to enable researchers to use data differently - working in new partnerships, addressing new problems and getting properly acknowledged for their data (Feb 2013).
- Progressing Melbourne University's agenda
- Deakin's Library plays a vital role
- Flinders' cross-institution collaboration
- CSIRO's data treasures revealed
- Griffith: Doing it differently
and more.......... share ANDS newsletter, February 2013
NeCTAR's Virtual Laboratories projects are a part of the Australian Government Super Science initiative. They aim to provide a new place to access data repositories and computational tools, collaborate easily, streamline research workflows and enable new opportunities for research innovation.
- CSIRO - Virtual Geophysics Laboratory
- University of Queensland - Virtual Genomic Laboratory
- University of Tasmania - Marine Virtual Laboratory
- The All Sky Virtual Laboratory
- Climate and Weather Sciences Laboratory
- Humanities Networked Infrastructure (HuNI) unlocking and uniting Australia's cultural data
- The Genomics Virtual Laboratory
- The Characterisation Virtual Laboratory: research environments for exploring inner space
A SURF project has helped create a digital home for dying languages. The University of Leiden Library in Holland, in collaboration with the CARDS (Controlled Access to Research Data Stored Securely) project of SURF in the Netherlands, is working on a project with Roberta D'Alessandro, Professor of Italian at Leiden University, to develop a data management system to manage and share important research data on dying Italian dialects. For Roberta and her team being able to upload and share the data they have collected is a central feature of the system, simplifying collaboration and allowing other language researchers to use the material. To read the full report go to Research Information: August/September 2012
The mission of the International Council for
Science is to ensure the long-term stewardship and provision of
quality-assessed data and data services to the international science community
and other stakeholders. To support this goal they have developed World
Data System (WDS) data portal and activity monitor.
The WDS Data Portal enables retrieval of datasets from members of the World Data System, and is searchable using both geographic and temporal coverage.
The WDS Activity Monitor is an interactive portal using the world map to capture current information about each member of the World Data System.
WDS member datasets include:
- National Climatic Data Centre
- National Geophysical Data Centre
- Space Environment Centre, National Weather Services
- NODC - National Oceanographic Data Centre
- PANGAEA - Data Publisher for Earth & Environmental Science
- World Data Centre for Climate
To find out further information go to World Data System.
Dryad is an international data repository in the basic and applied biosciences. The goal of the repository is to preserve all the underlying data reported in a paper at the time of publications. Recognising the emerging trend of data citation, this is an excellent example of a repository supporting the management of data and sharing. See the November 2011 blog on Dryad for further information.
The National Center for Ecological Analysis and Synthesis (NCEAS) Data Repository contains research datasets sourced from NCEAS funded activities. NCEAS, a cross-disciplinary and collaborative institute, has made the repository publicly available through the Knowledge Network for Biocomplexity (KNB) - a network designed to facilitate ecological and environmental research.
Researchers can also register a new dataset, which makes use of Ecological Metadata Language, and Morpho data management software and tools.
The repository is straightforward to use, and contains a wealth of ecological research data and information.
The Australian reported on July 11 2012, a plan for the CSIRO to make
200,000 research papers freely available, dating back to the 1920s.
Jon Curran, CSIRO's general manager of communications stated in the article, "we are removing the reasons why staff shouldn't put their work in our open-access repository."
In addition, to the publication repository, the CSIRO is also aiming to create a portal to contain raw research data.
The article also gives a general overview on the current progression and deliberation into open access by Australia's funding bodies and research organisations.
To find out more, visit CSIRO Research Publications Repository.
tDAR - the Digital Archaeological Record - is an international digital archive and repository. It houses data about archaeological investigations, research, resources and scholarship, to provide broad access to a wide range of archaeological data.
A key aim of tDAR is the long term preservation of digital archaeological data, as well as providing free, easy access to data for the public and researchers. The repository is designed to "enable researchers to more effectively create knowledge of the human past, and resource managers to better preserve and protect archaeological resources."
Within tDAR, users can search for documents, datasets, images, and GIS files, and make use of data integration tools. It also provides a comprehensive list of guides and resources about accessing, using and uploading to the repository, as well as a useful 'data dictionary' and video tutorials.
The Data Hub describes itself as "the easy way to get, use and share data".
The Data Hub is a community driven catalogue of datasets on the Internet. It uses open-source data cataloguing software CKAN, which provides each dataset record with fields for descriptions, formats, ownership, access and subject areas, among others.
Most of the data indexed is open data, which means it is openly licensed, and free to use.
On the site, you can:
- Find data - the Hub contains 3840 datasets that can be viewed or downloaded
- Share data - sign up to add your own datasets
In some cases, the Data Hub can provide data storage, and basic visualisation tools.
CSIRO's Information Management & Technology team has won the 2012 Excellence in eGovernment Awards: Project and Program Management Category. The award was received for their Research Data Service program.
The Awards website, describes the CSIRO Research Data Service (RDS) as follows:
The CSIRO Research Data Service (RDS) is a ground-breaking program designed to establish capabilities to facilitate the capture, description, access and retrieval of CSIRO's research data assets. The RDS program has successfully brought together national and international stakeholders, and supported the data management requirements of the enterprise while simultaneously meeting the data-related needs of the wide range of specific CSIRO research domains. Data provided through the RDS is now being accessed by researchers worldwide. Since its initial release in 2011, there have been thousands of searches and downloads of data that will lead to new and unexpected discoveries being made. For CSIRO scientists RDS provides an important means of protecting, disseminating and preserving our scientific results for the future.
RDS, funded by the ANDS: Seeding the Commons program, provides a platform to search and access CSIRO data at: Data Access Portal.
In early June, the Repositories Support Project (RSP) held an interesting and well-received event on "Scholarly Communications: New Developments in Open Access." The event was attended by librarians, repositories specialists, research officers and copyright consultants. Presentations were held on open access, social media, data citation, and research networking.
You can view the presentations by
downloading Powerpoints, or watching YouTube videos of the
Future events include webinars on "Repositories: management, policies and best practices" and "Bibliometrics: A way of demonstrating the importance of institutional research".
RSP is a JISC-funded initiative to build "repository capacity, knowledge and skills within UK higher education institutions. The project aim is to "progress the vision of a deployed network of interoperable repositories for academic papers, learning materials and research data across the UK."
Data Archiving and Networked Services (DANS), based in the Netherlands, promotes "sustained access to digital research data". DANS actively encourages researchers to archive research data through the online archiving system EASY.
EASY can also be used to deposit research data. Data is stored securely and permanently in a sustainable manner, and made available to other researchers under conditions specified by the depositor. DANS also makes its data management plan publicly available.
Click here to visit the DANS Data Archive.
The British Oceanographic Data Centre (BODC) is a United Kingdom national facility designed to hold, manage and distribute data concerning the marine environment. Data holdings include biological, chemical, physical and geophysical data, containing measurements of nearly 22,000 different variables.
BODC describes their approach to marine data management as:
- Careful storage, quality control and archiving of data - so that data are unaffected by changes in technology and will be available into the future.
- Distributing data to scientists, education, industry and the public - with a goal to enable online access to all data via the web site.
- Working alongside scientists during marine research projects - to provide a data management service during the life time of the project.
BODC holds a wealth of publicly accessible marine data under the provision of licence agreements. Read more about:
- Submitting, requesting and retrieving data.
- Data management services
- UK, European and International projects at BODC.
- The Published Data Library (PDL), which is an operational prototype that provides snapshots of specially chosen data sets, archived using rigorous version management. This enables citation of the data set in journal papers through the assignment of a Digital Object Identifier (DOI) in collaboration with the British Library.
The Atlas of Living Australia (Atlas) contains information on all the known species in Australia aggregated from a wide range of data providers: museums, herbaria, community groups, government departments, individuals and universities. The Atlas is the Australian node of the Global Biodiversity Information Facility (GBIF), and since 2001 the GBIF has been encouraging free and open access to biodiversity data through global online networks.
The Atlas of Living Australia can be used to:
- access information pages for each species containing photos, descriptions, maps and observations
- access scientific and common names
- explore the flora and fauna reported around your neighbourhood
- learn about Australia's biodiversity collections at museums, herbaria and other institutions
- learn about citizen science projects
- map, analyse and visualise biodiversity and environmental data and trends
- access tools to help track changes in biodiversity and the environment
- download and use open source tools
- download biodiversity data
- access images, literature and genetic information through Australian nodes of international data repositories
- volunteer for digitisation projects
- Upload datasets.
There are 370 datasets available in the Atlas, and the licensing tags make it clear which data can be used and how. The site also provides extensive explanatory information and help pages, including overviews on how data are integrated and described.
The Atlas of Living Australia is an Australian Government Initiative and is licensed under the Creative Commons Attribution 3.0 Australia License.