Metadata
What is metadata?
Metadata describes what your data is, how others can use it, and how it is physically put together. It allows other researchers to determine how, when, where, why and by whom the data was originally produced or collected.
Metadata may be stored as XML, in an accompanying document, a set of data fields in a repository, or a README file. Metadata is generally broken down into three different kinds:
Descriptive metadata
This type enables a dataset to be discovered and identified. Examples include a project title, a description of the research scope, an abstract, and relevant keywords. In many disciplines, controlled vocabularies and nomenclature exist to describe data.
Administrative metadata
This type helps manage the dataset. It includes rights management, access control, use requirements, technical data on file creation and quality control, file formats, software/hardware for access and use, and any information relevant to archiving and preservation.
Structural metadata
This type describes how items relate to one another, e.g. that chapter 1 comes before chapter 2 in a book, or that file x is the JPEG format of the archival TIFF image file z.
Why is there a need for metadata?
For data to be successfully stored, managed, shared and re-used throughout its life cycle, documentation and description of the data is essential. Documentation and metadata requirements will vary according to discipline and the nature of the research being done. Metadata and other necessary documentation to make data usable by and discoverable by others should be identified during data management planning.
Elements of metadata
Metadata is usually structured according to existing standards or 'schemas'. The most common metadata schema is probably Dublin Core, developed in 1995.
Simple Dublin core involves 15 elements, all of which are optional and repeatable, and which can appear in any order:
- title
- date
- type
- rights
- creator
- subject
- source
- description
- publisher
- contributor
- format
- identifier
- language
- relation
- coverage
'The metadata accompanying your data should be written for a user 20 years into the future ... Prepare the metadata for a user who is unfamiliar with your project, methods, or observations.'
Oak Ridge National Laboratory, Distributed Active Archive Center
The following list of standards and schemas is not exhaustive, but will provide some guidance about the appropriate schemas and standards in current use in some disciplines.
| Generic | Standard | Overview |
|---|---|---|
| Used as a basis for many industry-specific metadata schemes. Recommended if no specific discipline-related scheme exists |
|
Internationally recognised as the set of core metadata elements used to promote data interchange and interoperability across domains. |
| Discipline | Standard | Overview |
| Geographical and Geospatial | Geographic
information - metadata AS/NZS ISO 19115:2005 |
ISO standard for description of geographic information and services. |
| Government Information and Services | AGLS Metadata
Standard AS 5044-2010 |
Based on DC and mandated for use within Australian Government agencies. |
| Information Technology | Information technology-Metadata registries (MDR) AS ISO/IEC 11179.1-2005 |
Facilitates data interchange by electronic commerce and database designers. |
| Records Management | Records management processes - Metadata for records AS ISO 23081.1-2006 |
A two-part technical specification defining recordkeeping metadata. |
| Discipline | Scheme/Schema | Overview |
| Spatial | ANZLIC | Best practice in spatial data management across Australia and New Zealand. |
| Science | CCLRC Scientific Metadata Model | Attempts to capture scientific activities at different levels. |
| Social and Behavioural Sciences | Data Documentation Initiative (DDI) | A standard for technical documentation describing social science data. |
| Humanities, Social Sciences, Linguistics | Text Encoding Initiative | A standard that covers encoding methods for machine-readable texts. |
| Art Objects, Architecture, Groups of Objects, and Visual and Textual Surrogates | Categories for the Description of Art Works (CDAW) | Identifies the minimum information needed to uniquely identify a particular work of art or museum object. |
| Visual Arts | Visual Resources Association Core (VRA) | Used to describe works of visual culture as well as the images that document them. |


Loading