If you have any queries about specific or general data management issues, please contact the HMTF Data Manager (email@example.com – in post until 25th July).
Data management is a large topic and there are many excellent resources available on the internet. This page aims to provide links to information specific to the HMTF programme through to general information on best practices in research data management.
As a programme, NERC require that we document our datasets (metadata) to a standard that would allow a future researcher to be able to understand or potentially duplicate the dataset. Comprehensive documentation of datasets is good scientific research practice and will ensure that datasets archived to NERC computer centres (or other appropriate repositories e.g. ForestPlots) can contribute to future research. Part of the documentation process is also ensuring that anyone re-using an archived datasets correctly references the original researchers.
At the end of the programme all datasets that have been produced by research funded by NERC are required to be offered to a NERC computer centre for long term archiving to be available to future researchers (see section on EIDC below for further details). This condition is part of the funding agreement. Where appropriate, embargoes of up to 2 years can be put on the dataset. Curation of the archived datasets is free for files up to 1Tb (some conditions apply – please contact EIDC or data manager for further information).
If you need a DOI for a NERC funded dataset to support a journal paper, please apply to the NERC data centre for the DOI rather than using another repository. Free curation of the dataset is provided by NERC in almost all cases.
If your dataset is complete and cleaned, do not wait until the end of the programme if you can archive it now. A DOI will be issued when a dataset is archived so the procedure below should be followed.
DOI requests should be made via the HMTF data manager but until the new data manager is in post you should apply directly to our EIDC contact Claire Wood (firstname.lastname@example.org). Before contacting the HMTF data manager (or Claire Wood), download the following documents and complete as much as possible. However make contact as soon as possible to avoid urgent DOI requests.
You will also need to provide Supporting documentation (Guidelines here) and complete the information for the EIDC discovery metadata catalogue (template here). Examples of completed documentation: Service agreement example; Supporting documentation example
Metadata – describing your datasets
Metadata is data that describes other data. The EIDC and NERC require metadata that conforms to the UK GEMINI standard for spatial data.
NERC requires discovery metadata – the essential information that enables the potential user of data to find out if a particular resource exists, its location, ownership and whether it meets their requirements.
HMTF data management resources
All researchers have been sent an Excel spreadsheet template to fill in and return to the data manager (email@example.com) that gives basic details about the datasets they will be creating e.g. name, description, file type, likely final size, date dataset likely to be complete.
NERC metadata guidelines
General good practice guidance
Searches on metadata portals and search engines to which the metadata is exposed can result in a large number of results; the metadata should therefore be sufficiently clear and comprehensible to enable the reader to understand the nature of the entry and to assess whether it is suitable for reuse. Poor quality metadata can mean that a resource is effectively hidden from users and remains unused.
When writing good quality metadata, always keep in mind the ABCD of good discovery metadata. Metadata should be:
Accurate – correctly and precisely describe the resource in question
Beneficial – contains information that is useful to the end user without lots of extraneous, irrelevant information
Clear – easily understandable by a non-technical user and unambiguous
Distinctive – contains information that allows it to be distinguished from other, potentially similar, resources
The following metadata fields will be needed. Guidance on each field is given here which should be read in conjunction with this document.
Spatial extent: (bounding box)
Temporal extent: (dates from / to):
Spatial Reference system: e.g. British National Grid, WGS
Spatial representation type: e.g. raster, vector
Spatial resolution (For gridded data, this is the area of the ground (in metres) represented in each pixel. For point data, the ground sample distance is the degree of confidence in the point’s location e.g. for a point expressed as a six-figure grid reference, SN666781, the resolution would be 100m)
Author name: (Bloggs, J.J.)
Where appropriate, all datasets should also have detailed metadata on aspects such as experimental design, sampling, fieldwork or laboratory instrumentation, analytical methods; any information that would be necessary for a researcher not involved in the project to understand and/or re-use the dataset. Further guidance is available