Data Submission process overview
The EMODnet Data Ingestion portal focuses on data providers and their data sets that are not yet handled and part of the mainstream arrangements and data handling processes of the EMODnet Data Centres.
Note: Data submitters have to register in the Marine-ID register to get an account in the Data Submission service.
Potential data providers are marine data holders that are not yet routinely submitting data sets to national data centres and not (yet) used to practices and standards as used by the international marine data management community. Therefore they must be supported to submit their data packages for open access and use in national data centres and EMODnet.
For collected datasets that are archived at potential data providers this is done by means of the online Data Submission service whereby data providers can submit marine dataset packages. Each package should consist of a collection of datasets and relevant documentation, accompanied by completing a submission form with a limited set of metadata to describe the datasets collection.
A distinction is made between 2 phases in the life cycle of a data submission
- Phase I: from data submission to publishing of the submitted datasets package ‘as is’;
- Phase II: further elaboration of the datasets package and integration (of subsets) in national, European and EMODnet thematic portals.
The workflow process for Phase I from data submission to publishing of the submitted data package ‘as is’ is illustrated below.
To make the threshold for submission relatively low the completion of the submission form has been divided over the data submitter (part 1) and the data centre (part 2) that will be assigned to process the submission.
- Part 1 submission form: a number of key metadata fields to be completed by the data submitter, including uploading of a zip file with the datasets and related documentation;
- Part 2 submission form: a number of additional metadata fields to be completed by the assigned data centre, while reviewing the received datasets package and part 1 metadata and possibly in contact with the data submitter.
A data tracking service is integrated in the Data Submission service to allow data providers to stay informed about the progress from data submission to publishing.
In Phase II the published datasets package will be further processed and curated by the assigned data centre, most probably again in dialogue with the data submitter, in order to make these or subsets fit for long term storage and stewardship in their data management system and to share it with the appropriate European infrastructure and EMODnet thematic portal(s).
The validation and processing of data submissions will be made made easier for data centres if data submitters already apply standard data formats for their datasets collections and also provide relevant documentation as part of their submissions.That will greatly optimise the efforts and time needed for Phases I and II.