Bringing data to GDI
Here you can find user guides for preparing and submitting data to GDI. They include step-by-step instructions on how to prepare data files for the packaging script, how to run the script on a data-provider's server, and how to submit the dataset package to GDI.
Before starting
Data-providers who intend to share their data via the European User Portal need to make sure that their data and metadata conform to the rules established by 1+MG (Diagram 1). Useful details on data quality can be found under Data Quality. It is also strongly advised to consult the organisation's ELSI expert or DPO to ensure that the legal, ethical and data protection aspects have been taken into account. Sharing data via the User Portal is the organisation's responsibility and needs to be approved at either the executive or the principal investigator level.

Diagram 1. Genomic datasets prepared by data-providers according to GoE/1+MG guidelines can be made compatible with the GDI data flow using gdi-dataset-tool, which requires a metadata file in a specified format as input. The tool generates a Dataset Package that can be securely uploaded to GDI.
Technically, to bring genomic data into the GDI system, the data must be made compatible with the system's data flow and therefore go through validation, conversion and encryption before submission. The GDI team has developed a dataset processing script, gdi-dataset-tool, that helps data-providers prepare a compatible and secure dataset package that can be loaded into GDI.