Glossary
GDI: European Genomic Data Infrastructure (2022-2026), a project under the 1+ Million Genomes (1+MG) initiative that aims to build national infrastructures for making local genomic data discoverable and accessible for secondary research.
Life Science AAI: an authentication and authorization service for academia that relies on the existing authentication services of research institutes and universities.
TARA: the State Authentication Service, which provides secure authentication with Estonian ID-card, Mobile-ID, Smart-ID, and European Union notified eID (eIDAS) in public online services.
JWT: JSON Web Token, a standard for passing on authenticated user information as encoded text whose authenticity can be verified, typically through a digital signature.
GA4GH Passport & Visas: a standard that builds on JWT for passing a user’s visa claims from visa issuers through authentication services.
User Portal: public website for researchers to discover, filter, and access the datasets of member-state GDI nodes.
Beacon: an API specification for making genomic datasets queryable over a REST API.
Beacon-Network: a service that implements the Beacon API but delegates each query to a list of registered Beacon services, then combines and returns their responses.
GA4GH Crypt4GH: a file encryption standard based on the ChaCha20 symmetric stream cipher and the Poly1305 message authentication code. It supports passing the content encryption key in the file header, encrypted for one or more recipients who have provided their public keys.
FAIR Data Point: a metadata service specification, based on DCAT (W3C’s Data Catalog Vocabulary) version 2, for publishing information about datasets.
Secure Processing Environment: a legal term for computer systems with hardened security that allow arbitrary scripts to be executed on sensitive data, with countermeasures that prevent the sensitive data from being exported from the system.
SAPU: a private research environment (Linux with a graphical user interface) for working on sensitive data; the service is offered by the HPC center at the University of Tartu.
HPC: High-Performance Computing center, an organisation providing advanced computational services, mostly for scientific research.
S3: initially an Amazon cloud service called Simple Storage Service; in this document it refers to the API specification (protocol) for storing and accessing data over HTTP.
Minio: open-source software that implements the S3 protocol.
HTTP(S): (Secure) Hypertext Transfer Protocol is an Internet messaging protocol for requesting and sending data over a network; “Secure” indicates HTTP messaging over a TLS-encrypted connection.
(S)FTP: (Secure) File Transfer Protocol is an Internet messaging protocol for requesting and sending files over a network; “Secure” refers to SFTP, the SSH File Transfer Protocol, which transfers files over an encrypted SSH connection rather than over TLS.
GPFS: “General Parallel File System” is a high-performance clustered file system developed by IBM and introduced in 1998; it supports a maximum file size of 8 EB.
Kubernetes: an open-source container orchestration system for automated software deployment, scaling, and management in large clusters.
Vault: open-source software (developed by HashiCorp) that provides various data-security features, such as reading and writing secrets, data encryption, and managing access tokens.
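
Example (JWT): the JWT entry above describes encoded text that can be verified. The following is a minimal sketch of that idea using only the Python standard library. The function names (`sign_hs256`, `verify_hs256`), the shared secret, and the `sub` claim value are assumptions made for illustration and do not come from any GDI component. The sketch uses the symmetric HS256 algorithm to stay self-contained; real services typically use asymmetric signatures and a maintained JWT library.

```python
import base64
import hashlib
import hmac
import json


def b64url_encode(raw: bytes) -> str:
    """Base64url-encode without padding, as JWT segments require."""
    return base64.urlsafe_b64encode(raw).rstrip(b"=").decode("ascii")


def b64url_decode(segment: str) -> bytes:
    """Decode a base64url segment, restoring the stripped padding."""
    return base64.urlsafe_b64decode(segment + "=" * (-len(segment) % 4))


def sign_hs256(payload: dict, secret: bytes) -> str:
    """Build a three-part token: header.payload.signature (hypothetical helper)."""
    header = {"alg": "HS256", "typ": "JWT"}
    signing_input = ".".join(
        b64url_encode(json.dumps(part, separators=(",", ":")).encode("utf-8"))
        for part in (header, payload)
    )
    signature = hmac.new(secret, signing_input.encode("ascii"), hashlib.sha256).digest()
    return signing_input + "." + b64url_encode(signature)


def verify_hs256(token: str, secret: bytes) -> dict:
    """Return the payload if the signature matches, else raise ValueError."""
    header_b64, payload_b64, signature_b64 = token.split(".")
    header = json.loads(b64url_decode(header_b64))
    if header.get("alg") != "HS256":
        raise ValueError("unsupported algorithm")
    expected = hmac.new(
        secret, f"{header_b64}.{payload_b64}".encode("ascii"), hashlib.sha256
    ).digest()
    # Constant-time comparison avoids leaking how many bytes matched.
    if not hmac.compare_digest(expected, b64url_decode(signature_b64)):
        raise ValueError("signature mismatch")
    return json.loads(b64url_decode(payload_b64))


if __name__ == "__main__":
    demo_secret = b"demo-secret"  # assumption: illustrative value only
    demo_token = sign_hs256({"sub": "researcher-1"}, demo_secret)
    print(demo_token)
    print(verify_hs256(demo_token, demo_secret))
```

The payload segment is only encoded, not encrypted, so anyone holding the token can read the claims; the signature only shows that the claims have not been altered since they were issued.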