FinnGen Data Specifics
Last updated
Last updated
The FinnGen project contains phenotype and genotype data for approximately 520,000 individuals. Sample collection and data releases started in 2017. The main phase of sample collection ended in 2023, at the end of FinnGen2. Data releases and analyses will continue until the end of FinnGen3 in 2027.
FinnGen data is classified into aggregated "green" data and individual-level "red" data:
"green" data: aggregated or otherwise anonymous data that does not relate to only one individual, and from which no individual can be identified. A case count of N ≥ 5 is used as a rule for aggregation to turn individual-level data into anonymous data.
"red" data: sensitive, individual-level health and genetic data that is subject to data protection legislation and registry permits. This data must be handled confidentially according to the rules and restrictions set out by the FinnGen project.
In order to understand FinnGen data, we have some specifics listed in this section:
For information about the FinnGen project, genotype and phenotype data, watch the FinnGen security training videos and visit Finngen Sandbox tutorial site.