FinnGen Data Specifics

The FinnGen project contains phenotype and genotype data for approximately 500,000 individuals. Sample collection and data releases started in 2017. The main phase of sample collection ends in 2023 at the end of FinnGen2. Data releases and analyses will continue until the end of FinnGen3 in 2027.

FinnGen data

FinnGen data is classified into aggregated "green" data and individual-level "red" data:

  • "green" data: aggregated or otherwise anonymous data that does not relate to only one individual, and from which no individual can be identified. A case count of N ≥ 5 is used as a rule for aggregation to make individual-level data anonymous.

  • "red" data: sensitive, individual-level health and genetic data that is subject to data protection legislation and registry permits. This data must be handled confidentially according to the rules and restrictions set out by the FinnGen project.

In order to understand FinnGen data, we have some specifics listed in this section:

For information about the FinnGen project, genotype and phenotype data, watch the

FinnGen security training videos and visit Finngen Sandbox tutorial site.

Last updated