What do we mean by "red" and "green" data?

FinnGen's "green" data is aggregate level data in which at least 5 individuals have been used to generate the results.

These are most commonly combined results from different types of analyses, including FinnGen Core results delivered to Green library by the FinnGen Analysis team (for instance, GWAS summary statistics).

Green data is directly accessible by anyone with a @finngen.fi account and can also be downloaded to any user's local machine.

FinnGen's "red" data is individual-level genotype or phenotype data which is located in the Sandbox and which researchers can use to run their own analyses if individual-level genotypes and/or phenotypes are required as input.

We call this data "red" to remind users that we always need to take extra care and security in working with this data.

The data consists of VCF files that have genotypes for each individual and phenotype files that list phenotypes for all individuals by FinnGen ID. For access to this data you need access to the Sandbox in addition to having a @finngen.fi account.

Watch a FinnGen security training videos to learn more about FinnGen red and green data.

Last updated