Green data users
What is green data?
In FinnGen, green data refers to aggregated, anonymous summary-level data from which no individual can be identified. By green data we most commonly mean aggregate level result data, but also other data that are not individual-level participant data (that is red data) can be considered green data.
Examples of green data include:
Summary-level results from FinnGen’s core analysis team, shared with Consortium Partner researchers.
Summary-level results/data generated within the FinnGen Sandbox.
Publicly available summary-level results/data from FinnGen.
Any other data or results without personal-level FinnGen participant information.
How to access green data?
To get access to the green FinnGen data, see the FinnGen access and accounts. Only FinnGen partner organization affiliates can get access to the green data. Approval to access takes up to 7 working days. Green data is accessible by anyone with a @finngen.fi account and the data be downloaded directly to the user's local machine.
Green data that are publicly accessible (not anymore in embargo) can be accessed without credentials. More information on how to access public FinnGen results is found here.
Using green data
Users with credentials (my.name@finngen.fi) can download FinnGen core analysis team produced green data from FinnGen so called "green library" here. Data usage must comply with FinnGen’s 1-year exclusivity policy. Publishing embargoed data requires approval from the FinnGen Scientific Committee (that is an approved analysis proposal). If the user works with green data which has passed the embargo and is publicly available, they don’t need to have an analysis proposal.
Green data tools
There are various ways to explore and browse green data. The full catalog of FinnGen tools, including the green data tools, is available here. For example, the user might be interested in comparing their GWAS result to FinnGen core analysis results, or explore disease codes or lab measurements enriched in a certain endpoint.
How to download green data?
The green data generated by the FinnGen core analysis teams are stored in the “green library” which exists as a Google Cloud storage bucket named “finngen-production-library-green”. To download this data, the user needs to have FinnGen credentials and green data access. The detailed instructions on how to download green data is available here. Users without FinnGen credentials can download publicly available green library data using an online form.
Sandbox users can also generate “green data” as a result of analysis they have conducted on the individual level data (red data) in the Sandbox. Only green data can be downloaded out from the Sandbox. The detailed instructions on how to download results from the user’s Sandbox IVM are here.
Last updated
Was this helpful?