GWAS results format

The GWAS analysis produces the following types of files:

Summary statistics

The summary statistics are bgzipped, tabix-indexed tab separated files (.tsv). They contain the summary statistics of the GWAS analysis. The columns are:

Column

Description

#chrom

chromosome

pos

chromosomal position (in base pairs)

ref

reference allele

alt

alternate allele

rsids

rsIDs related to this variant

nearest_genes

nearest genes

pval

analysis p-value

mlogp

-log10(p-value)

beta

effect size of alternate allele

sebeta

standard error of effect size

af_alt

allele frequency for the alternate allele

af_alt_cases

alternate allele frequency in cases

af_alt_controls

alternate allele frequency in controls

Variant inclusion and INFO score

The GWAS analysis is performed for all variants in the genotype data, and the summary statistics contain all of these variants. These results are then uploaded to the green library, and shown in pheweb. An exception to this are variants with minor allele count less than 10, which are skipped during the GWAS analysis. For some subsequent analysis such as fine-mapping, variants with INFO score lower than 0.6 are filtered out. For each release you can find the variant's INFO score, as well as other annotations, in the variant annotation file, which can be found in the green library. The path is: gs://finngen-production-library-green/finngen_R[RELEASE_NUMBER]/finngen_R[RELEASE_NUMBER]_analysis_data/annotations/

Plots

The plots produced include three types of plots: a manhattan plot of the GWAS, a manhattan plot with a log^2-scaled y-axis, and a qqplot of the GWAS.

Read more about GWAS analysis. See more information about What are QQ and Manhattan plots.

Last updated