GWAS results format
The GWAS analysis produces the following types of files:
Summary statistics
The summary statistics are bgzipped, tabix-indexed tab separated files (.tsv). They contain the summary statistics of the GWAS analysis. The columns are:
Column
Description
#chrom
chromosome
pos
chromosomal position (in base pairs)
ref
reference allele
alt
alternate allele
rsids
rsIDs related to this variant
nearest_genes
nearest genes
pval
analysis p-value
mlogp
-log10(p-value)
beta
effect size of alternate allele
sebeta
standard error of effect size
af_alt
allele frequency for the alternate allele
af_alt_cases
alternate allele frequency in cases
af_alt_controls
alternate allele frequency in controls
Variant inclusion and INFO score
The GWAS analysis is performed for all variants in the genotype data, and the summary statistics contain all of these variants. These results are then uploaded to the green library, and shown in pheweb. An exception to this are variants with minor allele count less than 10, which are skipped during the GWAS analysis. For some subsequent analysis such as fine-mapping, variants with INFO score lower than 0.6 are filtered out. For each release you can find the variant's INFO score, as well as other annotations, in the variant annotation file, which can be found in the green library. The path is: gs://finngen-production-library-green/finngen_R[RELEASE_NUMBER]/finngen_R[RELEASE_NUMBER]_analysis_data/annotations/
Plots
The plots produced include three types of plots: a manhattan plot of the GWAS, a manhattan plot with a log^2-scaled y-axis, and a qqplot of the GWAS.
Read more about GWAS analysis. See more information about What are QQ and Manhattan plots.
Last updated