1 of 1

Data description

File naming pattern and file structure

Summary association statistics

GWAS summary statistics (tab-delimited, bgzipped, genome build 38, tabix index files included) are named as {endpoint}.gz. For example, endpoint I9_CHD has I9_CHD.gz and I9_CHD.gz.tbi.

To learn more about the methods used, see section .

The {endpoint}.gz have the following structure:

Fine-mapping results

Two fine-mapping methods were used:

Fine-mapping results are tab-delimited and bgzipped.

SuSiE results have the following filename pattern:

{endpoint}.SUSIE.cred.bgz
{endpoint}.SUSIE.cred_99.bgz
{endpoint}.SUSIE.snp.bgz

FINEMAP results have the following filename pattern:

{endpoint}.FINEMAP.config.bgz
{endpoint}.FINEMAP.region.bgz
{endpoint}.FINEMAP.snp.bgz

To learn more about the methods used, see section .

{endpoint}.SUSIE.cred.bgz contain credible set summaries from SuSiE fine-mapping for all genome-wide significant regions. {endpoint}.SUSIE.cred_99.bgz contain the 99% credible set summaries while the default is 95%. They have the following structure:

Column name

Description

{endpoint}.SUSIE.snp.bgz contain variant summaries with credible set information and have the following structure:

{endpoint}.FINEMAP.config.bgz contain summary fine-mapping variant configurations from FINEMAP method and have the following structure:

Column name

Description

{endpoint}.FINEMAP.region.bgz contain summary statistics on number of independent signals in each region and have the following structure:

Column name

Description

{endpoint}.FINEMAP.snp.bgz has summary statistics of variants and into what credible set they may belong to. Columns:

Column name

Description

Variant annotation

The variant annotation has measures (HWE, INFO, ...) listed per batch.

Gene-based burden test results of LoF variants

Loss of function (LoF) variants were generated from vcf files with VEP (). LoF variants are defined as having consequences in the list [frameshift_variant,splice_donor_variant,stop_gained,splice_acceptor_variant]. Also, a max_maf (0.01) and minimum info score (0.8) filters are applied. Then a bgen file is formed by filtering chromosome based vcfs and merging them into a single file, allowing us to run the whole analysis in one data set. Then the bgen is passed to step 2 of in burden mode, which uses the nulls from the standard GWAS runs.

## File structure

### Data

| File | Description |

|---|---|

|finngen_R8_lof_txt.gz | Merged results, sorted by mglop. |

|finngen_R8_lof_variants.txt | A tsv file with variant/geno/lof data used in the run. |

|finngen_R8_lof_sig_hits.txt | A summary of the results only including hits for mlogp > 3 and sorted by difference between mlogp and max(mlogp) of its variants.|

### Documentation

| File | Description |

|---|---|

|finngen_R8_lof.log| Merged logs of all runs.|

Data description

File naming pattern and file structure

Summary association statistics

GWAS summary statistics (tab-delimited, bgzipped, genome build 38, tabix index files included) are named as {endpoint}.gz. For example, endpoint I9_CHD has I9_CHD.gz and I9_CHD.gz.tbi.

To learn more about the methods used, see section .

The {endpoint}.gz have the following structure:

Fine-mapping results

Two fine-mapping methods were used:

Fine-mapping results are tab-delimited and bgzipped.

SuSiE results have the following filename pattern:

{endpoint}.SUSIE.cred.bgz
{endpoint}.SUSIE.cred_99.bgz
{endpoint}.SUSIE.snp.bgz

FINEMAP results have the following filename pattern:

{endpoint}.FINEMAP.config.bgz
{endpoint}.FINEMAP.region.bgz
{endpoint}.FINEMAP.snp.bgz

To learn more about the methods used, see section .

Column name

Description

{endpoint}.SUSIE.snp.bgz contain variant summaries with credible set information and have the following structure:

{endpoint}.FINEMAP.config.bgz contain summary fine-mapping variant configurations from FINEMAP method and have the following structure:

Column name

Description

{endpoint}.FINEMAP.region.bgz contain summary statistics on number of independent signals in each region and have the following structure:

Column name

Description

{endpoint}.FINEMAP.snp.bgz has summary statistics of variants and into what credible set they may belong to. Columns:

Column name

Description

Variant annotation

The variant annotation has measures (HWE, INFO, ...) listed per batch.

Gene-based burden test results of LoF variants

## File structure

### Data

| File | Description |

|---|---|

|finngen_R8_lof_txt.gz | Merged results, sorted by mglop. |

|finngen_R8_lof_variants.txt | A tsv file with variant/geno/lof data used in the run. |

|finngen_R8_lof_sig_hits.txt | A summary of the results only including hits for mlogp > 3 and sorted by difference between mlogp and max(mlogp) of its variants.|

### Documentation

| File | Description |

|---|---|

|finngen_R8_lof.log| Merged logs of all runs.|

Data description

hashtagSummary association statistics

hashtagFine-mapping results

hashtagVariant annotation

hashtagGene-based burden test results of LoF variants

Data description

hashtagSummary association statistics

hashtagFine-mapping results

hashtagVariant annotation

hashtagGene-based burden test results of LoF variants

Summary association statistics

Fine-mapping results

Variant annotation

Gene-based burden test results of LoF variants

Summary association statistics

Fine-mapping results

Variant annotation

Gene-based burden test results of LoF variants