Finemapping results format
The finemapping results come from two different finemapping methods: FINEMAP and SuSiE.
The purpose of finemapping is to find the set of 1 or more variants most likely to be responsible for the association at that locus. This set of likely variants is referred to as a "credible set". You can read more about the motivations for finemapping in the main concepts: Finemapping.
Most severe transcript is chosen by first taking the most severe among canonical protein coding transcripts, if no canonical transcript exists, then first (random) other protein coding transcript is chosen corresponding to the most severe annotation. Precedence of severity is chosen according to Ensembl Variant Effect Predictor (VEP) default.
SuSiE outputs
Both 95% credible set and 99% credible sets are provided. The file with _99 contains 99% credsets as below. The SuSiE outputs have been annotated using the variant annotation file.
PHENONAME.SUSIE.cred.bgz and PHENONAME.SUSIE_99.cred.bgz
Contains credible set summaries from SuSiE fine-mapping for all genome-wide significant regions.
Column | Description |
region | Region for which the fine-mapping was run |
cs | Running number for independent credible sets in a region |
cs_log10bf | Log10 Bayes factor comparing the solution of this model (cs independent credible sets) to cs -1 credible sets. |
cs_avg_r2 | Average correlation R2 between variants in the credible set |
cs_min_r2 | Minimum R2 between variants in the credible set |
cs_size | How many SNPs the credible set contains |
PHENONAME.SUSIE.cred.summary.tsv and PHENONAME.SUSIE_99.cred.summary.tsv
Summary of credible sets where the top variant for each CS is included.
Column | Description |
trait | Phenotype |
region | Region for which the fine-mapping was run |
cs | Running number for independent credible sets in a region |
cs_log10bf | Log10 Bayes factor comparing the solution of this model (cs independent credible sets) to cs -1 credible sets. |
cs_avg_r2 | Average correlation R2 between variants in the credible set |
cs_min_r2 | Minimum R2 between variants in the credible set |
low_purity | boolean (TRUE, FALSE) indicator if the CS is low purity (low min R2) |
cs_size | How many SNPs the credible set contains |
good_cs | boolean (TRUE, FALSE) indicator if this CS is considered reliable. IF this is FALSE then top variant reported for the CS will be chosen based on minimum p-value in the credible set, otherwise the top variant is chosen by maximum PIP |
cs_id | Credible set ID |
v | Top variant (chr:pos:ref:alt) |
p | Top variant p-value |
beta | Top variant beta |
sd | Top variant standard deviation |
prob | overall PIP of the variant in the region |
cs_specific_prob | PIP of the variant in the current credible set (this and previous are typically almost identical) |
0..n | Configured annotation columns. Typical default most_severe, gene_most_severe giving consequence and gene of top variant |
PHENONAME.SUSIE.snp.bgz and PHENONAME.SUSIE_99.snp.bgz
Contains variant summaries with credible set information.
Column | Description |
trait | Phenotype |
region | Region for which the fine-mapping was run |
v, rsid | Variant IDs |
chromosome | Chromosome no. |
position | Position on the chromosome |
allele1 | Major allele |
allele2 | Minor allele |
maf | Minor allele frequency |
beta | Original marginal beta |
se | Original standard error |
p | Original p-value |
mean | Posterior mean beta after fine-mapping |
sd | Posterior standard deviation after fine-mapping |
prob | Posterior inclusion probability |
cs | Credible set index within region |
lead_r2 | R2 value for a lead variant (the one with maximum PIP) in a credible set |
alphax | Posterior inclusion probability for the xth single effect (x := 1..L where L is the number of single effects/causal variants specified; default: L = 10). |
PHENONAME.SUSIE.snp.filter.tsv and PHENONAME.SUSIE_99.snp.filter.tsv
SNPs that are part of good quality credible sets as reported in PHENONAME.cred.summary.tsv file.
Column | Description |
trait | Phenotype |
region | Region for which the fine-mapping was run |
v | Variant ID (chr:pos:ref:alt) |
cs | Running credible set ID within region |
cs_specific_prob | Posterior inclusion probability for this CS |
chromosome | Chromosome no. |
position | Position on the chromosome |
allele1 | Major allele |
allele2 | Minor allele |
maf | Minor allele frequency |
beta | Original association beta |
p | Original p-value |
se | Original standard error |
most_severe | Most severe consequence of the variant |
gene_most_severe | Gene corresponding to most severe consequence |
FINEMAP outputs
PHENONAME.FINEMAP.config.bgz
Summary of fine-mapping variant configurations from the FINEMAP method.
Column | Description |
trait | Phenotype |
region | Region for which the fine-mapping was run |
rank | Rank of this configuration within a region |
config | Causal variants in this configuration |
prob | Probability across all n independent signal configurations |
log10bf | Log10 Bayes factor for this configuration |
odds | Odds for this configuration |
k | How many independent signals are in this configuration |
prob_norm_k | Probability of this configuration within k independent signals solution |
h2 | SNP heritability of this solution |
#NAME? | 95% confidence interval limits of SNP heritability of this solution |
mean | Marginalized shrinkage estimates of the posterior effect size mean |
sd | marginalized shrinkage estimates of the posterior effect standard deviation |
PHENONAME.FINEMAP.region.bgz
Summary statistics on the number of independent signals in each region.
Column | Description |
trait | Phenotype |
region | Region for which the fine-mapping was run |
h2g_snp or h2g | SNP heritability of this region |
h2g_sd | Standard deviation of SNP heritability of this region |
h2g_lower95 | Lower limit of 95% CI for SNP heritability |
h2g_upper95 | Upper limit of 95% CI for SNP heritability |
log10bf | Log10 Bayes factor compared against null (no signals in the region) |
prob_xSNP | x columns for probabilities of different numbers of independent signals |
expectedvalue | Expectation (average) of the number of signals |
PHENOTYPE.FINEMAP.snp.bgz
Summary statistics of variants and what credible set they are most likely to belong to.
Column | Description |
trait | Phenotype |
region | Region for which the fine-mapping was run |
v | Variant |
index | Running index |
rsid | Variant ID |
chromosome | Chromosome no. |
position | Position on the chromosome |
allele1 | Major allele |
allele2 | Minor allele |
maf | Minor allele frequency |
beta | Original marginal beta (effect size) |
se | Original standard error |
z | Original z-score |
prob | Posterior inclusion probability |
log10bf | Log10 Bayes factor |
mean | Marginalized shrinkage estimates of the posterior effect size mean |
sd | Marginalized shrinkage estimates of the posterior effect standard deviation |
mean_incl | Conditional estimates of the posterior effect size mean |
sd_incl | Conditional estimates of the posterior effect size standard deviation |
p | Original p-value |
csx | Credible set index for given number of causal variants x |
Read more about Finemapping
Last updated