Whole exome sequencing (WES) data
Data Location
Sandbox:
Google Cloud bucket:
Content
This dataset contains whole exome sequence (WES) variants (not raw data) for 25,201 individuals from the following cohorts:
FINRISK: FR92, FR97, FR02, FR07
H2000
SUPER
The FINRISK cohorts contains the following sub projects:
FINRISK AD (N=324) sequenced at Broad
FINRISK IBD (N=945) sequenced at Broad
FINRISK controls for SUPER (FR02, N=849) sequenced at Broad
FINRISK FinMetSeq (N=10,085) sequenced at WashU
Key Details and Data Processing
The dataset includes several duplicate samples labeled as _dup1, _dup2, dup_3, and dup4.
Genotypes were called at Broad for gnomAD v4. The Reference Genome used was GRCh38.
Variants were filtered with GQ > 20 and DP > 10, removing variants with AC = 0.
Sample identity was verified by matching genetic duplicates to imputed data to ensure the same FINNGENID was used across sources.
Samples without matching FinnGen individuals were removed.
Last updated
Was this helpful?