Atlas data model
Last updated
Last updated
Atlas uses the Observational Medical Outcomes Partnership (OMOP) common relational data model (CDM). This data model supports both standard and non-standard medical codes.
The OMOP-CDM relational data model used by FinnGen is explained in detail in the OMOP-CDM v.5.4 specification. OMOP-CDP data model conventions are described here.
The data model consists of the following tables. The Standardized clinical data tables and the Result tables are briefly described in the following sections:
Standardized clinical data tables contain all individuals and medical events. Individuals are stored in the PERSON table. In FinnGen, each individual has one observation period stored in the OBSERVATION_PERIOD table. It starts at birth or from the first entry to the registers and ends at death or at the end of the follow-up period. The other tables contain medical events.
OHDSI standardized vocabularies are strictly limited to one OMOP-CDM Domain and all medical event tables belong to only one OMOP-CDM Domain. The Domain has an effect on how cohorts are defined in Atlas.
Table
Domain
Description
PERSON
Uniquely identifies each individual and contains demographic information.
OBSERVATION_PERIOD
Defines the span(s) of time during which events happen to an individual. In FinnGen, each individual has only one observation period. It starts at birth or from the first entry to the registers. It ends at death or at the end of the follow-up period.
VISIT_OCCURENCE
Visit
Events where an individual interacts with the healthcare system for a duration of time. For example, an event could be a visit to a hospital, a diagnosis, a procedure, or a drug intake.
CONDITION_OCCURRENCE
Condition
Medical diagnosis events.
PROCEDURE_OCCURRENCE
Procedure
Medical operation events.
DRUG_EXPOSURE
Drug
Drug exposure events.
MEASUREMENT
Measurement
Laboratory test events.
DEATH
Death
Death events.
OBSERVATION
Observation
Events not captured by other domains.
FinnGen uses both standard and non-standard medical codes. Standard and non-standard codes are stored in the medical event tables using the following naming convention:
Column
Descriptions
_CONCEPT_ID
Foreign key to the medical code used in the OHDSI standard vocabulary. Standard codes are stored in this column.
_SOURCE_CONCEPT_ID
Foreign key to the medical code used in the source data. Non-standard codes are stored in this column.
The result tables contain cohort definitions as well as cohorts created using them.
Table
Description
COHORT_DEFINITION
Contains information to define a cohort. Cohorts are a set of individuals that satisfy inclusion criteria for a duration of time. The information in the COHORT_DEFINITION table is used to instantiate a cohort and store it in the COHORT table.
COHORT
Contains the individuals that satisfy the definition of a cohort stored in the COHORT_DEFINITION table. The period of time an individual is part of the cohort is captured in the COHORT_START_DATE and COHORT_END_DATE columns. Please note than an individual may enter and exit the cohort more than once.