Atlas data model
Last updated
Last updated
Atlas uses the Observational Medical Outcomes Partnership (OMOP) common relational data model (CDM). This data model supports both standard and non-standard medical codes.
The OMOP-CDM relational data model used by FinnGen is explained in detail in the OMOP-CDM v.5.4 specification. OMOP-CDP data model conventions are described here.
The data model consists of the following tables. The Standardized clinical data tables and the Result tables are briefly described in the following sections:
Standardized clinical data tables contain all individuals and medical events. Individuals are stored in the PERSON table. In FinnGen, each individual has one observation period stored in the OBSERVATION_PERIOD table. It starts at birth or from the first entry to the registers and ends at death or at the end of the follow-up period. The other tables contain medical events.
OHDSI standardized vocabularies are strictly limited to one OMOP-CDM Domain and all medical event tables belong to only one OMOP-CDM Domain. The Domain has an effect on how cohorts are defined in Atlas.
Table | Domain | Description |
PERSON | Uniquely identifies each individual and contains demographic information. | |
OBSERVATION_PERIOD | Defines the span(s) of time during which events happen to an individual. In FinnGen, each individual has only one observation period. It starts at birth or from the first entry to the registers. It ends at death or at the end of the follow-up period. | |
VISIT_OCCURENCE | Visit | Events where an individual interacts with the healthcare system for a duration of time. For example, an event could be a visit to a hospital, a diagnosis, a procedure, or a drug intake. |
CONDITION_OCCURRENCE | Condition | Medical diagnosis events. |
PROCEDURE_OCCURRENCE | Procedure | Medical operation events. |
DRUG_EXPOSURE | Drug | Drug exposure events. |
MEASUREMENT | Measurement | Laboratory test events. |
DEATH | Death | Death events. |
OBSERVATION | Observation | Events not captured by other domains. |
FinnGen uses both standard and non-standard medical codes. Standard and non-standard codes are stored in the medical event tables using the following naming convention:
Column | Descriptions |
_CONCEPT_ID | Foreign key to the medical code used in the OHDSI standard vocabulary. Standard codes are stored in this column. |
_SOURCE_CONCEPT_ID | Foreign key to the medical code used in the source data. Non-standard codes are stored in this column. |
The result tables contain cohort definitions as well as cohorts created using them.
Table | Description |
COHORT_DEFINITION | Contains information to define a cohort. Cohorts are a set of individuals that satisfy inclusion criteria for a duration of time. The information in the COHORT_DEFINITION table is used to instantiate a cohort and store it in the COHORT table. |
COHORT | Contains the individuals that satisfy the definition of a cohort stored in the COHORT_DEFINITION table. The period of time an individual is part of the cohort is captured in the COHORT_START_DATE and COHORT_END_DATE columns. Please note than an individual may enter and exit the cohort more than once. |