@@ -35,36 +35,35 @@ The project implements an ETL conversion of MIMIC IV PhysioNet dataset to OMOP C
3535### How to run the conversion ###
3636
3737######To run ETL end-to-end:######
38- * update config files accordingly
39- * see conf/\* .etlconf to set project level variables
40- * see vocabulary_refresh/README.md to configure vocabulary refresh workflow
41- * set the project root (location of this file) as the current directory
42- * run the commands given below
38+ * load the latest standard OMOP vocabularies from http://athena.ohdsi.org if needed
39+ * create a working copy of the loaded vocabularies, where custom mapping data will be added to
40+ * set variables in vocabulary_refresh/README.md
41+ * run vocabulary refresh commands given below from directory "vocabulary_refresh"
42+ * set the project variables in conf/\* .etlconf
43+ * run script "wf_read" to load waveform sample data if needed
44+ * run workflow commands below in the given sequence
45+ * in the workflow commands <env > is the "environment" name, which equals "dev" for the demo dataset and "full" for the full set
4346
47+ * set the project root (location of this file) as the current directory
4448```
4549cd vocabulary_refresh
4650python vocabulary_refresh.py -s10
4751python vocabulary_refresh.py -s20
4852python vocabulary_refresh.py -s30
4953cd ../
50- python scripts/wf_read.py -e conf/dev .etlconf
51- python scripts/run_workflow.py -e conf/dev .etlconf -c conf/workflow_ddl.conf
52- python scripts/run_workflow.py -e conf/dev .etlconf -c conf/workflow_staging.conf
53- python scripts/run_workflow.py -e conf/dev .etlconf -c conf/workflow_etl.conf
54- python scripts/run_workflow.py -e conf/dev .etlconf -c conf/workflow_ut.conf
55- python scripts/run_workflow.py -e conf/dev .etlconf -c conf/workflow_metrics.conf
56- python scripts/run_workflow.py -e conf/dev .etlconf -c conf/workflow_unload.conf
54+ python scripts/wf_read.py -e conf/<env> .etlconf
55+ python scripts/run_workflow.py -e conf/<env> .etlconf -c conf/workflow_ddl.conf
56+ python scripts/run_workflow.py -e conf/<env> .etlconf -c conf/workflow_staging.conf
57+ python scripts/run_workflow.py -e conf/<env> .etlconf -c conf/workflow_etl.conf
58+ python scripts/run_workflow.py -e conf/<env> .etlconf -c conf/workflow_ut.conf
59+ python scripts/run_workflow.py -e conf/<env> .etlconf -c conf/workflow_metrics.conf
60+ python scripts/run_workflow.py -e conf/<env> .etlconf -c conf/workflow_unload.conf
5761```
5862
5963######To look at UT and Metrics reports:######
6064* see metrics dataset name in the corresponding .etlconf file
6165
6266``` SQL
63- -- UT report
64- SELECT report_starttime, table_id, test_type, field_name
65- FROM metrics_dataset .report_unit_test
66- WHERE NOT test_passed
67- ;
6867-- Metrics - row count
6968SELECT * FROM metrics_dataset .me_total ORDER BY table_name;
7069-- Metrics - person and visit summary
@@ -80,13 +79,19 @@ SELECT
8079FROM metrics_dataset .me_mapping_rate
8180ORDER BY table_name, concept_field
8281;
83- -- Metrics - Top 100 Mapped and Unmapped
82+ -- Metrics - Mapped and Unmapped source values
8483SELECT
8584 table_name, concept_field, category, source_value, concept_id, concept_name,
8685 count AS row_count,
8786 percent AS rows_percent
8887FROM metrics_dataset .me_tops_together
8988ORDER BY table_name, concept_field, category, count DESC ;
89+ -- UT report
90+ SELECT report_starttime, table_id, test_type, field_name, test_passed
91+ FROM mimiciv_full_metrics_2023_02_17 .report_unit_test
92+ order by table_id, report_starttime
93+ -- WHERE NOT test_passed
94+ ;
9095```
9196
9297######More option to run ETL parts:######
@@ -104,6 +109,19 @@ ORDER BY table_name, concept_field, category, count DESC;
104109
105110### Change Log (latest first) ###
106111
112+ ** 2023-02-17**
113+
114+ * MIMIC 2.2 is issued. Run ETL on MIMIC 2.2.
115+ * minor change to measurement.value_source_value:
116+ * populate the field always instead of populating only when value_as_number is null
117+ * minor change to custom mapping vocabularies:
118+ * mimiciv_drug_ndc,
119+ * mimiciv_drug_route,
120+ * mimiciv_meas_lab_loinc,
121+ * mimiciv_obs_drgcodes,
122+ * mimiciv_proc_itemid
123+ * run with OMOP vocabularies v16-JAN-23
124+
107125** 2022-09-09**
108126
109127* MIMIC 2.0 is issued: run ETL on MIMIC 2.0.
0 commit comments