Skip to content

Commit 428ff3e

Browse files
changes made to run against MIMIC 2.2 on 2023-02-17
1 parent 0389899 commit 428ff3e

19 files changed

+6633
-2709
lines changed

README.md

Lines changed: 36 additions & 18 deletions
Original file line numberDiff line numberDiff line change
@@ -35,36 +35,35 @@ The project implements an ETL conversion of MIMIC IV PhysioNet dataset to OMOP C
3535
### How to run the conversion ###
3636

3737
######To run ETL end-to-end:######
38-
* update config files accordingly
39-
* see conf/\*.etlconf to set project level variables
40-
* see vocabulary_refresh/README.md to configure vocabulary refresh workflow
41-
* set the project root (location of this file) as the current directory
42-
* run the commands given below
38+
* load the latest standard OMOP vocabularies from http://athena.ohdsi.org if needed
39+
* create a working copy of the loaded vocabularies, where custom mapping data will be added to
40+
* set variables in vocabulary_refresh/README.md
41+
* run vocabulary refresh commands given below from directory "vocabulary_refresh"
42+
* set the project variables in conf/\*.etlconf
43+
* run script "wf_read" to load waveform sample data if needed
44+
* run workflow commands below in the given sequence
45+
* in the workflow commands <env> is the "environment" name, which equals "dev" for the demo dataset and "full" for the full set
4346

47+
* set the project root (location of this file) as the current directory
4448
```
4549
cd vocabulary_refresh
4650
python vocabulary_refresh.py -s10
4751
python vocabulary_refresh.py -s20
4852
python vocabulary_refresh.py -s30
4953
cd ../
50-
python scripts/wf_read.py -e conf/dev.etlconf
51-
python scripts/run_workflow.py -e conf/dev.etlconf -c conf/workflow_ddl.conf
52-
python scripts/run_workflow.py -e conf/dev.etlconf -c conf/workflow_staging.conf
53-
python scripts/run_workflow.py -e conf/dev.etlconf -c conf/workflow_etl.conf
54-
python scripts/run_workflow.py -e conf/dev.etlconf -c conf/workflow_ut.conf
55-
python scripts/run_workflow.py -e conf/dev.etlconf -c conf/workflow_metrics.conf
56-
python scripts/run_workflow.py -e conf/dev.etlconf -c conf/workflow_unload.conf
54+
python scripts/wf_read.py -e conf/<env>.etlconf
55+
python scripts/run_workflow.py -e conf/<env>.etlconf -c conf/workflow_ddl.conf
56+
python scripts/run_workflow.py -e conf/<env>.etlconf -c conf/workflow_staging.conf
57+
python scripts/run_workflow.py -e conf/<env>.etlconf -c conf/workflow_etl.conf
58+
python scripts/run_workflow.py -e conf/<env>.etlconf -c conf/workflow_ut.conf
59+
python scripts/run_workflow.py -e conf/<env>.etlconf -c conf/workflow_metrics.conf
60+
python scripts/run_workflow.py -e conf/<env>.etlconf -c conf/workflow_unload.conf
5761
```
5862

5963
######To look at UT and Metrics reports:######
6064
* see metrics dataset name in the corresponding .etlconf file
6165

6266
```SQL
63-
-- UT report
64-
SELECT report_starttime, table_id, test_type, field_name
65-
FROM metrics_dataset.report_unit_test
66-
WHERE NOT test_passed
67-
;
6867
-- Metrics - row count
6968
SELECT * FROM metrics_dataset.me_total ORDER BY table_name;
7069
-- Metrics - person and visit summary
@@ -80,13 +79,19 @@ SELECT
8079
FROM metrics_dataset.me_mapping_rate
8180
ORDER BY table_name, concept_field
8281
;
83-
-- Metrics - Top 100 Mapped and Unmapped
82+
-- Metrics - Mapped and Unmapped source values
8483
SELECT
8584
table_name, concept_field, category, source_value, concept_id, concept_name,
8685
count AS row_count,
8786
percent AS rows_percent
8887
FROM metrics_dataset.me_tops_together
8988
ORDER BY table_name, concept_field, category, count DESC;
89+
-- UT report
90+
SELECT report_starttime, table_id, test_type, field_name, test_passed
91+
FROM mimiciv_full_metrics_2023_02_17.report_unit_test
92+
order by table_id, report_starttime
93+
-- WHERE NOT test_passed
94+
;
9095
```
9196

9297
######More option to run ETL parts:######
@@ -104,6 +109,19 @@ ORDER BY table_name, concept_field, category, count DESC;
104109

105110
### Change Log (latest first) ###
106111

112+
**2023-02-17**
113+
114+
* MIMIC 2.2 is issued. Run ETL on MIMIC 2.2.
115+
* minor change to measurement.value_source_value:
116+
* populate the field always instead of populating only when value_as_number is null
117+
* minor change to custom mapping vocabularies:
118+
* mimiciv_drug_ndc,
119+
* mimiciv_drug_route,
120+
* mimiciv_meas_lab_loinc,
121+
* mimiciv_obs_drgcodes,
122+
* mimiciv_proc_itemid
123+
* run with OMOP vocabularies v16-JAN-23
124+
107125
**2022-09-09**
108126

109127
* MIMIC 2.0 is issued: run ETL on MIMIC 2.0.

backup_for_custom_mapping/custom_mapping_list.tsv

Lines changed: 0 additions & 27 deletions
This file was deleted.

0 commit comments

Comments
 (0)