
Deidentification of neurological assessment scores in notes (MIMIC-III, MIMIC-IV) #1796

Open
1 task done
tilmanbeck opened this issue Sep 18, 2024 · 0 comments

Prerequisites

Description

Hi,
in our project we are looking at patients diagnosed with subarachnoid hemorrhage (SAH). Such patients often undergo a neurological assessment that includes grading scores such as Hunt and Hess, WFNS, or the (Modified) Fisher scale. These scores are usually recorded at admission and reported in the discharge summary. Extracting them from free-text notes can be useful for downstream applications.

It seems that the names of these scores are masked in the notes. For example, in MIMIC-III, the TEXT field of the entry with HADM_ID=167857 and CATEGORY="Discharge summary" in NOTEEVENTS.csv.gz has the "Hess" part of "Hunt and Hess" masked. The next score name in the same entry is masked completely, so it cannot be recovered.
In MIMIC-IV, a similar but slightly different phenomenon can be observed. In the text field of the entry with note_id=13317644-DS-20 in mimic-iv-note/2.2/note/discharge.csv.gz, both "Hunt" and "Hess" are masked, whereas "Fisher" is not.
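
To find other affected notes, a rough sketch along these lines might help. It assumes that masked spans look like `[** ... **]` in MIMIC-III and `___` in MIMIC-IV; the function name, the keyword list, and the window size are made up for illustration, and the strings in the usage example are synthetic, not taken from real notes.

```python
import re

# Assumed placeholder formats: "[** ... **]" (MIMIC-III) and "___" (MIMIC-IV).
PLACEHOLDER = r"(?:\[\*\*.*?\*\*\]|_{3,})"

# "Hunt and <mask>", "<mask> and Hess", or "<mask> and <mask>".
MASKED_PAIR = re.compile(
    rf"\bHunt\s+and\s+{PLACEHOLDER}"
    rf"|{PLACEHOLDER}\s+and\s+(?:Hess\b|{PLACEHOLDER})",
    re.IGNORECASE,
)

# Hypothetical keyword list used to keep only hits in a grading context.
CONTEXT = re.compile(
    r"\b(?:grade|scale|score|fisher|wfns|sah|subarachnoid)\b", re.IGNORECASE
)


def find_masked_hunt_hess(text, window=60):
    """Return candidate masked 'Hunt and Hess' mentions found in text."""
    hits = []
    for m in MASKED_PAIR.finditer(text):
        # Only keep the match if a grading keyword appears close to it.
        nearby = text[max(0, m.start() - window): m.end() + window]
        if CONTEXT.search(nearby):
            hits.append(m.group(0))
    return hits


# Synthetic strings for illustration only.
print(find_masked_hunt_hess("SAH, Hunt and [**Last Name (un) 123**] grade 3"))
print(find_masked_hunt_hess("___ and ___ 3, Fisher 4"))
```

This only flags candidates. Two masked names joined by "and" can also be two people, so the hits would still need manual review.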

Could context-specific rules be added to the deidentification algorithm, similar to what was suggested in #1507?
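
As an illustration of what such a rule could look like, here is a minimal sketch. It is not based on the actual MIMIC deidentification code, which I have not seen. It assumes the deidentifier produces candidate name spans as `(start, end)` character offsets and drops the ones that fall inside a known score mention. All names and the pattern itself are hypothetical.

```python
import re

# Hypothetical allowlist: "Hunt and Hess" in any context, and "(modified) Fisher"
# only when directly followed by "grade", "scale", or "score".
SCORE_EPONYMS = re.compile(
    r"\bHunt\s*(?:and|&|-)\s*Hess\b"
    r"|\b(?:modified\s+)?Fisher\b(?=\s+(?:grade|scale|score))",
    re.IGNORECASE,
)


def filter_name_spans(text, name_spans):
    """Drop candidate name spans that lie inside a protected score mention."""
    protected = [m.span() for m in SCORE_EPONYMS.finditer(text)]
    return [
        (start, end)
        for (start, end) in name_spans
        if not any(ps <= start and end <= pe for ps, pe in protected)
    ]


# Synthetic example: "Dr. Hess" should stay masked, the score name should not.
text = "Dr. Hess noted Hunt and Hess grade 3."
candidates = [(4, 8), (15, 19), (24, 28)]  # "Hess", "Hunt", "Hess"
print(filter_name_spans(text, candidates))
```

The lookahead on "Fisher" is there so that a person named Fisher is still masked unless the name is used as a score.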

Thanks a lot for your efforts in maintaining and further developing the MIMIC database. It is a great resource!

Best,
Tilman
