-
My understanding is that Synthea runs each patient through all modules, and that most (maybe all) modules are probabilistic, which means each patient has some chance to "pass unscathed" by a module. That is, not ever have the condition of the module. Is it possible to obtain the complete list of all patients who had records produced by each module? For example, if module X gives each patient a 2% chance of being diagnosed with a given cancer, is it possible to obtain the list of those 2% somewhere? Without checking for the presence of the diagnoses codes/lab tests/synthetic patient data itself. I'm looking for some sort of external, authoritative, "ground truth" about the synthetic patients. Thanks! |
Beta Was this translation helpful? Give feedback.
Replies: 4 comments 6 replies
-
I guess my question could also be phrased as: Is it possible to establish (with certainty) from which state, of which module, came each row of data in the CSVs that Sythea produces? Thanks! |
Beta Was this translation helpful? Give feedback.
-
I realized the JSON export page (https://github.com/synthetichealth/synthea/wiki/JSON-Export) describes an option to export each patient's path through each module. Just what I was looking for. |
Beta Was this translation helpful? Give feedback.
-
The Json export record probably does have the module information directly. How do you tell which record or which patient is a result of which module? That's your original question, right? |
Beta Was this translation helpful? Give feedback.
-
You're right, lhs-open. I was just able to export each patient's module history (using option So I guess the correct answer to my question is: unfortunately no :/ This would be a great feature to have for purposes of learning electronic patient phenotyping. The module history is like the "answer key" to the exercise. The exercise is to identify patients with a given condition by querying the available patient data. I might be able to make do with just the list of states from each module, but here's my +1 for this feature to get implemented at some point: provide an option that lets you know, for each record in the output, exactly which state of which module produced it. Thanks for all. |
Beta Was this translation helpful? Give feedback.
You're right, lhs-open.
I was just able to export each patient's module history (using option
exporter.json.include_module_history
). I see that it gives you the list of states the patient went through, but not what records in the output were produced from that state (which is what I had asked for).So I guess the correct answer to my question is: unfortunately no :/
This would be a great feature to have for purposes of learning electronic patient phenotyping. The module history is like the "answer key" to the exercise. The exercise is to identify patients with a given condition by querying the available patient data.
I might be able to make do with just the list of states from each module…