GitHub

A Taxonomy for Advancing Systematic Error Analysis in Multi-site Electronic Health Record-based Clinical Concept Extraction

Welcome to our Error Taxonomy project! The finalized taxonomy is available in both .dtd and .owl formats. It is also compatible with several different open-source annotation tools, including MAE, Brat, and MedTator.

Full Taxonomy: https://github.com/OHNLP/ErrorAnalysis/tree/main/Taxonomy

Online demo: https://medtator.ohnlp.org/.

Select: Sample > Error Analysis of NLP System Results > Parse

Project Development Overview

We have iteratively developed and evaluated an error taxonomy based on existing literature, standards, real-world data, multisite case evaluations, and community feedback.

MedTator Error Annotation Module

Here's an example of our error annotation analytical module in MedTator. This module provides various features:

Error Summary Tab: Post-error summary statistics.
Error Distribution: Visualizes the distribution of errors.
Tag Distribution: t-SNE visualization of error textual cosine similarity.
Error List: Displays a list of errors.

This visual analytics module helps break down the number of false positive and negative cases, followed by error dimensions (annotation, contextual, linguistic, and logic), and each unique error type and specific concept.

Collaboration Sites

Mayo Clinic, Rochester, Minnesota University of Texas Health Science Center at Houston, Houston, Texas University of Massachusetts Chan Medical School, Boston, Massachusetts University of Pittsburgh, Pittsburgh, Pennsylvania University of Minnesota, Minneapolis, Minnesota Yale University, New Haven, Connecticut

Reference

Sunyang Fu, Liwei Wang, Huan He, Andrew Wen, Nansu Zong, Anamika Kumari, Feifan Liu, Sicheng Zhou, Rui Zhang, Chenyu Li, Yanshan Wang, Jennifer St Sauver, Hongfang Liu, Sunghwan Sohn, A taxonomy for advancing systematic error analysis in multi-site electronic health record-based clinical concept extraction, Journal of the American Medical Informatics Association, 2024;, ocae101, https://doi.org/10.1093/jamia/ocae101

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
DTD		DTD
OBO		OBO
OWL		OWL
OWX		OWX
Taxonomy		Taxonomy
images		images
.DS_Store		.DS_Store
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

A Taxonomy for Advancing Systematic Error Analysis in Multi-site Electronic Health Record-based Clinical Concept Extraction

Project Development Overview

MedTator Error Annotation Module

Collaboration Sites

Reference

About

Releases

Packages

OHNLP/ErrorAnalysis

Folders and files

Latest commit

History

Repository files navigation

A Taxonomy for Advancing Systematic Error Analysis in Multi-site Electronic Health Record-based Clinical Concept Extraction

Project Development Overview

MedTator Error Annotation Module

Collaboration Sites

Reference

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Packages