We provide preprocessed versions of public data sets used to evaluate scNym below.
All datasets are formatted following the AnnData
conventions.
- hPBMS Stimulated/Control -- We used the authors' annotations in the
.obs["cell"]
attribute. We split experimental conditions based on the authors annotations in the.obs["stim"]
attribute. We obtained source data from GEO at accession GSE96583.
- Tabula Muris 10x -- We used the authors' annotations in the
.obs["cell_ontology_class"]
attribute. We obtained source data from the authors' website. - Tabula Muris Smart-seq2 -- We used the authors' annotations in the
.obs["cell_ontology_class"]
attribute. We obtained source data from the authors' website. - Mouse Cell Atlas Lung data with manual annotations -- We added manual annotations in the
.obs["cell_ontology_class"]
attribute. We obtained source data from the authors' website. - 10x Visium Spatial Transcriptomics of Mouse Brain Section 0 -- We added manual annotations in the
.obs["cell_ontology_class"]
attribute. We obtained source data from 10x Genomics. - 10x Visium Spatial Transcriptomics of Mouse Brain Section 1 -- We added manual annotations in the
.obs["cell_ontology_class"]
attribute. We obtained source data from 10x Genomics. - Cross-Technology comparison in the Mouse Cortex -- We used the authors' annotations in the
"CellType"
attribute. We obtained source data from the Broad Institute Single Cell Portal. - Cross-Technology comparison in the Mouse Kidney -- We mapped the authors' annotations into the "cell ontology" namespace and stored these annotations in the
"cell_ontology_class"
attribute. We removed cells with ambiguous cell types under the cell ontology (2310 cells total). We identified unique experimental domains using the"Protocol"
attribute. We obtained data from GEO at GSE141115.
- Rat Aging Cell Atlas -- We translated the authors' annotations into the "cell ontology class" namespace and stored these annotations in the
.obs["cell_ontology_class"]
attribute. We obtained source data from GEO at accession GSE137869.