Popular Vote (popV) model for automated cell type annotation of single-cell RNA-seq data. We provide here pretrained models for plug-in use in your own analysis. Follow our tutorial to learn how to use the model for cell type annotation.
Model description
Tabula Sapiens is a benchmark, first-draft human cell atlas of over 1.1M cells from 28 organs of 24 normal human subjects. This work is the product of the Tabula Sapiens Consortium. Sampling multiple organs from the same individual controls for genetic background, age, environment, and epigenetic effects, and allows detailed analysis and comparison of cell types that are shared between tissues.
Link to CELLxGENE: The data can be explored interactively, and the source data downloaded, in the CELLxGENE browser.
Training Code URL: Not provided by uploader.
Metrics
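The tables below report accuracies for each expert (the individual annotation methods) and for the ensemble model (the Consensus Prediction column), broken down by cell type. The validation set accuracies are computed on a random 10% subset of the data that was held out from training.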
Cell Type | N cells | celltypist | knn on bbknn | knn on harmony | knn on scvi | onclass | scanvi | svm | xgboost | Consensus Prediction |
---|---|---|---|---|---|---|---|---|---|---|
CD4-positive, alpha-beta T cell | 754 | 0.76 | 0.85 | 0.85 | 0.79 | 0.00 | 0.74 | 0.74 | 0.78 | 0.86 |
CD4-positive, alpha-beta thymocyte | 441 | 0.70 | 0.47 | 0.76 | 0.66 | 0.00 | 0.65 | 0.66 | 0.70 | 0.76 |
B cell | 450 | 1.00 | 1.00 | 1.00 | 1.00 | 0.00 | 1.00 | 1.00 | 1.00 | 1.00 |
thymic fibroblast type 2 | 377 | 0.87 | 0.89 | 0.87 | 0.91 | 0.00 | 0.89 | 0.90 | 0.89 | 0.92 |
thymic fibroblast type 1 | 343 | 0.87 | 0.86 | 0.86 | 0.90 | 0.00 | 0.89 | 0.89 | 0.88 | 0.90 |
CD8-positive, alpha-beta T cell | 272 | 0.82 | 0.84 | 0.85 | 0.80 | 0.00 | 0.85 | 0.81 | 0.83 | 0.86 |
capillary endothelial cell | 263 | 0.80 | 0.85 | 0.88 | 0.85 | 0.00 | 0.83 | 0.85 | 0.84 | 0.87 |
naive thymus-derived CD4-positive, alpha-beta T cell | 266 | 0.49 | 0.44 | 0.49 | 0.34 | 0.00 | 0.52 | 0.40 | 0.50 | 0.56 |
smooth muscle cell | 228 | 0.96 | 0.97 | 0.97 | 0.98 | 0.00 | 0.99 | 0.98 | 0.98 | 0.98 |
vein endothelial cell | 213 | 0.83 | 0.83 | 0.88 | 0.85 | 0.00 | 0.81 | 0.86 | 0.86 | 0.88 |
plasma cell | 137 | 1.00 | 0.99 | 0.99 | 0.99 | 0.00 | 1.00 | 0.99 | 1.00 | 1.00 |
endothelial cell of artery | 85 | 0.92 | 0.92 | 0.93 | 0.84 | 0.00 | 0.91 | 0.92 | 0.93 | 0.94 |
macrophage | 86 | 0.98 | 0.99 | 1.00 | 0.99 | 0.00 | 0.99 | 0.99 | 1.00 | 1.00 |
CD8-positive, alpha-beta thymocyte | 70 | 0.70 | 0.74 | 0.79 | 0.58 | 0.00 | 0.72 | 0.70 | 0.72 | 0.80 |
thymocyte | 47 | 0.67 | 0.74 | 0.79 | 0.60 | 0.00 | 0.78 | 0.67 | 0.78 | 0.81 |
naive thymus-derived CD8-positive, alpha-beta T cell | 36 | 0.31 | 0.38 | 0.41 | 0.14 | 0.00 | 0.34 | 0.53 | 0.60 | 0.65 |
natural killer cell | 47 | 0.93 | 0.95 | 0.92 | 0.85 | 0.00 | 0.95 | 0.97 | 0.92 | 0.96 |
endothelial cell of lymphatic vessel | 38 | 1.00 | 1.00 | 1.00 | 0.99 | 0.00 | 1.00 | 0.99 | 0.99 | 1.00 |
fibroblast | 25 | 0.47 | 0.20 | 0.19 | 0.51 | 0.00 | 0.87 | 0.81 | 0.76 | 0.73 |
vascular associated smooth muscle cell | 12 | 0.64 | 0.00 | 0.40 | 0.50 | 0.00 | 0.86 | 0.89 | 0.92 | 0.74 |
mesothelial cell | 15 | 0.93 | 0.93 | 0.87 | 0.81 | 0.00 | 0.90 | 0.87 | 0.87 | 0.87 |
erythrocyte | 18 | 0.95 | 0.92 | 0.92 | 0.94 | 0.00 | 0.89 | 0.94 | 0.97 | 0.94 |
neuro-medullary thymic epithelial cell | 14 | 0.96 | 0.96 | 0.96 | 0.96 | 0.00 | 1.00 | 1.00 | 1.00 | 1.00 |
neutrophil | 12 | 0.74 | 0.75 | 0.96 | 0.92 | 0.00 | 0.96 | 0.96 | 0.92 | 0.96 |
hematopoietic precursor cell | 9 | 0.78 | 0.20 | 0.89 | 0.89 | 0.00 | 0.94 | 0.94 | 0.89 | 0.94 |
myo-medullary thymic epithelial cell | 4 | 0.89 | 0.89 | 0.89 | 0.89 | 0.00 | 0.89 | 1.00 | 1.00 | 0.89 |
medullary thymic epithelial cell | 7 | 1.00 | 1.00 | 1.00 | 0.73 | 0.00 | 0.86 | 0.73 | 0.92 | 1.00 |
monocyte | 2 | 0.00 | 0.00 | 1.00 | 1.00 | 0.00 | 0.67 | 0.67 | 1.00 | 1.00 |
mast cell | 2 | 1.00 | 1.00 | 1.00 | 0.67 | 0.00 | 0.40 | 1.00 | 1.00 | 1.00 |
fast muscle cell | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
T follicular helper cell | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
innate lymphoid cell | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
endothelial cell | 0 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 |
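The evaluation described above (hold out a random 10% of cells, then score each cell type separately) can be sketched as follows. This is a minimal illustration, not the code used to produce the table: the function names `split_indices` and `per_type_scores` are hypothetical, and the card does not state the exact formula behind each value. The sketch reports both the fraction of cells of a type that were labelled correctly (recall) and a per-type F1 score, since a value such as 0.67 for a cell type with only 2 validation cells cannot be a simple fraction of correct cells.

```python
import random
from collections import Counter


def split_indices(n_cells, holdout_fraction=0.1, seed=0):
    """Randomly split cell indices into (train, validation) lists.

    holdout_fraction=0.1 mirrors the 10% validation subset; the seed is
    an arbitrary choice made only to keep this sketch reproducible.
    """
    rng = random.Random(seed)
    indices = list(range(n_cells))
    rng.shuffle(indices)
    n_val = round(n_cells * holdout_fraction)
    return sorted(indices[n_val:]), sorted(indices[:n_val])


def per_type_scores(true_labels, predicted_labels):
    """Return {cell_type: (n_cells, recall, f1)} for one expert.

    n_cells is the number of cells whose true label is cell_type.
    Types with no true cells get a recall of 0.0 by convention here.
    """
    n_true = Counter(true_labels)
    n_pred = Counter(predicted_labels)
    n_hit = Counter(t for t, p in zip(true_labels, predicted_labels) if t == p)
    scores = {}
    for cell_type in sorted(set(n_true) | set(n_pred)):
        hits = n_hit[cell_type]
        recall = hits / n_true[cell_type] if n_true[cell_type] else 0.0
        # F1 = 2*TP / (2*TP + FP + FN) = 2*TP / (n_true + n_pred)
        denom = n_true[cell_type] + n_pred[cell_type]
        f1 = 2 * hits / denom if denom else 0.0
        scores[cell_type] = (n_true[cell_type], recall, f1)
    return scores


# Toy example with made-up labels (not taken from the atlas).
true = ["B cell", "B cell", "monocyte", "monocyte"]
pred = ["B cell", "B cell", "monocyte", "B cell"]
for cell_type, (n, recall, f1) in per_type_scores(true, pred).items():
    print(f"{cell_type} | {n} | recall={recall:.2f} | f1={f1:.2f}")
```

In practice the same scoring function would be applied once per expert and once for the consensus labels, using only the cells returned as the validation indices.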
The training accuracies are computed on the same data that was used for training.
Cell Type | N cells | celltypist | knn on bbknn | knn on harmony | knn on scvi | onclass | scanvi | svm | xgboost | Consensus Prediction |
---|---|---|---|---|---|---|---|---|---|---|
CD4-positive, alpha-beta T cell | 6372 | 0.75 | 0.81 | 0.88 | 0.82 | 0.00 | 0.76 | 0.74 | 0.78 | 0.86 |
CD4-positive, alpha-beta thymocyte | 4122 | 0.72 | 0.44 | 0.83 | 0.80 | 0.00 | 0.75 | 0.70 | 0.76 | 0.82 |
B cell | 4012 | 1.00 | 0.99 | 1.00 | 1.00 | 0.00 | 1.00 | 1.00 | 1.00 | 1.00 |
thymic fibroblast type 2 | 3828 | 0.89 | 0.90 | 0.94 | 0.94 | 0.00 | 0.93 | 0.93 | 0.92 | 0.95 |
thymic fibroblast type 1 | 2927 | 0.86 | 0.85 | 0.93 | 0.91 | 0.00 | 0.91 | 0.91 | 0.90 | 0.93 |
CD8-positive, alpha-beta T cell | 2545 | 0.80 | 0.83 | 0.90 | 0.85 | 0.00 | 0.84 | 0.82 | 0.84 | 0.88 |
capillary endothelial cell | 2358 | 0.81 | 0.84 | 0.93 | 0.88 | 0.00 | 0.89 | 0.89 | 0.89 | 0.91 |
naive thymus-derived CD4-positive, alpha-beta T cell | 2257 | 0.49 | 0.41 | 0.65 | 0.58 | 0.00 | 0.56 | 0.50 | 0.56 | 0.65 |
smooth muscle cell | 1997 | 0.95 | 0.94 | 0.99 | 0.98 | 0.00 | 0.99 | 0.99 | 0.99 | 0.99 |
vein endothelial cell | 1835 | 0.84 | 0.81 | 0.92 | 0.88 | 0.00 | 0.87 | 0.88 | 0.89 | 0.91 |
plasma cell | 1441 | 0.99 | 0.99 | 0.99 | 0.99 | 0.00 | 0.99 | 1.00 | 1.00 | 1.00 |
endothelial cell of artery | 688 | 0.90 | 0.92 | 0.96 | 0.86 | 0.00 | 0.92 | 0.97 | 0.96 | 0.96 |
macrophage | 684 | 0.98 | 0.98 | 0.99 | 0.98 | 0.00 | 0.99 | 1.00 | 0.99 | 1.00 |
CD8-positive, alpha-beta thymocyte | 602 | 0.66 | 0.69 | 0.81 | 0.68 | 0.00 | 0.79 | 0.79 | 0.81 | 0.86 |
thymocyte | 408 | 0.68 | 0.62 | 0.80 | 0.68 | 0.00 | 0.84 | 0.85 | 0.88 | 0.91 |
naive thymus-derived CD8-positive, alpha-beta T cell | 398 | 0.32 | 0.41 | 0.66 | 0.33 | 0.00 | 0.59 | 0.69 | 0.74 | 0.76 |
natural killer cell | 381 | 0.92 | 0.92 | 0.94 | 0.90 | 0.00 | 0.92 | 0.96 | 0.96 | 0.96 |
endothelial cell of lymphatic vessel | 333 | 0.97 | 0.97 | 0.98 | 0.96 | 0.00 | 0.98 | 0.99 | 0.98 | 0.99 |
fibroblast | 247 | 0.67 | 0.06 | 0.87 | 0.66 | 0.00 | 0.99 | 0.93 | 0.93 | 0.96 |
vascular associated smooth muscle cell | 229 | 0.63 | 0.08 | 0.92 | 0.85 | 0.00 | 0.98 | 0.97 | 0.98 | 0.98 |
mesothelial cell | 160 | 0.95 | 0.93 | 0.94 | 0.92 | 0.00 | 0.98 | 0.97 | 0.97 | 0.98 |
erythrocyte | 134 | 0.86 | 0.89 | 0.91 | 0.95 | 0.00 | 0.96 | 0.99 | 0.94 | 0.96 |
neuro-medullary thymic epithelial cell | 121 | 0.91 | 0.92 | 0.84 | 0.95 | 0.00 | 0.96 | 1.00 | 0.99 | 0.97 |
neutrophil | 109 | 0.61 | 0.73 | 0.91 | 0.91 | 0.00 | 0.95 | 1.00 | 1.00 | 1.00 |
hematopoietic precursor cell | 83 | 0.74 | 0.05 | 0.90 | 0.90 | 0.00 | 0.97 | 1.00 | 0.99 | 1.00 |
myo-medullary thymic epithelial cell | 73 | 0.85 | 0.82 | 0.97 | 0.93 | 0.00 | 0.98 | 1.00 | 1.00 | 0.97 |
medullary thymic epithelial cell | 36 | 0.84 | 0.76 | 0.78 | 0.72 | 0.00 | 0.97 | 1.00 | 1.00 | 0.97 |
monocyte | 24 | 0.00 | 0.00 | 0.86 | 0.62 | 0.00 | 0.91 | 0.98 | 0.98 | 0.98 |
mast cell | 22 | 0.88 | 0.71 | 0.87 | 0.84 | 0.00 | 0.92 | 0.96 | 1.00 | 0.98 |
fast muscle cell | 21 | 0.00 | 0.00 | 0.98 | 0.86 | 0.00 | 0.98 | 1.00 | 1.00 | 0.98 |
T follicular helper cell | 6 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.27 | 0.80 | 0.86 | 0.60 |
innate lymphoid cell | 2 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.14 | 0.57 | 1.00 | 1.00 |
endothelial cell | 1 | 0.00 | 0.00 | 0.00 | 0.00 | 0.00 | 0.33 | 1.00 | 1.00 | 0.00 |
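The Consensus Prediction column in the tables above combines the individual experts into a single label per cell. As a rough illustration of the "popular vote" idea, the sketch below takes a plain majority vote over the experts' labels for one cell. This is an assumption made for illustration: the function name `consensus_prediction` is hypothetical, and popV's actual consensus procedure may differ (for example, in how ties or related cell types are handled).

```python
from collections import Counter


def consensus_prediction(expert_labels):
    """Return (label, agreement) for one cell by simple majority vote.

    expert_labels maps an expert name to the cell type it predicted.
    agreement is the fraction of experts that voted for the winning label.
    Ties go to the label that appears first among the experts, which is
    an arbitrary choice for this sketch.
    """
    votes = Counter(expert_labels.values())
    label, n_votes = votes.most_common(1)[0]
    return label, n_votes / len(expert_labels)


# Made-up votes for a single cell (not taken from the atlas).
votes_for_cell = {
    "celltypist": "B cell",
    "knn on scvi": "B cell",
    "scanvi": "plasma cell",
    "svm": "B cell",
    "xgboost": "plasma cell",
}
label, agreement = consensus_prediction(votes_for_cell)
print(label, agreement)
```

A low agreement value flags cells on which the experts disagree, which may be worth inspecting manually before trusting the consensus label.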
References
The Tabula Sapiens Consortium. Tabula Sapiens reveals transcription factor expression, senescence effects, and sex-specific features in cell types from 28 human organs and tissues. bioRxiv. https://doi.org/10.1101/2024.12.03.626516