Tissue heterogeneity is supported by single-cell sequencing technology, and spatial transcriptomics provides an important technological platform to study the problem, continuous changes in cell state, and microenvironment interactions. Yet the high dimensionality and sparsity of data, batch effects, and spatial point mixing may confuse model inferences. In this review, we discuss recent progress of interpretable model and method developments in single-cell and spatial omics analysis with biological applications, especially for the deep generative model single-cell Variational Inference (scVI) as well as its multimodal extensions total Variational Inference (totalVI) and Multimodal Variational Inference (MultiVI); the spatial mapping method Tangram; and the intercellular communication inference frameworks NicheNet and CellChat. We additionally discuss how pre-trained base models like single-cell Bidirectional Encoder Representations from Transformers (scBERT) and single-cell Generative Pre-trained Transformer (scGPT) can be used for cross-dataset transfer, and the problems this poses to interpretability. This work highlights the construction of an evidential chain centered around “gene/pathway – spatial domain – communication network”, and indicates that spatial colocalization, multimodal consistency, and uncertainty assessment can enhance the confidence and repeatability of mechanistic inference, hence providing a testable hypothesis-generating platform for tumor immunology studies.
References
[1] Chen, C., Liu, X., Zhou, M., Li, Z., Du, Z., Lin, Y. (2025) Lightweight and real-time driver fatigue detection based on MG-YOLOv8 with facial multi-feature fusion. Journal of Imaging, 11(11), 385.
[2] Yamamoto, T., Cockburn, K., Greco, V., Kawaguchi, K. (2022) Probing the rules of cell coordination in live tissues by interpretable machine learning based on graph neural networks. PLOS Computational Biology, 18(9), e1010477.
[3] Guan, J., Cheng, H. Y., Wu, Y. P., Tian, C., Qi, J. Y. (2025) Multi-target tracking for star sensor based on CenterTrack deep learning model. Scientific Reports, 15(1), 37125.
[4] Dohmen, J., Baranovskii, A., Ronen, J., Uyar, B., Franke, V., Akalin, A. (2022) Identifying tumor cells at the single-cell level using machine learning. Genome Biology, 23(1), 123.
[5] Kim, Y. S., Choi, J., Lee, S. H. (2023) Single-cell and spatial sequencing application in pathology. Journal of Pathology and Translational Medicine, 57(1), 43-51.
[6] Zhi, Y., Wang, Q., Zi, M., Zhang, S., Ge, J., Liu, K., Lu, L., Fan, C., Yan, Q., Shi, L., Chen, P., Fan, S., Liao, Q., Guo, C., Wang, F., Gong, Z., Xiong, W., Zeng, Z. (2024) Spatial transcriptomic and metabolomic landscapes of oral submucous fibrosis‐derived Oral squamous cell carcinoma and its tumor microenvironment. Advanced Science, 11(12), 2306515.
[7] Lopez, R., Regier, J., Cole, M. B., Jordan, M. I., Yosef, N. (2018) Deep generative modeling for single-cell transcriptomics. Nature Methods, 15(12), 1053-1058.
[8] Zhao, M., Huang, H., He, F., Fu, X. (2023) Current insights into the hepatic microenvironment and advances in immunotherapy for hepatocellular carcinoma. Frontiers in Immunology, 14, 1188277.
[9] Ning, M., Lu, D., Liang, D., Ren, P. G. (2025) Single-cell RNA sequencing advances in revealing the development and progression of MASH: the identifications and interactions of non-parenchymal cells. Frontiers in Molecular Biosciences, 12, 1513993.
[10] Gayoso, A., Steier, Z., Lopez, R., Regier, J., Nazor, K. L., Streets, A., Yosef, N. (2021) Joint probabilistic modeling of single-cell multi-omic data with totalVI. Nature Methods, 18(3), 272-282.
[11] Ashuach, T., Gabitto, M. I., Koodli, R. V., Saldi, G. A., Jordan, M. I., Yosef, N. (2023) MultiVI: deep generative model for the integration of multimodal data. Nature Methods, 20(8), 1222-1231.
[12] Argelaguet, R., Arnol, D., Bredikhin, D., Deloro, Y., Velten, B., Marioni, J. C., Stegle, O. (2020) MOFA+: a statistical framework for comprehensive integration of multi-modal single-cell data. Genome Biology, 21(1), 111.
[13] Hackenberg, M., Brunn, N., Vogel, T., Binder, H. (2025) Infusing structural assumptions into dimensionality reduction for single-cell RNA sequencing data to identify small gene sets. Communications Biology, 8(1), 414.
[14] de Lucio Delgado, A., Villegas Rubio, J. A., Riaño-Galán, I., Pérez Gordón, J. (2023) Effect of the use of gnrh analogs in low-grade cerebral glioma. Children, 10(1), 115.
[15] Qian, C., Xin, Y., Qi, C., Wang, H., Dong, B. C., Zack, D. J., Blackshaw, S., Hattar, S., Zhou, F. Q., Qian, J. (2024) Intercellular communication atlas reveals Oprm1 as a neuroprotective factor for retinal ganglion cells. Nature Communications, 15(1), 2206.
[16] Dixit, A., Parnas, O., Li, B., Chen, J., Fulco, C. P., Jerby-Arnon, L., Marjanovic, N. D., Dionne, D., Burks, T., Raychowdhury, R., Adamson, B., Norman, T. M., Lander, E. S., Weissman, J. S., Friedman, N., Regev, A. (2016) Perturb-seq: Dissecting molecular circuits with scalable single-cell RNA profiling of pooled genetic screens. Cell, 167(7), 1853-1866.
[17] Biancalani, T., Scalia, G., Buffoni, L., Avasthi, R., Lu, Z., Sanger, A., Tokcan, N., Vanderburg, C. R., Segerstolpe, Å., Zhang, M., Avraham-Davidi, I., Vickovic, S., Nitzan, M., Ma, S., Subramanian, A., Lipinski, M., Buenrostro, J., Brown, N. B., Fanelli, D., Zhuang, X., Macosko, E. Z., Regev, A. (2021) Deep learning and alignment of spatially resolved single-cell transcriptomes with Tangram. Nature Methods, 18(11), 1352-1362.
[18] Browaeys, R., Saelens, W., Saeys, Y. (2020) NicheNet: modeling intercellular communication by linking ligands to target genes. Nature Methods, 17(2), 159-162.
[19] Jin, S., Guerrero-Juarez, C. F., Zhang, L., Chang, I., Ramos, R., Kuan, C. H., Myung, P., Plikus, M. V., Nie, Q. (2021) Inference and analysis of cell-cell communication using CellChat. Nature Communications, 12(1), 1088.
[20] Yang, F., Wang, W., Wang, F., Fang, Y., Tang, D., Huang, J., Lu, H., Yao, J. (2022) scBERT as a large-scale pretrained deep language model for cell type annotation of single-cell RNA-seq data. Nature Machine Intelligence, 4(10), 852-866.
[21] Cui, H., Wang, C., Maan, H., Pang, K., Luo, F., Duan, N., Wang, B. (2024) scGPT: toward building a foundation model for single-cell multi-omics using generative AI. Nature Methods, 21(8), 1470-1480.
[22] Chen, J., Xu, H., Tao, W., Chen, Z., Zhao, Y., Han, J. D. J. (2023) Transformer for one stop interpretable cell type annotation. Nature Communications, 14(1), 223.
[23] Bhattacharyya, S., Ehsan, S. F., Karacosta, L. G. (2023) Phenotypic maps for precision medicine: a promising systems biology tool for assessing therapy response and resistance at a personalized level. Frontiers in Network Physiology, 3, 1256104.
[24] Hu, Y., Lin, Z., Xie, M., Yuan, W., Li, Y., Rao, M., Liu, Y. H., Shen, W., Zhang, L., Zhou, X. M. (2025) MaskGraphene: an advanced framework for interpretable joint representation for multi-slice, multi-condition spatial transcriptomics. Genome Biology, 26(1), 1-45.
[25] Hao, G., Fan, Y., Yu, Z., Su, Y., Zhu, H., Wang, F., Chen, X., Yang, Y., Wang, G., Wong, K. C., Li, X. (2025) Topological identification and interpretation for single-cell epigenetic regulation elucidation in multi-tasks using scAGDE. Nature Communications, 16(1), 1691.
Share and Cite
Ge, L. (2025) Advances in Explainable Models for Single-cell Transcriptomics and Spatial Transcriptomics Data Analysis. Scientific Research Bulletin, 2(6), 57-62.
https://doi.org/10.71052/srb2024/YUAG4486
