颜林林 (2023-08-30 08:09):
#paper doi:10.1016/j.crmeth.2023.100547. Cell Reports Methods, 2023, An introduction to representation learning for single-cell data analysis. 机器学习方法的效果常依赖于数据质量,也与所选择的特征(即数据的表示方法)有关,而表示学习(representation learning)能够通过模型自身去学习数据的表示,这在有足够数据的情况下是非常适合的。单细胞测序数据分析正好是这样一个场景。本文综述了单细胞测序数据分析各个环节(包括数据预处理、超参数优化、下游分析、生物学验证等)中,表示学习方法的应用及应注意的关键点。
IF:4.300Q2 Cell reports methods, 2023-08-28. DOI: 10.1016/j.crmeth.2023.100547 PMID: 37671013 PMCID:PMC10475795
An introduction to representation learning for single-cell data analysis
翻译
Abstract:
Single-cell-resolved systems biology methods, including omics- and imaging-based measurement modalities, generate a wealth of high-dimensional data characterizing the heterogeneity of cell populations. Representation learning methods are routinely used to analyze these complex, high-dimensional data by projecting them into lower-dimensional embeddings. This facilitates the interpretation and interrogation of the structures, dynamics, and regulation of cell heterogeneity. Reflecting their central role in analyzing diverse single-cell data types, a myriad of representation learning methods exist, with new approaches continually emerging. Here, we contrast general features of representation learning methods spanning statistical, manifold learning, and neural network approaches. We consider key steps involved in representation learning with single-cell data, including data pre-processing, hyperparameter optimization, downstream analysis, and biological validation. Interdependencies and contingencies linking these steps are also highlighted. This overview is intended to guide researchers in the selection, application, and optimization of representation learning strategies for current and future single-cell research applications.
翻译
回到顶部