来自杂志 Nature Methods 的文献。
当前共找到 4 篇文献分享。
1.
徐炳祥 (2026-02-28 20:05):
#paper doi: 10.1038/s41592-023-02139-9 nature methods, 2024, A fast, scalable and versatile tool for analysis of single-cell omics data。本文介绍了一种端到端的单细胞数据分析工具snapATAC2,其核心创新在于提出了一种不基于距离的谱嵌入算法,通过Lanczos方法隐式计算拉普拉斯矩阵的特征向量,彻底避免了传统谱嵌入需要构建细胞-细胞相似性矩阵的内存瓶颈,从而实现了与细胞数量线性的时间和空间复杂度。该算法在大量合成与真实数据集上展现出卓越性能且天然支持scRNA-seq、scHi-C、单细胞DNA甲基化等单细胞多组学数据的联合嵌入,在准确性、鲁棒性和可扩展性上全面超越现有线性/非线性方法,且无需GPU、无需繁琐的超参数调优。
Abstract:
AbstractSingle-cell omics technologies have revolutionized the study of gene regulation in complex tissues. A major computational challenge in analyzing these datasets is to project the large-scale and high-dimensional data into … >>>
AbstractSingle-cell omics technologies have revolutionized the study of gene regulation in complex tissues. A major computational challenge in analyzing these datasets is to project the large-scale and high-dimensional data into low-dimensional space while retaining the relative relationships between cells. This low dimension embedding is necessary to decompose cellular heterogeneity and reconstruct cell-type-specific gene regulatory programs. Traditional dimensionality reduction techniques, however, face challenges in computational efficiency and in comprehensively addressing cellular diversity across varied molecular modalities. Here we introduce a nonlinear dimensionality reduction algorithm, embodied in the Python package SnapATAC2, which not only achieves a more precise capture of single-cell omics data heterogeneities but also ensures efficient runtime and memory usage, scaling linearly with the number of cells. Our algorithm demonstrates exceptional performance, scalability and versatility across diverse single-cell omics datasets, including single-cell assay for transposase-accessible chromatin using sequencing, single-cell RNA sequencing, single-cell Hi-C and single-cell multi-omics datasets, underscoring its utility in advancing single-cell analysis. <<<
翻译
2.
徐炳祥 (2026-01-31 21:09):
#paper doi: 10.1038/s41592-018-0033-z Nature methods, 2018, SAVER: gene expression recovery for single-cell RNA sequencing。本文是单细胞RNA-seq缺失值插补方面的经典论文之一。作者给出了一种能借助单细胞文库中其他细胞的基因表达水平填补单个细胞数据缺失的算法。算法利用泊松分布建模基因表达计数,用Gamma分布对其均值建模。利用细胞间基因表达水平的相互回归估计此Gamma分布的均值。通过假定变异率在细胞群体中恒定来估计Gamma分布的形状参数。最终实现确实表达水平的填补和测得表达计数的纠偏。在模拟和真实单细胞RNA-seq数据集中算法性能均得到了验证。本文为单细胞数据的缺失值插补提供了一个可行的理论框架,是后续众多研究的基础。
3.
徐炳祥 (2025-07-31 20:32):
#paper doi: 10.1038/nmeth.3329 Nature Methods, 2015, Identification of active transcriptional regulatory elements from GRORO-seq data。哺乳动物基因组上存在大量非编码的活跃的双向转录区域,这些区域往往发挥调控元件(如增强子)的作用,因而成为表观遗传研究的重要对象。GRO-seq/PRO-seq是测定这些转录调控元件(TRE)的最有效高通量手段。这篇旧文报道了一种称为dREG的有监督学习算法,可在GRO-seq数据中系统性检测TRE。预测是利用SVM模型学习GRO-seq在基因启动子位点处的信号特征进而外推至整个基因组实现的。通过与DNase-seq和多种组蛋白修饰位点的联合分析,本文将基因组上的顺式调控元件大致分为活跃转录的调控元件、开放但不转录的调控元件、有增强子特征但不开放的调控元件和绝缘子四个大类。通过每类中特征性的转录因子结合位点富集情况和eQTL/GWAS数据的富集验证了这些结果的可靠性。本文虽旧,但对理解顺式作用元件的组成与功能有重要参考意义。PRO-seq/GRO-seq虽然操作繁琐、对实验员技术和耐心要求很高,但也是探测新生转录本和活跃调控元件的金标准,值得额外关注。
4.
白鸟 (2024-10-31 23:04):
#paper DOI: 10.1038/nmeth.3364,MiXCR: software for comprehensive adaptive immunity profiling. 2015年,milaboratory推出了MiXCR软件,MiXCR是二代测序TCR/BCR免疫组库分析软件,能有效处理双端和单端测序,考虑序列质量,纠正PCR错误并识别种系超突变。 (1)MiXCR比对:将测序读数与T细胞或B细胞受体的V、D、J 和 C基因进行比对; (2)MiXCR组装:利用上一步获得的比对结果组装成克隆型(以提取特定基因区域,如 CDR3); (3)结果导出和绘图:将比对结果或克隆型结果导出和绘图; MiXCR软件分学术版本和商业版本,软件封装得很好,几乎为所有商业/通用试剂盒开发了定制预配置的分析流程,单命令即可完成操作。
回到顶部