少颖-focus reverse aging
(2025-07-05 05:54):
#paper doi: https://doi.org/10.1101/2025.06.11.659105
标题:X-Atlas/Orion: Genome-wide Perturb-seq Datasets via a Scalable Fix-Cryopreserve Platform for Training Dose-Dependent Biological Foundation Models
发表年份:2025年
总结:目前虚拟细胞的技术因为数据集的发表得到了比较大的提升,解决了大约40-50%的核心数据问题。X-Atlas/Orion 数据集的构建大约18-25人参与,其中有斯坦福大学教授的学生和谷歌公司的高管, 也有参与过药物研发整个流程的人,开发这个数据集的公司里面大佬云集,有斯坦福大学前教授,也有诺奖获得者,也有强生公司前CEO,阵容堪称世界顶尖。感悟:虚拟细胞的设计是顶尖科学家做的事情,所以这件事情会让人很有成就感。我可以参与,但是需要做好投入大量时间的准备。
公众号文章有更详细解读和分析:https://mp.weixin.qq.com/s/evVxdkRds8ZCbXVgnlmWAg
bioRxiv,
2025-6-16.
DOI: 10.1101/2025.06.11.659105
X-Atlas/Orion: Genome-wide Perturb-seq Datasets via a Scalable Fix-Cryopreserve Platform for Training Dose-Dependent Biological Foundation Models
翻译
Abstract:
AbstractThe rapid expansion of massively parallel sequencing technologies has enabled the development of foundation models to uncover novel biological findings. While these have the potential to significantly accelerate scientific discoveries by creating AI-driven virtual cell models, their progress has been greatly limited by the lack of large-scale high-quality perturbation data, which remains constrained due to scalability bottlenecks and assay variability. Here, we introduce “Fix-Cryopreserve-ScRNAseq” (FiCS) Perturb-seq, an industrialized platform for scalable Perturb-seq data generation. We demonstrate that FiCS Perturb-seq exhibits high sensitivity and low batch effects, effectively capturing perturbation-induced transcriptomic changes and recapitulating known biological pathways and protein complexes. In addition, we release X-Atlas: Orion edition (X-Atlas/Orion), the largest publicly available Perturb-seq atlas. This atlas, generated from two genome-wide FiCS Perturb-seq experiments targeting all human protein-coding genes, comprises eight million cells deeply sequenced to over 16,000 unique molecular identifiers (UMIs) per cell. Furthermore, we show that single guide RNA (sgRNA) abundance can serve as a proxy for gene knockdown (KD) efficacy. Leveraging the deep sequencing and substantial cell numbers per perturbation, we also show that stratification by sgRNA expression can reveal dose-dependent genetic effects. Taken together, we demonstrate that FiCS Perturb-seq is an efficient and scalable platform for high-throughput Perturb-seq screens. Through the release of X-Atlas/Orion, we highlight the potential of FiCS Perturb-seq to address current scalability and variability challenges in data generation, advance foundation model development that incorporates gene-dosage effects, and accelerate biological discoveries.
翻译
Related Links: