文献收藏与分享平台

61.

颜林林 (2022-08-12 07:42):

#paper doi:10.1016/j.ccell.2022.07.002 Cancer Cell, 2022, Integrative analysis of drug response and clinical outcome in acute myeloid leukemia. 这是一项关于AML（急性骨髓性白血病）的长达10年的真实世界临床研究，收集了来自多个中心的 805 名患者（942 个样本），对样本进行基因组和转录组的测序，同时使用离体细胞培养进行药物反应实验，此外还利用NLP技术整理和分析患者的病历数据。在数据分析方面，使用反卷积方法，通过转录组数据推断出样本的细胞类群组成，并结合临床信息和组学数据分析结果，识别出影响药物响应情况的因素（如年龄、基因表达、细胞分化状态等）。所建立的模型，揭示了单个基因 PEAR1 是患者生存的最强预测因子之一。所形成的数据集，也提供了一个在线交互式网站进行分析展示。分析方面基本都是很多生信数据挖掘类文章的常见套路，并没有特别新颖之处，但得益于长时间积累的队列及其完整的临床信息，作为一个重要的数据集资源，以及单病种的真实世界研究实例，也还是很有价值的。此外，关于药物响应的细胞实验部分相对独立，与患者预后进行关联解释并不容易，大概也是为了提升文章份量而加入的。

IF:48.800Q1 Cancer cell, 2022-08-08. DOI: 10.1016/j.ccell.2022.07.002 PMID: 35868306

Integrative analysis of drug response and clinical outcome in acute myeloid leukemia

翻译

Daniel Bottomly, Nicola Long, Anna Reister Schultz, Stephen E Kurtz, Cristina E Tognon, Kara Johnson, Melissa Abel, Anupriya Agarwal, Sammantha Avaylon, Erik Benton, ... >>>

Abstract:

Acute myeloid leukemia (AML) is a cancer of myeloid-lineage cells with limited therapeutic options. We previously combined ex vivo drug sensitivity with genomic, transcriptomic, and clinical annotations for a large … >>>

翻译

62.

颜林林 (2022-08-08 07:54):

#paper doi:10.1038/s41596-022-00728-0 Nature Protocols, 2022, I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction. 目前，关于蛋白质结构预测的工具，大多都只能处理单结构域蛋白。然而，自然界中广泛存在的蛋白质，更多是具有多个结构域的，各结构域之间会协同发挥功能，因此亟需开发对这类蛋白质进行结构及功能预测的算法工具。本文提供了一个流程，名为I-TASSER-MTD，用于多结构域蛋白质的结构与功能预测。通过整合如下步骤：基于序列分析结构域（sequence-based domain parsing）、单结构域结构折叠（single-domain structure folding）、结构域之间的结构组装（inter-domain structure assembly）、基于结构的功能注释（structure-based function annotation），并且在各个步骤中都引入了深度学习，以及整合其他诸如蛋白质交联、冷冻电镜等实验数据，来提升相应的准确度，从而提高整体的蛋白质结构功能预测效果，并最终封装成为一套全自动的分析流程。

IF:13.100Q1 Nature protocols, 2022-10. DOI: 10.1038/s41596-022-00728-0 PMID: 35931779

I-TASSER-MTD: a deep-learning-based platform for multi-domain protein structure and function prediction

翻译

Xiaogen Zhou, Wei Zheng, Yang Li, Robin Pearce, Chengxin Zhang, Eric W Bell, Guijun Zhang, Yang Zhang

Abstract:

Most proteins in cells are composed of multiple folding units (or domains) to perform complex functions in a cooperative manner. Relative to the rapid progress in single-domain structure prediction, there … >>>

翻译

63.

颜林林 (2022-08-05 21:59):

#paper doi:10.1038/s41586-022-05028-x Nature, 2022, A physical wiring diagram for the human immune system. 本文开发了一种名为SAVEXIS（scalable arrayed multi-valent extracellular interaction screen）的方法，高通量地筛选存在相互作用关系的细胞表面蛋白对，并用多种实验方法、文献支持、单细胞数据等来对所发现的结果进行验证，得到一套高质量的免疫细胞相互作用的连接关系图谱。

IF:50.500Q1 Nature, 2022-08. DOI: 10.1038/s41586-022-05028-x PMID: 35922511

A physical wiring diagram for the human immune system

翻译

Jarrod Shilts, Yannik Severin, Francis Galaway, Nicole Müller-Sienerth, Zheng-Shan Chong, Sophie Pritchard, Sarah Teichmann, Roser Vento-Tormo, Berend Snijder, Gavin J Wright

Abstract:

The human immune system is composed of a distributed network of cells circulating throughout the body, which must dynamically form physical associations and communicate using interactions between their cell-surface proteomes. … >>>

翻译

64.

颜林林 (2022-08-04 23:48):

#paper doi:10.1016/j.cell.2022.06.036 Cell, 2022, A cross-disorder dosage sensitivity map of the human genome. 作为专业背景是生信的我，经常会思考，纯计算的文章究竟能发到多好的杂志上，是否也有机会能刷刷顶刊主刊。或者换个说法，从码农转职而来的、没啥经费支持的研究人员，只凭借一台电脑（及其背后的互联网），是否也可以做出“顶级”生物学研究？之所以有此不自信，主要还是太多来自传统研究学者及其遵循的研究范式所提出的质疑，大家普遍认为“纯计算”本身不可信，总需要有“自己产出的生物数据”才算是可信和有意义的。然而，这篇登上《Cell》杂志的文章，却真是这样一个“纯计算”的案例。固然它是有Harvard和Broad institute的招牌加持，然而，其整合的来自17个数据源的基因组数据，都来自既往其他研究，涉及54种疾病，近百万例入组受试，重新分析并人工核对了罕见CNV突变，以及这些CNV在相应疾病背景下，对它们经由剂量效应而造成的表型影响，进行了评估。文章整合得到的数据，以及相应的分析方法及产出结果，其质量都并不逊色于大多数“直接产出生物数据”的工作。此外，文章的图表（包括补充材料的图表，比如Fig.S3）也都挺赏心悦目的。

IF:45.500Q1 Cell, 2022-08-04. DOI: 10.1016/j.cell.2022.06.036 PMID: 35917817

A cross-disorder dosage sensitivity map of the human genome

翻译

Ryan L Collins, Joseph T Glessner, Eleonora Porcu, Maarja Lepamets, Rhonda Brandon, Christopher Lauricella, Lide Han, Theodore Morley, Lisa-Marie Niestroj, Jacob Ulirsch, ... >>>

Abstract:

Rare copy-number variants (rCNVs) include deletions and duplications that occur infrequently in the global human population and can confer substantial risk for disease. In this study, we aimed to quantify … >>>

翻译

65.

颜林林 (2022-08-03 00:15):

#paper doi:10.1016/j.molmet.2022.101556 Molecular Metabolism, Tryptophan Metabolism is a Physiological Integrator Regulating Circadian Rhythms. 这是个关于昼夜节律生物钟的研究，比较经典的基于动物模型的生理学研究实验。通过对小鼠进行特定光照条件（12小时昼夜平分，或24小时全黑暗环境）饲养，使其适应该节律。之后通过控制饮食，减少或去除必需氨基酸的摄入，再改变特定光照条件，研究其对节律恢复的影响。根据小鼠的活动时间记录，研究其所表现出的节律，即生物钟调节结果。通过采集小鼠血液、肝脏组织等样本，进行质谱、液相色谱、转录组测序等检测。最终证明色氨酸代谢是关键的昼夜节律调节剂，其代谢受到光调控，并影响小鼠的昼夜节律调节。

IF:7.000Q1 Molecular metabolism, 2022-10. DOI: 10.1016/j.molmet.2022.101556 PMID: 35914650

Tryptophan metabolism is a physiological integrator regulating circadian rhythms

翻译

Abstract:

OBJECTIVE: The circadian clock aligns physiology with the 24-hour rotation of Earth. Light and food are the main environmental cues (zeitgebers) regulating circadian rhythms in mammals. Yet, little is known … >>>

翻译

66.

颜林林 (2022-08-02 23:38):

#paper doi:10.1101/2020.02.16.951657 bioRxiv, 2022, APA-Scan: Detection and Visualization of 3'-UTR Alternative Polyadenylation with RNA-seq and 3'-end-seq Data. 在真核生物中存在一种名为APA（可变的多聚腺苷酸）的机制，通过形成不同的可变剪接，使表达的基因的3'-UTR区域携带不同长度的poly-A（多聚腺苷酸）序列，从而实现精细调控基因表达（包括降解等）。本文开发了一个计算工具APA-Scan，能够基于RNA-seq数据，分析并充分考虑其相关区域的测序深度信息，鉴定APA事件，给出相应注释，并提供图形化展示，弥补了过去其他工具方法在这方面的缺失和不足。本文还通过对模拟数据和两个实际公共数据集（DaPars和APAtrap）进行分析评测，并使用qPCR实验进行了验证。

bioRxiv, 2020. DOI: 10.1101/2020.02.16.951657

APA-Scan: Detection and Visualization of 3’-UTR Alternative Polyadenylation with RNA-seq and 3’-end-seq Data

翻译

Naima Ahmed Fahmi , Khandakar Tanvir Ahmed , Jae-Woong Chang , Heba Nassereddeen , Deliang Fan , Jeongsik Yong , Wei Zhang

Abstract:

BackgroundThe eukaryotic genome is capable of producing multiple isoforms from a gene by alternative polyadenylation (APA) during pre-mRNA processing. APA in the 3’-untranslated region (3’-UTR) of mRNA produces transcripts with … >>>

BackgroundThe eukaryotic genome is capable of producing multiple isoforms from a gene by alternative polyadenylation (APA) during pre-mRNA processing. APA in the 3’-untranslated region (3’-UTR) of mRNA produces transcripts with shorter or longer 3’-UTR. Often, 3’-UTR serves as a binding platform for microRNAs and RNA-binding proteins, which affect the fate of the mRNA transcript. Thus, 3’-UTR APA is known to modulate translation and provides a mean to regulate gene expression at the post-transcriptional level. Current bioinformatics pipelines have limited capability in profiling 3’-UTR APA events due to incomplete annotations and a low-resolution analyzing power: widely available bioinformatics pipelines do not reference actionable polyadenylation (cleavage) sites but simulate 3’-UTR APA only using RNA-seq read coverage, causing false positive identifications. To overcome these limitations, we developed APA-Scan, a robust program that identifies 3’-UTR APA events and visualizes the RNA-seq short-read coverage with gene annotations.MethodsAPA-Scan utilizes either predicted or experimentally validated actionable polyadenylation signals as a reference for polyadenylation sites and calculates the quantity of long and short 3’-UTR transcripts in the RNA-seq data. APA-Scan works in three major steps: (i) calculate the read coverage of the 3’-UTR regions of genes; (ii) identify the potential APA sites and evaluate the significance of the events among two biological conditions; (iii) graphical representation of user specific event with 3’-UTR annotation and read coverage on the 3’-UTR regions. APA-Scan is implemented in Python3. Source code and a comprehensive user’s manual are freely available at https://github.com/compbiolabucf/APA-Scan.ResultAPA-Scan was applied to both simulated and real RNA-seq datasets and compared with two widely used baselines DaPars and APAtrap. In simulation APA-Scan significantly improved the accuracy of 3’-UTR APA identification compared to the other baselines. The performance of APA-Scan was also validated by 3’-end-seq data and qPCR on mouse embryonic fibroblast cells. The experiments confirm that APA-Scan can detect unannotated 3’ -UTR APA events and improve genome annotation.ConclusionAPA-Scan is a comprehensive computational pipeline to detect transcriptome-wide 3’-UTR APA events. The pipeline integrates both RNA-seq and 3’-end-seq data information and can efficiently identify the significant events with a high-resolution short reads coverage plots. <<<

翻译

67.

颜林林 (2022-08-01 01:02):

#paper doi:10.1093/bioinformatics/btac528 Bioinformatics, 2022, The K-mer File Format: a standardized and compact disk representation of sets of k-mers. 由k个字符连在一起的短串，称为k-mer，在生信的许多工具或分析过程中，如构建de Bruijn图（进行基因组组装）和创建序列索引（进行短序列比对），基本都会用到这个概念，并统计每种k-mer的出现频次，以及其他相关信息（如出现在基因组中的位置、与其他k-mer之间的关系）。随着k的增加，k-mer的种类呈几何数量增长，这给计算、存储都带来巨大开销。为此，本文开发了一种文件存储格式，用于存储k-mer信息，确保信息得以压缩存储的同时，还能保持高效的读写。说实话，这活不复杂，会点儿C++和Rust就能做，而且类似需求也不少。

Bioinformatics (Oxford, England), 2022-09-15. DOI: 10.1093/bioinformatics/btac528 PMID: 35904548

The K-mer File Format: a standardized and compact disk representation of sets of k-mers

翻译

Yoann Dufresne, Teo Lemane, Pierre Marijon, Pierre Peterlongo, Amatur Rahman, Marek Kokot, Paul Medvedev, Sebastian Deorowicz, Rayan Chikhi

Abstract:

SUMMARY: Bioinformatics applications increasingly rely on ad hoc disk storage of k-mer sets, e.g. for de Bruijn graphs or alignment indexes. Here, we introduce the K-mer File Format as a … >>>

翻译

68.

颜林林 (2022-07-31 07:26):

#paper doi:10.1016/j.ccell.2022.07.003 Cancer Cell, 2022, Dark genome, bright ideas: Recent approaches to harness transposable elements in immunotherapies. 占比达到近一半人类基因组的转座元件（transposable element，TE）是个需要继续深入研究的存在。这篇评论文章，快速综述了有关TE与免疫之间的关系，如TE具备的免疫原性，它能激活 DNA 或 RNA 的传感器，也能引发免疫系统反应，从而可能形成新的免疫治疗方法。本文相继描述了 TE 表达对抗肿瘤免疫的影响，以及如何通过介导 TE 表达、介导 TE 免疫原性、辅助 CAR-T 细胞等方式，来实现对肿瘤开展免疫治疗。补充点个人想法：在 DNA 水平上研究各类重复片段，一直是相当困难的，这也是这些序列区间通常被称为“dark genome”（暗黑基因组）的原因；这种困难类似于想要通过地面的投影去反推空中漂浮的大量物件，许多物件的投影彼此重叠而无法区分；而所幸新技术让我们能从长读长、多组学等角度，开始一层层剥开迷雾。

IF:48.800Q1 Cancer cell, 2022-08-08. DOI: 10.1016/j.ccell.2022.07.003 PMID: 35907399

Dark genome, bright ideas: Recent approaches to harness transposable elements in immunotherapies

翻译

Ashley Reid Cahn, Nina Bhardwaj, Nicolas Vabret

Abstract:

Transposable elements (TEs), which make up almost half of the human genome, often display altered expression in cancers. Here, we review recent progress in elucidating the role of TEs as … >>>

翻译

69.

颜林林 (2022-07-30 01:17):

#paper doi:10.15252/msb.202211017 Molecular Systems Biology, 2022, Computational estimation of quality and clinical relevance of cancer cell lines. 这是一篇关于肿瘤细胞系的综述，主要考察公开并被广泛使用的各肿瘤细胞系的质量。文章首先概述了当前不同癌种的细胞系公共资源，包括相应的多组学数据。接着，介绍可能对细胞系质量产生影响的因素，如交叉污染、传代过程中的突变积累、缺少微环境因素、分子和细胞状态等层面的异质性等。然后，针对这些问题，可以如何进行评估，综述了相应的不同计算方法（含工具）。最后，在讨论部分，展望未来的改进方向，诸如多组学整合、迁移学习的引入、单细胞数据的使用、可解释性的提高等。细胞系是肿瘤研究的重要体系，本文对其相应的资源选择和分析评估方法，都系统性地提供了汇总信息。

IF:8.500Q1 Molecular systems biology, 2022-07. DOI: 10.15252/msb.202211017 PMID: 35822563

Computational estimation of quality and clinical relevance of cancer cell lines

翻译

Lucia Trastulla, Javad Noorbakhsh, Francisca Vazquez, James McFarland, Francesco Iorio

Abstract:

Immortal cancer cell lines (CCLs) are the most widely used system for investigating cancer biology and for the preclinical development of oncology therapies. Pharmacogenomic and genome-wide editing screenings have facilitated … >>>

翻译

70.

颜林林 (2022-07-29 08:21):

#paper doi:10.1093/nar/gkac586 Nucleic Acid Research, 2022, De novo assembly of human genome at single-cell levels. 作者之前开发的一项名为 SMOOTH-seq 的技术，大致原理是：用 Tn5 转座子插入基因组DNA，使其随机片段化，然后用带有 barcode 的引物对片段进行链置换和扩增，再将双链末端分别连入一段序列以成环，进行滚环扩增，得到可供长读长测序的长片段，该长片段上带有多份原始序列片段，因而可以准确校正序列碱基。本文在此基础上进行了改进，使用 PacBio HiFi 和 Oxford Nanopore Technologies（ONT）两种测序平台，对 K562 和 HG002 两个细胞系进行单细胞测序。首次在单细胞水平上完成了具有高连续性的人类基因组组装。其结果包括：95 个 K562 细胞，总测序深度约37x（如果没理解错，应该每个细胞的测序深度为 37/95 = 0.4 x），NG50 约 2 Mb；30 个 HG002 细胞，每个细胞的测序深度约为 1G（相当于是 0.33x），NG50 约 1.3 Mb。按文章摘要的说法“开启了单细胞基因组从头组装实践的新篇章”。这个主题看似创新度很高，仔细推敲却不禁有些疑问：单细胞基因组测序很难区分不同类群细胞，因而应该只能在单细胞水平上分别进行组装，否则大量不同类群细胞混合起来组装，则又失去了原本的立意。但是，单个细胞的基因组覆盖度是不可能很全面的（文章提到平均覆盖率约是 41.7%，我猜提升测序数据量也未必对此会有大幅改善），这又很大程度上会限制组装本身，因而最终只能关注其中的结构变异鉴定结果。此外，单细胞基因组结果其实很难验证，很难用其他细胞的结果来评判当前被测细胞的结果是否准确，这应该也是一个逻辑上的硬伤。所以，最终这篇文章的贡献，除了两个细胞系的单细胞基因组测序数据本身外，大概主要还是在于实验方法摸索优化和技术方法建立吧，当然其数据分析方法过程也是值得参考的。

IF:16.600Q1 Nucleic acids research, 2022-07-22. DOI: 10.1093/nar/gkac586 PMID: 35819189 PMCID:PMC9303314

De novo assembly of human genome at single-cell levels

翻译

Haoling Xie, Wen Li, Yuqiong Hu, Cheng Yang, Jiansen Lu, Yuqing Guo, Lu Wen, Fuchou Tang

Abstract:

Genome assembly has been benefited from long-read sequencing technologies with higher accuracy and higher continuity. However, most human genome assembly require large amount of DNAs from homogeneous cell lines without … >>>

翻译

71.

颜林林 (2022-07-28 08:50):

#paper doi:10.1093/bioinformatics/btac137 Bioinformatics, 2022, BWA-MEME: BWA-MEM emulated with a machine learning approach. 看到李恒在Twitter上转发这篇文章，本以为大神又升级了bwa mem2，之后发现原来是他人的作品，得到了李恒钦点而已。作为某个知名软件的后继者，必然是要在某个方面有较大改进的，这篇的改进主要在性能。用于高通量测序数据的短序列比对算法，通常都是先用精确匹配种子（这几乎都是查表法在常数时间内完成），然后进行延伸匹配。而种子序列的长度选择，是一项比较有技巧性的事，太短可能导致重复匹配（hit）过多，太长则可能大量单词无匹配（在基因组上无该序列）却占据字典，导致字典过大。为此，过去也有一些算法，会采用变长种子来解决该问题（我也设想过这个策略，但惭愧的是，最终未能付诸实践）。而变长种子的策略，存在内存块大小不定、访问频繁等问题，会导致性能瓶颈。在本文中，通过机器学习的方法，在建立种子索引的阶段进行预处理，使得索引能够根据基因组序列数据进行适应，使不同长度种子的内存访问次数固定，从而获得性能提升。在最终的评测中，bwa-meme 能保持与 bwa-mem2 的输出相同，运行速度则提升了 3.45 倍。这篇文章的算法，可以再仔细深入学习下。

Bioinformatics (Oxford, England), 2022-04-28. DOI: 10.1093/bioinformatics/btac137 PMID: 35253835

BWA-MEME: BWA-MEM emulated with a machine learning approach

翻译

Youngmok Jung, Dongsu Han

Abstract:

MOTIVATION: The growing use of next-generation sequencing and enlarged sequencing throughput require efficient short-read alignment, where seeding is one of the major performance bottlenecks. The key challenge in the seeding … >>>

翻译

72.

颜林林 (2022-07-26 23:37):

#paper doi:10.1002/jbio.202100389 Journal of Biophotonics, 2022, Skin's green autofluorescence at dorsal centremetacarpus may become a novel biomarker for diagnosis of lung cancer. 肿瘤早筛是当下最热门的研发方向之一，过热到都似乎开始裁员的地步，因为大家都在同质化地走类似的路线（如甲基化测序）。而这篇来自上海交大的文章，另辟蹊径地采取对皮肤的自发荧光进行检测的方法，尝试将其用于肺癌早期筛查和诊断。这是一种真正无创的新型检测方法，其原理在于皮肤表皮的棘层中，存在一种角蛋白分子，在蓝光照射下会发出荧光。而这种荧光的强度，又与疾病状态相关。本文研究中纳入了临床实际病例和异体移植的小鼠肿瘤模型，从肺部感染或健康对照中分别区分肺癌，AUC分别可达到 0.871 和 0.813，证明了这是一种潜在的生物标志物，可用于肺癌早期筛查和诊断。

IF:2.000Q3 Journal of biophotonics, 2022-05. DOI: 10.1002/jbio.202100389 PMID: 35075788

Skin's green autofluorescence at dorsal centremetacarpus may become a novel biomarker for diagnosis of lung cancer

翻译

Mingchao Zhang, Yue Tao, Qing Chang, Kaixuan Wang, Tianqing Chu, Weihai Ying

Abstract:

It is critical to discover novel biomarkers of lung cancer for establishing economical technology for diagnosis of lung cancer. Our study has suggested that the autofluorescence (AF) of the skin … >>>

翻译

73.

颜林林 (2022-07-25 07:28):

#paper doi:10.1038/s41380-022-01661-0 Molecular Psychiatry, 2022, The serotonin theory of depression: a systematic umbrella review of the evidence. 这是一篇meta分析，而且还是一篇阴性结果的报道，按照很多“业内人”的观点，这样的“水文”是不屑一顾或羞于启齿的。本文研究血清素（serotonin，即5-羟色胺）是否与抑郁症病因有关。这是一个流行于大多数公众和专业研究人员的观点，人们普遍认为血清素降低与抑郁症有关。本文采取了“伞式”审查（umbrella review）方法，纳入多个不同领域对血清素系统进行的大量研究，以便为结论提供可及的最高证据等级支持。涵盖的六个领域分别是：(1) 血清素及其代谢物5-HIAA（5-羟吲哚乙酸）是否在抑郁症患者体液中含量更低；(2) 抑郁症患者的血清素受体是否表达水平更低；(3) 血清素转运蛋白（SERT）是否抑郁症患者中表达更高；(4) 色氨酸（5-羟色胺的前体）耗竭是否会导致抑郁症；(5) 抑郁症患者的 SERT 基因是否表达更高；(6) 抑郁症患者的SERT基因与压力之间是否存在相互作用。本文研究在 PROSPERO 注册（CRD42020207203），共纳入 17 项研究：12 项系统评价和meta分析（systematic reviews and meta-analyses），1 项协作meta分析（collaborative meta-analysis），1 项大型队列研究的meta分析（meta-analysis of large cohort studies），1 项系统评价和综述（systematic review and narrative synthesis），1 项遗传关联研究（genetic association study）和 1 项伞式审查（umbrella review）。最终在六个领域问题上，分别以各自可及的最大样本量（从数百到数万），否定了血清素活性标志物与抑郁症之间的关联，并建议“it is time to acknowledge that the serotonin theory of depression is not empirically substantiated（是时候承认抑郁症的血清素理论并没有经验实证）”。可见，能够明确下一个阴性结论（否定结论），也是相当不容易的。

IF:9.600Q1 Molecular psychiatry, 2023-Aug. DOI: 10.1038/s41380-022-01661-0 PMID: 35854107

The serotonin theory of depression: a systematic umbrella review of the evidence

翻译

Joanna Moncrieff, Ruth E Cooper, Tom Stockmann, Simone Amendola, Michael P Hengartner, Mark A Horowitz

Abstract:

The serotonin hypothesis of depression is still influential. We aimed to synthesise and evaluate evidence on whether depression is associated with lowered serotonin concentration or activity in a systematic umbrella … >>>

The serotonin hypothesis of depression is still influential. We aimed to synthesise and evaluate evidence on whether depression is associated with lowered serotonin concentration or activity in a systematic umbrella review of the principal relevant areas of research. PubMed, EMBASE and PsycINFO were searched using terms appropriate to each area of research, from their inception until December 2020. Systematic reviews, meta-analyses and large data-set analyses in the following areas were identified: serotonin and serotonin metabolite, 5-HIAA, concentrations in body fluids; serotonin 5-HT receptor binding; serotonin transporter (SERT) levels measured by imaging or at post-mortem; tryptophan depletion studies; SERT gene associations and SERT gene-environment interactions. Studies of depression associated with physical conditions and specific subtypes of depression (e.g. bipolar depression) were excluded. Two independent reviewers extracted the data and assessed the quality of included studies using the AMSTAR-2, an adapted AMSTAR-2, or the STREGA for a large genetic study. The certainty of study results was assessed using a modified version of the GRADE. We did not synthesise results of individual meta-analyses because they included overlapping studies. The review was registered with PROSPERO (CRD42020207203). 17 studies were included: 12 systematic reviews and meta-analyses, 1 collaborative meta-analysis, 1 meta-analysis of large cohort studies, 1 systematic review and narrative synthesis, 1 genetic association study and 1 umbrella review. Quality of reviews was variable with some genetic studies of high quality. Two meta-analyses of overlapping studies examining the serotonin metabolite, 5-HIAA, showed no association with depression (largest n = 1002). One meta-analysis of cohort studies of plasma serotonin showed no relationship with depression, and evidence that lowered serotonin concentration was associated with antidepressant use (n = 1869). Two meta-analyses of overlapping studies examining the 5-HT receptor (largest n = 561), and three meta-analyses of overlapping studies examining SERT binding (largest n = 1845) showed weak and inconsistent evidence of reduced binding in some areas, which would be consistent with increased synaptic availability of serotonin in people with depression, if this was the original, causal abnormaly. However, effects of prior antidepressant use were not reliably excluded. One meta-analysis of tryptophan depletion studies found no effect in most healthy volunteers (n = 566), but weak evidence of an effect in those with a family history of depression (n = 75). Another systematic review (n = 342) and a sample of ten subsequent studies (n = 407) found no effect in volunteers. No systematic review of tryptophan depletion studies has been performed since 2007. The two largest and highest quality studies of the SERT gene, one genetic association study (n = 115,257) and one collaborative meta-analysis (n = 43,165), revealed no evidence of an association with depression, or of an interaction between genotype, stress and depression. The main areas of serotonin research provide no consistent evidence of there being an association between serotonin and depression, and no support for the hypothesis that depression is caused by lowered serotonin activity or concentrations. Some evidence was consistent with the possibility that long-term antidepressant use reduces serotonin concentration. <<<

翻译

74.

颜林林 (2022-07-24 05:55):

#paper doi:10.1186/s12864-022-08762-8 BMC Genomics, 2022, Poly(a) selection introduces bias and undue noise in direct RNA-sequencing. 全转录组测序实验中，在初始的RNA提取环节后，经常会使用poly-A筛选方法，来富集mRNA。本文使用ONT平台，开展直接RNA测序（direct RNA-sequencing），并对同一样本，平行地采取使用和不适用poly-A筛选的方法。最终结果说明，省略该环节是合适的，虽然这么做可能轻微降低文库复杂度，但它能更有效避免该筛选环节带来的其他弊端，如需要更多RNA起始量、容易倾向地筛选出具有更长poly-A尾巴的mRNA、会导致差异表达基因也受到影响而更不稳定等。

IF:3.500Q2 BMC genomics, 2022-Jul-22. DOI: 10.1186/s12864-022-08762-8 PMID: 35869428

Poly(a) selection introduces bias and undue noise in direct RNA-sequencing

翻译

Marcus J Viscardi, Joshua A Arribere

Abstract:

BACKGROUND: Genome-wide RNA-sequencing technologies are increasingly critical to a wide variety of diagnostic and research applications. RNA-seq users often first enrich for mRNA, with the most popular enrichment method being … >>>

翻译

75.

颜林林 (2022-07-23 22:05):

#paper doi:10.1101/2022.07.21.500999 bioRxiv, 2022, High-resolution de novo structure prediction from primary sequence. 这篇预发表的文章，开发了一个工具，OmegaFold，可以基于单个蛋白的一级序列信息，预测三级结构。现在主流的方法，都需要依赖演化信息，即通过多序列比对作为辅助，进行蛋白质折叠结构的预测。而本文认为，蛋白从被翻译合成出来后，就会经历从一级序列自动折叠成为三级结构，因而这些演化信息对于结构预测而言并非必要。本文采取的深度模型，会依赖于一组预训练模型，帮助识别出一级序列中哪些氨基酸更为重要（即赋予不同的注意力），并采取基于BERT的语言模型技术，帮助进行蛋白质折叠的模型训练。最终实现的方法，可以有效解决孤儿蛋白（即当前结构数据库中缺乏其他可供参考的相近蛋白）的结构预测问题，且与AlphaFold等工具相比，在准确度上又有显著提升。

bioRxiv, 2022. DOI: 10.1101/2022.07.21.500999

High-resolution de novo structure prediction from primary sequence

翻译

Ruidong Wu , Fan Ding , Rui Wang , Rui Shen , Xiwen Zhang , Shitong Luo , Chenpeng Su , Zuofan Wu , Qi Xie , Bonnie Berger , ... >>>

Abstract:

Recent breakthroughs have used deep learning to exploit evolutionary information in multiple sequence alignments (MSAs) to accurately predict protein structures. However, MSAs of homologous proteins are not always available, such … >>>

翻译

76.

颜林林 (2022-07-22 00:00):

#paper doi:10.1056/NEJMe2207902 The New England Journal of Medicine, 2022, Setting the Benchmark for KRAS(G12C)-Mutated NSCLC. 这是一篇社论（Editorial），介绍了该期杂志上关于KRYSTAL-1二期临床试验的结果报道（doi:10.1056/NEJMoa2204619）。该临床试验的主角，是一种KRAS G12C抑制剂，阿达格拉西布（Adagrasib），其在此次临床试验中表现不错，对经过化疗与免疫治疗的携带KRAS G12C突变的患者，生存评估的指标（ORR、PFS和OS等），与此前另一个获批药物，索托拉西布（sotorasib）非常接近。这篇社论由此推测，这两个药物在机制上可能存在很大的重叠。此外，两个药物在代谢和动力学方面的差异（如穿越血脑屏障、在体内的半衰期等），则又为两个药物未来在选用时可采取的差异化，提供了方向提示。

The New England journal of medicine, 2022-07-14. DOI: 10.1056/NEJMe2207902 PMID: 35830645

Setting the Benchmark for KRASG12C -Mutated NSCLC

翻译

Antonio Passaro, Solange Peters

Abstract: No abstract available.

77.

颜林林 (2022-07-21 00:29):

#paper doi:10.1186/s13059-022-02726-7 Genome Biology, 2022, Integration of single-cell multi-omics data by regression analysis on unpaired observations. 受技术条件限制，绝大多数的单细胞多组学研究，其实都很难在同一细胞上同时检测多个不同组学。本文针对这个问题，基于“相似表达的靶基因的调控基因也相似”的直观认识和假设，采用回归分析方法，对scRNA-seq和ATAC-seq数据之间的关系进行关联和推断，使非配对的scRNA-seq和ATAC-seq实验（即并非同一细胞，而是在不同细胞上分别开展了这两项检测）中，可以通过其中一项数据（如ATAC-seq的染色质开放信息）去推断对应的被调控基因的表达。该方法在模拟数据和实测数据上进行评估，可以达到很高的准确度（与eQTL mapping进行对比，结果高度一致）。这为更好利用当前积累的大量非配对单细胞数据，提供了方法学上的支持。

IF:10.100Q1 Genome biology, 2022-07-19. DOI: 10.1186/s13059-022-02726-7 PMID: 35854350 PMCID:PMC9295346

Integration of single-cell multi-omics data by regression analysis on unpaired observations

翻译

Qiuyue Yuan, Zhana Duren

Abstract:

Despite recent developments, it is hard to profile all multi-omics single-cell data modalities on the same cell. Thus, huge amounts of single-cell genomics data of unpaired observations on different cells … >>>

翻译

78.

颜林林 (2022-07-20 07:49):

#paper doi:10.1101/2022.07.17.500374 bioRxiv, 2022, Genozip Dual-Coordinate VCF format enables efficient genomic analyses and alleviates liftover limitations. 这是一个“认真地做一件小事”的例子。在做基因组分析时，我们经常遭遇“究竟该用hg19还是hg38”的纠结，有时候不得不并行地分别使用两个参考基因组来进行两次差不多的分析，以避免由于使用liftOver之类的基因组坐标转换工具带来的信息丢失。这篇文章针对这个小小的（甚至不那么常见的）痛点，在兼容现有VCF格式的情况下，使其在同一个结果文件中带上两套基因组坐标，不仅不影响现有工具的使用，而且可以随时从中进行所需基因组坐标的提取。想法很简单，实现也不难，但却的确是有效解决了某些实际操作的问题。

bioRxiv, 2022. DOI: 10.1101/2022.07.17.500374

Genozip Dual-Coordinate VCF format enables efficient genomic analyses and alleviates liftover limitations

翻译

Divon Mordechai Lan , Gludhug Purnomo , Raymond Tobler , Yassine Souilmi , Bastien Llamas

Abstract:

We introduce Dual Coordinate VCF (DVCF), a file format that records genomic variants against two different reference genomes simultaneously and is fully compliant with the current VCF specification. As implemented … >>>

翻译

79.

颜林林 (2022-07-19 00:21):

#paper doi:10.1002/humu.24440 Human Mutation, 2022, Multi-omics analysis reveals multiple mechanisms causing Prader-Willi like syndrome in a family with a X;15 translocation. 这篇文章报道了一个患有PWS（Prader-Willi syndrome）遗传病的家庭，以及对其致病基因进行发现和确认的过程。PWS是一种神经发育疾病，且属于教科书级别的遗传病，因为它由一个遗传印记基因区域的变异所导致。所谓遗传印记，即该等位基因会记住其来源是父方或母方，并只在其中一方来源的染色体上的该基因才会表达。PWS就是与15q11.2区域相关，通常是该区域基因的父源拷贝缺失导致疾病。这篇文章报道的家庭，两位女儿都表现出该疾病相关症状（肥胖、智力障碍等），其母亲是携带者（存在一个15号染色体与X染色体的易位突变，translocation）。在本文中，分别使用了核型分析（karyotype）、FISH（染色体原位荧光杂交）、甲基化敏感的MLPA、短序列WGS、10x linked read WGS、转录组测序、ddPCR等方法，各方法都对应解决了在该遗传调查过程中要解决的某个环节的问题，最终确认了该致病基因，以及解释和推论出两个女儿患者的不同发病机制：一个是在15号染色体该区域表现为单亲二体（Uniparental disomy，UPD），另一个则是在印记基因上丧失了印记特性，即两条染色体上都能同时表达该SNRPN基因。对于遗传病研究人员或者从事遗传咨询工作的人员，这篇文章的整个研究过程，涉及的技术众多，逻辑条理清晰，非常具有学习价值。

IF:3.300Q2 Human mutation, 2022-11. DOI: 10.1002/humu.24440 PMID: 35842787

Multi-omics analysis reveals multiple mechanisms causing Prader-Willi like syndrome in a family with a X;15 translocation

翻译

Jesper Eisfeldt, Fatemah Rezayee, Maria Pettersson, Kristina Lagerstedt, Helena Malmgren, Anna Falk, Giedre Grigelioniene, Anna Lindstrand

Abstract:

Prader-Willi syndrome (PWS; MIM# 176270) is a neurodevelopmental disorder caused by the loss of expression of paternally imprinted genes within the PWS region located on 15q11.2. It is usually caused … >>>

翻译

80.

颜林林 (2022-07-18 06:00):

#paper doi:10.1101/2022.07.14.500036 bioRxiv, 2022, Trade-off between conservation of biological variation and batch effect removal in deep generative modeling for single-cell transcriptomics. 单细胞转录组测序数据分析中，需要对批次效应影响进行去除。这通常是对原本高维的数据进行降维，使其在更容易反映出数据结构特征的低维空间上，根据批次信息对数据进行矫正。这个过程很容易导致具有生物学意义的数据特征被误伤，而这样的生物学差异正是我们进行单细胞测序所要研究的对象。针对如何去除批次效应影响，以及如何保留生物学相关数据差异，这两个原本互相矛盾的目标，通常被单细胞测序分析工具根据其各自策略原则的不同，会被选取其中之一作为优先目标进行优化。在本文中，作者通过引入一种名为帕累托多任务学习（Pareto MTL）的多目标优化技术，使综合评估并权衡与两者有关的多种不同指标，以获得整体更优的目的。在这个过程中，还基于神经网络方法，提出一种名为交互信息神经估计（Mutual Information Neural Estimation，MINE）的指标，来帮助该平衡点的选取。文章使用了TM-MARROW和MACAQUE-RETINA等公共数据集，对方法进行了评估，并展示了MINE的效果，确实优于常用的MMD方法。

bioRxiv, 2022. DOI: 10.1101/2022.07.14.500036

Trade-off between conservation of biological variation and batch effect removal in deep generative modeling for single-cell transcriptomics

翻译

Hui Li , Davis J. McCarthy , Heejung Shim , Susan Wei

Abstract:

Single-cell RNA sequencing (scRNA-seq) technology has contributed significantly to diverse research areas in biology, from cancer to development. Since scRNA-seq data is high-dimensional, a common strategy is to learn low … >>>

翻译