Papers from the journal PLoS Computational Biology.
5 shared papers found so far.
1.
颜林林 (2024-07-20 14:59):
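#paper doi:10.1371/journal.pcbi.1012232, PLOS Computational Biology, Ten simple rules for building and maintaining a responsible data science workflow. "Ten simple rules" is a very popular series of commentary articles in PLOS Comp. Bio. Each piece is short, covers a single topic, and offers ten "rules" explained one by one. The advice usually comes from experienced people in the field, so it tends to be concise and to the point, and is well worth reading. This one is about how to build and maintain a "responsible" data science workflow, which in short means "how not to do evil". Its suggestions include thinking ahead about the harmful outcomes the research could lead to, watching for bias in data sources, reviewing and re-examining the work regularly, iteratively updating evaluation methods and standards, and maintaining transparency. For those of us who deal with data every day, this paper also works well as a day-to-day checklist.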
2.
笑对人生 (2023-11-29 11:48):
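#paper doi: 10.1371/journal.pcbi.1007531. Kumuthini J, et al. Ten simple rules for providing effective bioinformatics research support. PLoS Comput Biol. 2020 Mar 26;16(3):e1007531. The spread of high-throughput sequencing has steadily increased the demand for bioinformatics analysis. But how should a bioinformatics analyst collaborate effectively with laboratory scientists? Drawing on years of experience with data analysis in interdisciplinary projects, the authors sum up ten simple but effective rules. (1) Understand the experimental design thoroughly. (2) Define the scope of the data analysis project, the path to achieve it, and the expected results. (3) Manage data comprehensively and rigorously, for example by backing it up promptly. (4) Keep analysis steps and results traceable. (5) Define the storage format and content of the metadata that carries sample information. (6) Pay attention to data security. (7) Carry out quality control throughout the whole project lifecycle. (8) Choose appropriate data analysis tools. Choosing correctly requires knowing a tool's strengths and limitations and whether it is widely used by researchers. (9) Promptly record every change made during the analysis. (10) For unusable data, communicate with the experimental scientists promptly and, if necessary, repurpose the data.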
Abstract:
Life scientists are increasingly turning to high-throughput sequencing technologies in their research programs, owing to the enormous potential of these methods. In a parallel manner, the number of core facilities that provide bioinformatics support are also increasing. Notably, the generation of complex large datasets has necessitated the development of bioinformatics support core facilities that aid laboratory scientists with cost-effective and efficient data management, analysis, and interpretation. In this article, we address the challenges (related to communication, good laboratory practice, and data handling) that may be encountered in core support facilities when providing bioinformatics support, drawing on our own experiences working as support bioinformaticians on multidisciplinary research projects. Most importantly, the article proposes a list of guidelines that outline how these challenges can be preemptively avoided and effectively managed to increase the value of outputs to the end user, covering the entire research project lifecycle, including experimental design, data analysis, and management (i.e., sharing and storage). In addition, we highlight the importance of clear and transparent communication, comprehensive preparation, appropriate handling of samples and data using monitoring systems, and the employment of appropriate tools and standard operating procedures to provide effective bioinformatics support.
3.
muton (2022-07-28 11:58):
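#paper DOI: 10.1371/journal.pcbi.1009267 Unveiling functions of the visual cortex using task-specific deep neural networks. Human visual perception is a complex cognitive ability, regulated by different cortical regions of the brain. However, the exact functions of these regions are still not fully understood, so there is also no definite answer to how they coordinate visual perception. The current view is that visual information is transformed through hierarchical computations across different functional regions, which are usually grouped into the ventral and dorsal visual pathways. Both discovering the exact function of each visual cortical region and reproducing that function with computational modeling are challenging, yet they are the ultimate goal. Deep neural networks (DNNs) are a promising approach for modeling and predicting the responses of visual regions. By relating fMRI responses to a set of DNN models optimized for different visual tasks (the authors chose 18 DNNs trained on the Taskonomy dataset, each optimized for one of 18 different indoor scene understanding tasks), the paper finds a structured mapping of visual information along the ventral and dorsal visual pathways. Low-level visual tasks map onto early visual cortex, 3D scene perception tasks map onto the dorsal stream, and semantic tasks map onto the ventral stream. The highlight of the paper is probably the idea itself: comparing the similarity between models and actual human brain data to work out which brain regions contribute to which tasks.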
Abstract:
The human visual cortex enables visual perception through a cascade of hierarchical computations in cortical regions with distinct functionalities. Here, we introduce an AI-driven approach to discover the functional mapping of the visual cortex. We related human brain responses to scene images measured with functional MRI (fMRI) systematically to a diverse set of deep neural networks (DNNs) optimized to perform different scene perception tasks. We found a structured mapping between DNN tasks and brain regions along the ventral and dorsal visual streams. Low-level visual tasks mapped onto early brain regions, 3-dimensional scene perception tasks mapped onto the dorsal stream, and semantic tasks mapped onto the ventral stream. This mapping was of high fidelity, with more than 60% of the explainable variance in nine key regions being explained. Together, our results provide a novel functional mapping of the human visual cortex and demonstrate the power of the computational approach.
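The idea of comparing models with brain data by similarity can be illustrated with a small representational similarity sketch in Python. This is not the paper's pipeline: the data are random placeholders, and the names `rdm`, `rsa_score`, `matching_model`, and `unrelated_model` are hypothetical. It only shows the general pattern of scoring several candidate models against one brain region and picking the best match.

```python
import numpy as np

def rdm(features):
    """Representational dissimilarity matrix: 1 - Pearson correlation between image rows."""
    return 1.0 - np.corrcoef(features)

def upper_triangle(matrix):
    """Flatten the entries above the diagonal (each image pair counted once)."""
    rows, cols = np.triu_indices(matrix.shape[0], k=1)
    return matrix[rows, cols]

def rsa_score(model_features, brain_responses):
    """Correlate the model RDM with the brain RDM over all image pairs."""
    model_pairs = upper_triangle(rdm(model_features))
    brain_pairs = upper_triangle(rdm(brain_responses))
    return float(np.corrcoef(model_pairs, brain_pairs)[0, 1])

# Placeholder data (assumption): 20 images, 50 voxels in one hypothetical region.
rng = np.random.default_rng(0)
n_images, n_voxels = 20, 50
brain = rng.normal(size=(n_images, n_voxels))

# Two hypothetical candidate models: one shares the brain's structure, one does not.
matching_model = 2.0 * brain + 0.1 * rng.normal(size=(n_images, n_voxels))
unrelated_model = rng.normal(size=(n_images, 30))

scores = {
    "matching_model": rsa_score(matching_model, brain),
    "unrelated_model": rsa_score(unrelated_model, brain),
}
best_model = max(scores, key=scores.get)
print(scores, best_model)
```

With real data, each candidate would be the activations of one task-specific DNN, and the score would be computed per brain region to see which task explains which region best.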
4.
颜林林 (2022-06-26 22:13):
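#paper doi:10.1371/journal.pcbi.1009730 PLOS Computational Biology, 2022, Improved transcriptome assembly using a hybrid of long and short reads with StringTie. This paper from Johns Hopkins presents a tool that can assemble transcriptomes from a mix of long-read and short-read sequencing data. In high-throughput sequencing, short-read platforms are highly accurate, but the reads are short and rarely cover a complete transcript. Long-read platforms can span multiple exons and help determine how a transcript is spliced, but their lower base accuracy easily causes alignment errors that interfere with transcript determination. The paper shows the "noisy" alignments caused by sequencing errors and the large increase in search space that results. By applying a solution to the maximum flow problem from graph theory, and by using the more accurate short-read data locally around "noisy" alignments to pin down the correct splice sites, the tool combines the strengths of both platforms (long and short reads), and its running speed is no worse than that of earlier tools that use a single data type. To evaluate the tool, the paper uses simulated data as well as several real datasets from Arabidopsis, mouse, and human, and it performs better, as expected, in terms of assembly precision and the number of output transcripts that can be correctly annotated.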
Abstract:
Short-read RNA sequencing and long-read RNA sequencing each have their strengths and weaknesses for transcriptome assembly. While short reads are highly accurate, they are rarely able to span multiple exons. Long-read technology can capture full-length transcripts, but its relatively high error rate often leads to mis-identified splice sites. Here we present a new release of StringTie that performs hybrid-read assembly. By taking advantage of the strengths of both long and short reads, hybrid-read assembly with StringTie is more accurate than long-read only or short-read only assembly, and on some datasets it can more than double the number of correctly assembled transcripts, while obtaining substantially higher precision than the long-read data assembly alone. Here we demonstrate the improved accuracy on simulated data and real data from Arabidopsis thaliana, Mus musculus, and human. We also show that hybrid-read assembly is more accurate than correcting long reads prior to assembly while also being substantially faster. StringTie is freely available as open source software at https://github.com/gpertea/stringtie.
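The post above mentions that the tool relies on a solution to the maximum flow problem. As a rough illustration of what that problem looks like, here is a minimal Edmonds-Karp sketch in Python on a toy splice graph. It is not StringTie's implementation: the graph, the node names, and the capacities (imagined here as read support per exon junction) are all hypothetical.

```python
from collections import deque

def max_flow(capacity, source, sink):
    """Edmonds-Karp: repeatedly push flow along the shortest augmenting path."""
    # Build residual capacities, adding reverse edges with capacity 0.
    residual = {u: dict(edges) for u, edges in capacity.items()}
    for u, edges in capacity.items():
        for v in edges:
            residual.setdefault(v, {}).setdefault(u, 0)

    total = 0
    while True:
        # Breadth-first search for a path with spare capacity.
        parent = {source: None}
        queue = deque([source])
        while queue and sink not in parent:
            u = queue.popleft()
            for v, cap in residual[u].items():
                if cap > 0 and v not in parent:
                    parent[v] = u
                    queue.append(v)
        if sink not in parent:
            return total  # no augmenting path left

        # Find the bottleneck capacity along the path.
        bottleneck = float("inf")
        v = sink
        while parent[v] is not None:
            bottleneck = min(bottleneck, residual[parent[v]][v])
            v = parent[v]

        # Push the bottleneck amount along the path and update reverse edges.
        v = sink
        while parent[v] is not None:
            u = parent[v]
            residual[u][v] -= bottleneck
            residual[v][u] += bottleneck
            v = u
        total += bottleneck

# Hypothetical toy splice graph: exon2 is either included or skipped.
toy_graph = {
    "source": {"exon1": 10},
    "exon1": {"exon2": 6, "exon3": 4},
    "exon2": {"exon3": 6},
    "exon3": {"sink": 9},
}
flow = max_flow(toy_graph, "source", "sink")
print(flow)
```

In this toy case the last edge limits the total, so the flow is 9 even though 10 units leave the source.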
5.
lsj (2022-05-31 22:29):
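#paper:https://doi.org/10.1371/journal.pcbi.1003588 For thousands of years, philosophers have debated the nature of consciousness and whether it can exist in other species. Now, as artificial intelligence matures, people are more eager to solve the riddle of consciousness: through interaction alone, that is, the classic Turing test, it may no longer be possible to tell whether you are dealing with a machine or a person, and the accompanying ethical challenges of AI are increasingly hard to ignore. This paper proposes a theory that characterizes consciousness quantitatively, Integrated Information Theory, to explore the mechanisms of consciousness. Its key parts are an introduction to Integrated Information Theory, the axioms about consciousness, the postulates about the physical substrate of consciousness, systems of mechanisms and conceptual structures, and the limitations of Integrated Information Theory.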
Abstract:
This paper presents Integrated Information Theory (IIT) of consciousness 3.0, which incorporates several advances over previous formulations. IIT starts from phenomenological axioms: information says that each experience is specific--it is what it is by how it differs from alternative experiences; integration says that it is unified--irreducible to non-interdependent components; exclusion says that it has unique borders and a particular spatio-temporal grain. These axioms are formalized into postulates that prescribe how physical mechanisms, such as neurons or logic gates, must be configured to generate experience (phenomenology). The postulates are used to define intrinsic information as "differences that make a difference" within a system, and integrated information as information specified by a whole that cannot be reduced to that specified by its parts. By applying the postulates both at the level of individual mechanisms and at the level of systems of mechanisms, IIT arrives at an identity: an experience is a maximally irreducible conceptual structure (MICS, a constellation of concepts in qualia space), and the set of elements that generates it constitutes a complex. According to IIT, a MICS specifies the quality of an experience and integrated information ΦMax its quantity. From the theory follow several results, including: a system of mechanisms may condense into a major complex and non-overlapping minor complexes; the concepts that specify the quality of an experience are always about the complex itself and relate only indirectly to the external environment; anatomical connectivity influences complexes and associated MICS; a complex can generate a MICS even if its elements are inactive; simple systems can be minimally conscious; complicated systems can be unconscious; there can be true "zombies"--unconscious feed-forward systems that are functionally equivalent to conscious complexes.