当前共找到 1086 篇文献分享,本页显示第 981 - 1000 篇。
981.
张德祥
(2022-05-07 11:31):
#paper doi:10.11896/jsjkx.210500158 基于物理信息的神经网络: 最新进展与展望 神经网络的强大高效已经被人们认识,缺点也逐渐暴露,物理模型有严格的推导,对物理世界有很好的先验,结合起来效率提升,这篇中文文章调研了这个领域的最新进展,表一对理论和应用总结的不错,论文对NS方程也有大量的调研,值得参考。
计算机科学,
2022.
DOI: 10.11896/jsjkx.210500158
Abstract:
基于物理信息的神经网络(Physics-informed Neural Networks,PINN),是一类用于解决有监督学习任务的神经网络,它不仅尽力遵循训练数据样本的分布规律,而且遵守由偏微分方程描述的物理定律。与纯数据驱动的神经网络学习相比,PINN在训练过程中施加了物理信息约束,因此能用更少的数据样本学习得到更具泛化能力的模型。近年来,PINN已逐渐成为机器学习和计算数学交叉学科的研究热点,并在理论和应用方面都获得了相对深入的研究,取得了可观的进展。但PINN独特的网络结构在实际应用中也存在训练缓慢甚至不收敛、精度低等问题。文中在总结当前PINN研究的基础上,对其网络/体系设计及其在流体力学等多个领域中的应用进行了探究,并展望了进一步的研究方向。
>>>
基于物理信息的神经网络(Physics-informed Neural Networks,PINN),是一类用于解决有监督学习任务的神经网络,它不仅尽力遵循训练数据样本的分布规律,而且遵守由偏微分方程描述的物理定律。与纯数据驱动的神经网络学习相比,PINN在训练过程中施加了物理信息约束,因此能用更少的数据样本学习得到更具泛化能力的模型。近年来,PINN已逐渐成为机器学习和计算数学交叉学科的研究热点,并在理论和应用方面都获得了相对深入的研究,取得了可观的进展。但PINN独特的网络结构在实际应用中也存在训练缓慢甚至不收敛、精度低等问题。文中在总结当前PINN研究的基础上,对其网络/体系设计及其在流体力学等多个领域中的应用进行了探究,并展望了进一步的研究方向。
<<<
翻译
982.
张德祥
(2022-05-02 09:28):
#paper https://doi.org/10.48550/arXiv.2001.04385 Universal Differential Equations for Scientific Machine Learnin
我们提供一流的工具来求解微分方程
我们提供用于推导和拟合科学模型的工具
我们提供高级域特定建模工具,使科学建模更易于访问
我们提供科学机器学习中最新算法的高级实现
我们为所有常见科学编程语言的用户提供使用我们工具的能力
我们提供用于研究科学机器学习方法的工具
我们的目标是什么
我们构建的一切都与自动微分兼容
性能被视为优先事项,性能问题被视为错误
我们的软件包使用科学模拟和机器学习工具进行了常规和稳健的测试
我们紧跟计算硬件的进步,以确保与最新的高性能计算工具兼容。
https://mp.weixin.qq.com/s/jR_2A1IqqZ1J8idmXb9Tpg
arXiv,
2021.
DOI: 10.48550/arXiv.2001.04385
Abstract:
In the context of science, the well-known adage "a picture is worth a thousand words" might well be "a model is worth a thousand datasets." In this manuscript we introduce …
>>>
In the context of science, the well-known adage "a picture is worth a thousand words" might well be "a model is worth a thousand datasets." In this manuscript we introduce the SciML software ecosystem as a tool for mixing the information of physical laws and scientific models with data-driven machine learning approaches. We describe a mathematical object, which we denote universal differential equations (UDEs), as the unifying framework connecting the ecosystem. We show how a wide variety of applications, from automatically discovering biological mechanisms to solving high-dimensional Hamilton-Jacobi-Bellman equations, can be phrased and efficiently handled through the UDE formalism and its tooling. We demonstrate the generality of the software tooling to handle stochasticity, delays, and implicit constraints. This funnels the wide variety of SciML applications into a core set of training mechanisms which are highly optimized, stabilized for stiff equations, and compatible with distributed parallelism and GPU accelerators.
<<<
翻译
983.
张德祥
(2022-05-01 09:56):
#paper https://doi.org/10.48550/arXiv.2204.07953 Learning with Signatures mnist等识别100% ,这个结果一下子炸了锅了,reddit质疑诋毁一片, https://github.com/decurtoydiaz/learning_with_signatures/issues 的讨论也很激动,但是作者开放了代码,回应了质疑,https://www.kaggle.com/code/mlsnatcher/replicate-results-signature-model/notebook也有可以直接运行的代码,在issue5讨论中作者也承认了有一个不足,除了不认可,是否可以深入了解一下这个技术具体使用的方法?论文不用深度学习,使用了:The signature was first defined for smooth paths by Chen in the 60s (Chen, 1957; 1958; 1977) and was rediscovered in the 90s in the context of rough path theory;这个数学很难,想真正搞懂这个论文的底细很难,挑战很大,搞懂了也是本事,如果技术真的ok,那也是领先一步。
arXiv,
2022.
DOI: 10.48550/arXiv.2204.07953
Abstract:
In this work we investigate the use of the Signature Transform in the context of Learning. Under this assumption, we advance a supervised framework that potentially provides state-of-the-art classification accuracy …
>>>
In this work we investigate the use of the Signature Transform in the context of Learning. Under this assumption, we advance a supervised framework that potentially provides state-of-the-art classification accuracy with the use of few labels without the need of credit assignment and with minimal or no overfitting. We leverage tools from harmonic analysis by the use of the signature and log-signature, and use as a score function RMSE and MAE Signature and log-signature. We develop a closed-form equation to compute probably good optimal scale factors, as well as the formulation to obtain them by optimization. Techniques of Signal Processing are addressed to further characterize the problem. Classification is performed at the CPU level orders of magnitude faster than other methods. We report results on AFHQ, MNIST and CIFAR10, achieving 100% accuracy on all tasks assuming we can determine at test time which probably good optimal scale factor to use for each category.
<<<
翻译
984.
笑对人生
(2022-05-01 00:03):
#paper Transfer learning between preclinical models and human tumors identifies a conserved NK cell activation signature in anti-CTLA-4 responsive tumors, Genome Med. 2021 Aug 11;13(1):129. doi: 10.1186/s13073-021-00944-5.
目前,单细胞转录组测序在临床前动物实验研究应用较为普遍,然而,如何将这些新的发现迁移到人类的肿瘤单细胞转录组研究中,仍然是一大挑战。该研究利用机器学习中迁移学习方法,识别出在Anti-CTLA-4响应小鼠和人类肿瘤中共有的NK细胞状态特征,并发现该特征与患者更长的总生存期相关,能用于预测ICBs治疗疗效。最近,NK细胞的研究在CNS频繁“出镜”,可能NK细胞的过继细胞疗法在临床上取得较大进展有关,这也提示我们,相比于T细胞,NK细胞尚未有很大的研究空白,借助目前单细胞转录组测序技术,可能会找到一些有趣的新发现。
IF:10.400Q1
Genome medicine,
2021-08-11.
DOI: 10.1186/s13073-021-00944-5
PMID: 34376232
PMCID:PMC8356429
Abstract:
BACKGROUND: Tumor response to therapy is affected by both the cell types and the cell states present in the tumor microenvironment. This is true for many cancer treatments, including immune …
>>>
BACKGROUND: Tumor response to therapy is affected by both the cell types and the cell states present in the tumor microenvironment. This is true for many cancer treatments, including immune checkpoint inhibitors (ICIs). While it is well-established that ICIs promote T cell activation, their broader impact on other intratumoral immune cells is unclear; this information is needed to identify new mechanisms of action and improve ICI efficacy. Many preclinical studies have begun using single-cell analysis to delineate therapeutic responses in individual immune cell types within tumors. One major limitation to this approach is that therapeutic mechanisms identified in preclinical models have failed to fully translate to human disease, restraining efforts to improve ICI efficacy in translational research.METHOD: We previously developed a computational transfer learning approach called projectR to identify shared biology between independent high-throughput single-cell RNA-sequencing (scRNA-seq) datasets. In the present study, we test this algorithm's ability to identify conserved and clinically relevant transcriptional changes in complex tumor scRNA-seq data and expand its application to the comparison of scRNA-seq datasets with additional data types such as bulk RNA-seq and mass cytometry.RESULTS: We found a conserved signature of NK cell activation in anti-CTLA-4 responsive mouse and human tumors. In human metastatic melanoma, we found that the NK cell activation signature associates with longer overall survival and is predictive of anti-CTLA-4 (ipilimumab) response. Additional molecular approaches to confirm the computational findings demonstrated that human NK cells express CTLA-4 and bind anti-CTLA-4 antibodies independent of the antibody binding receptor (FcR) and that similar to T cells, CTLA-4 expression by NK cells is modified by cytokine-mediated and target cell-mediated NK cell activation.CONCLUSIONS: These data demonstrate a novel application of our transfer learning approach, which was able to identify cell state transitions conserved in preclinical models and human tumors. This approach can be adapted to explore many questions in cancer therapeutics, enhance translational research, and enable better understanding and treatment of disease.
<<<
翻译
985.
旺旺小小酥
(2022-05-01 00:02):
#paper 预期寿命、人力资本与提前退休行为[J].经济研究,2021,56(09):90-106.
作者根据计划生育政策、预期寿命延长、人力资本投资和内生退休决策构建OLG模型。本人从自己打工人的视角看这篇文章,作者深入讨论,寿命延长,咱们打工人能否提前退休根据个人成长性、家庭环境、工资增长程度等等因素综合下来:①年轻的时候一定要多存钱,多读书,特别是人力资本投入影响特别重要。 ②作者还谈到了生育政策的外生影响,主要就是家庭子女增加导致的养育成本上升,也会使得人在老的时候会不想动弹只想躺平
经济研究,
2021.
Abstract:
随着人均预期寿命的延长和受教育年限的提高,包括中国在内的世界各国劳动者的实际退休年龄却普遍低于法定退休年龄,呈现提前退休趋势。本文试图在生育受到约束的制度环境下对上述反常现象进行理论解释,通过构建一个动态一般均衡世代交叠(OLG)模型,考察预期寿命延长对劳动者的人力资本投资、退休年龄选择的影响机制,并结合中国的现实经济进行数值模拟。研究表明:当预期寿命提高时,人力资本投资能够在生命周期中获得更高的工资率回报,劳动者在少年期倾向于进行更多的人力资本投资,提高受教育年限;人力资本积累、有效工资率上升和利率下降引起的收入效应超过了替代效应,使得劳动者在老年期增加对闲暇的需求,有能力和意愿提早退休,减少终生劳动供给时间;进一步地,本文发现放松生育控制政策也会使劳动者享受更多闲暇时间的意愿增强,选择提前退休。本文还考察了个体退休行为的异质性,发现随着预期寿命的延长,相对穷人而言,富人的退休年龄和终生劳动供给更低。本文的结论有益于厘清劳动者在老年期的退休决策机理,为如何推进退休制度改革提供理论依据。
>>>
随着人均预期寿命的延长和受教育年限的提高,包括中国在内的世界各国劳动者的实际退休年龄却普遍低于法定退休年龄,呈现提前退休趋势。本文试图在生育受到约束的制度环境下对上述反常现象进行理论解释,通过构建一个动态一般均衡世代交叠(OLG)模型,考察预期寿命延长对劳动者的人力资本投资、退休年龄选择的影响机制,并结合中国的现实经济进行数值模拟。研究表明:当预期寿命提高时,人力资本投资能够在生命周期中获得更高的工资率回报,劳动者在少年期倾向于进行更多的人力资本投资,提高受教育年限;人力资本积累、有效工资率上升和利率下降引起的收入效应超过了替代效应,使得劳动者在老年期增加对闲暇的需求,有能力和意愿提早退休,减少终生劳动供给时间;进一步地,本文发现放松生育控制政策也会使劳动者享受更多闲暇时间的意愿增强,选择提前退休。本文还考察了个体退休行为的异质性,发现随着预期寿命的延长,相对穷人而言,富人的退休年龄和终生劳动供给更低。本文的结论有益于厘清劳动者在老年期的退休决策机理,为如何推进退休制度改革提供理论依据。
<<<
翻译
986.
masion
(2022-04-30 23:26):
#paper https://doi.org/10.1016/j.envint.2021.106438, Environment International 151 (2021) 。本文在全球范围内调查了38个涵盖来自淡水、海水和土壤不同栖息地生态系统中的3178个样本细菌和原生生物序列数据对,绘制了地球跨生态系统的微生物群落分布图。结果表明,原生生物和细菌的群落特征在栖息地之间和栖息地内具有强烈的相关性,而营养微生物群落结构在栖息地之间存在根本性差异。土壤中的微生物群最为异质和多样。原生生物群落主要由土壤中的捕食者和水生环境中的光养生物组成。这导致原生生物总数与细菌丰富度之比的变化,在海洋中最高,而捕食性原生生物与细菌之比在土壤中最高。海洋生境中捕食性原生生物的分类单元丰富度和相对丰富度与细菌丰富度呈正相关。这些联系在土壤中有所不同,在森林和草原土壤中,捕食性原生生物的丰富度和相对丰富度与细菌丰富度呈正相关,而在农业土壤中则没有。我们的结果表明,人为压力对较高营养水平的影响大于对较低营养水平的影响,从而导致微生物群中的营养结构解耦。这些结果表明,人为因素可能会对微生物群落的营养结构产生负面影响,特别是对高营养水平的影响,因此,人工生态系统中生态系统功能降低可能部分归因于营养复杂性的降低。
Abstract:
The colossal project of mapping the microbiome on Earth is rapidly advancing, with a focus on individual microbial groups. However, a global assessment of the associations between predatory protists and …
>>>
The colossal project of mapping the microbiome on Earth is rapidly advancing, with a focus on individual microbial groups. However, a global assessment of the associations between predatory protists and their bacterial prey is still missing at a cross-ecosystem level. This knowledge is critical to better understand the importance of top-down links in structuring microbiomes. Here, we examined 38 sequence-based datasets of paired bacterial and protistan taxa, covering 3,178 samples from diverse habitats including freshwater, marine and soils. We show that community profiles of protists and bacteria strongly correlated across and within habitats, with trophic microbiome structures fundamentally differing across habitats. Soils hosted the most heterogenous and diverse microbiomes. Protist communities were dominated by predators in soils and phototrophs in aquatic environments. This led to changes in the ratio of total protists to bacteria richness, which was highest in marine, while that of predatory protists to bacteria was highest in soils. Taxon richness and relative abundance of predatory protists positively correlated with bacterial richness in marine habitats. These links differed between soils, predatory protist richness and the relative abundance of predatory protists positively correlated with bacterial richness in forest and grassland soils, but not in agricultural soils. Our results suggested that anthropogenic pressure affects higher trophic levels more than lower ones leading to a decoupled trophic structure in microbiomes. Together, our cumulative overview of microbiome patterns of bacteria and protists at the global scale revealed major patterns and differences of the trophic structure of microbiomes across Earth's habitats, and show that anthropogenic factors might have negative effects on the trophic structure within microbiomes. Furthermore, the increased impact of anthropogenic factors on especially higher trophic levels suggests that often-observed reduced ecosystem functions in anthropogenic systems might be partly attributed to a reduction of trophic complexity.
<<<
翻译
987.
翁凯
(2022-04-30 23:23):
#paper DOI: 10.1126/science.1192788 science, 2011, How to Grow a Mind: Statistics, Structure, and Abstraction. 这是一篇综述,提出了在我看来比较可信的关于人脑如何学习的解释。人脑学习的一个特点是只需少量样本量(或者说数据很稀疏)就能学得很好,尤其是对因果关联的学习。作者认为学习效率高是因为用了抽象知识指导学习,并认为贝叶斯定理能很好地解释是如何用抽象知识指导学习的。而且贝叶斯方法可以有效利用多种形式的抽象知识,从而避免了传统方法需要穷举各种可能(一个个很长的数值向量)的需要。至于是如何从数据学到抽象知识的,比如是如何知道哪种形式是正确的,作者提到了各种形式(树、空间、环、次序……)都可以用graph表示,然后可以用分层贝叶斯模型来生成所需的graph,并且非参形式的分层贝叶斯模型自动蕴含了奥卡姆剃刀,只在数据需要时引入更多变量。不过,有些重要问题仍然没有被分层贝叶斯模型解决,比如学习到底是如何开始的?总得有什么作为基础吧?作者指出,有些贝叶斯建模者认为哪怕是最抽象的概念(比如因果关系的概念)原则上也是可以被学习的。作者还有一些讨论,比如什么Turing complete compositional representations,还有人脑具体如何实现贝叶斯算法,但目前不是我的兴趣(或者其实更是今晚我没有时间重新仔细看了……虽然2011年这篇文献出来的时候我就读过)。有兴趣的朋友可以直接找文献看。
Abstract:
In coming to understand the world-in learning concepts, acquiring language, and grasping causal relations-our minds make inferences that appear to go far beyond the data available. How do we do …
>>>
In coming to understand the world-in learning concepts, acquiring language, and grasping causal relations-our minds make inferences that appear to go far beyond the data available. How do we do it? This review describes recent approaches to reverse-engineering human learning and cognitive development and, in parallel, engineering more humanlike machine learning systems. Computational models that perform probabilistic inference over hierarchies of flexibly structured representations can address some of the deepest questions about the nature and origins of human thought: How does abstract knowledge guide learning and reasoning from sparse data? What forms does our knowledge take, across different domains and tasks? And how is that abstract knowledge itself acquired?
<<<
翻译
988.
小擎子
(2022-04-30 21:50):
#paper doi: 10.1126/science.aan4236 Nature, 2017, Gut microbiome modulates response to anti–PD-1 immunotherapy in melanoma patients。发现肠道微生物组可以调节黑色素瘤患者对检查点免疫疗法的反应。患者分为有应答组(R)和无应答组(NR),R组的特点是肠道微生物组有高多样性和丰度的Ruminococcaceae(瘤胃球菌科)/Faecalibacterium(粪杆菌),而NR组的肠道微生物有低多样性和高相对丰度的Bacteroidales(拟杆菌)。通过小鼠模型发现,肠道微生物组通过调节免疫细胞浸润影响了抗肿瘤免疫应答。
Abstract:
Preclinical mouse models suggest that the gut microbiome modulates tumor response to checkpoint blockade immunotherapy; however, this has not been well-characterized in human cancer patients. Here we examined the oral …
>>>
Preclinical mouse models suggest that the gut microbiome modulates tumor response to checkpoint blockade immunotherapy; however, this has not been well-characterized in human cancer patients. Here we examined the oral and gut microbiome of melanoma patients undergoing anti-programmed cell death 1 protein (PD-1) immunotherapy ( = 112). Significant differences were observed in the diversity and composition of the patient gut microbiome of responders versus nonresponders. Analysis of patient fecal microbiome samples ( = 43, 30 responders, 13 nonresponders) showed significantly higher alpha diversity ( < 0.01) and relative abundance of bacteria of the Ruminococcaceae family ( < 0.01) in responding patients. Metagenomic studies revealed functional differences in gut bacteria in responders, including enrichment of anabolic pathways. Immune profiling suggested enhanced systemic and antitumor immunity in responding patients with a favorable gut microbiome as well as in germ-free mice receiving fecal transplants from responding patients. Together, these data have important implications for the treatment of melanoma patients with immune checkpoint inhibitors.
<<<
翻译
989.
笑对人生
(2022-04-30 21:43):
#paper The single-cell transcriptional landscape of mammalian organogenesis Cao J, et al. Nature. 2019 Feb;566(7745):496-502. doi: 10.1038/s41586-019-0969-x
该文章是利用一种名为sci-RNA-seq3超高通量单细胞测序技术,该技术一次实验可完成大约200万个细胞转录组测序建库,单人完成的时间为1周,成本为每个细胞0.01美元。该研究主要是对小鼠不同发育阶段的61个胚胎(E9.5到E13.5)的单细胞转录图谱进行了描述,该图谱命名为MOCA(mouse organogenesis cell altas)。文章虽然不是很新,但这是monocle3首次在scRNAseq(单细胞转录组测序)的应用案例,提供monocle3详尽的基本原理和分析思路。从文章的作者列表来看,也发现有monocle3软件开发者的名字。monocle3是一款用于scRNAseq拟时序分析的工具,为monocle2更新版本。虽然,monocle2是目前已发表文章的应用较为广泛的一款版本,但是它在实际使用时存在一些问题,第一,monocle2使用的细胞降维方式与seurat(一款流行的,能独立完成从细胞-基因表达矩阵到细胞降维聚类分群的scRNAseq工具)并不兼容;第二,该版本已被作者弃用并停止维护,实际应用中发现一些bug,却难以找到解决的方案。在生信分析中,如何选择软件往往是一个难题(这可能也是很多评测文章出现的原因)。作为一名工具的使用者,可以在充分理解算法原理的基础上,结合自己的研究,并通过调试,最终做出适当的选择。
Abstract:
Mammalian organogenesis is a remarkable process. Within a short timeframe, the cells of the three germ layers transform into an embryo that includes most of the major internal and external …
>>>
Mammalian organogenesis is a remarkable process. Within a short timeframe, the cells of the three germ layers transform into an embryo that includes most of the major internal and external organs. Here we investigate the transcriptional dynamics of mouse organogenesis at single-cell resolution. Using single-cell combinatorial indexing, we profiled the transcriptomes of around 2 million cells derived from 61 embryos staged between 9.5 and 13.5 days of gestation, in a single experiment. The resulting 'mouse organogenesis cell atlas' (MOCA) provides a global view of developmental processes during this critical window. We use Monocle 3 to identify hundreds of cell types and 56 trajectories, many of which are detected only because of the depth of cellular coverage, and collectively define thousands of corresponding marker genes. We explore the dynamics of gene expression within cell types and trajectories over time, including focused analyses of the apical ectodermal ridge, limb mesenchyme and skeletal muscle.
<<<
翻译
990.
Vincent
(2022-04-30 21:26):
#paper https://doi.org/10.1038/s41467-020-17678-4 A deep learning model to predict RNA-Seq expression of tumours from whole slide images. Nature Comm (2020) 深度学习模型(CNN)在医学影像中有广泛的应用,最近也有研究指出可以通过病理图片来预测DNA突变和突变数,但是还没有研究关注过是否可以通过病理图片来预测基因表达,这篇文章填补了这部分空白。文章提出了一种基于多任务弱监督的深度学习模型 HE2RNA, 使用TCGA不同癌症类型数据(WSI + RNA-seq)进行训练,发现能准确预测基因的数量主要取决于训练数据集的大小,对这些被准确预测的基因进行富集分析,发现他们集中在免疫和T细胞调控,细胞周期,和癌症hallmark的通路上。最后文章还展现HE2RNA可以用于基因表达的空间可视化(预测基因在slide上表达)和提高MSI预测效果
IF:14.700Q1
Nature communications,
2020-08-03.
DOI: 10.1038/s41467-020-17678-4
PMID: 32747659
PMCID:PMC7400514
Abstract:
Deep learning methods for digital pathology analysis are an effective way to address multiple clinical questions, from diagnosis to prediction of treatment outcomes. These methods have also been used to …
>>>
Deep learning methods for digital pathology analysis are an effective way to address multiple clinical questions, from diagnosis to prediction of treatment outcomes. These methods have also been used to predict gene mutations from pathology images, but no comprehensive evaluation of their potential for extracting molecular features from histology slides has yet been performed. We show that HE2RNA, a model based on the integration of multiple data modes, can be trained to systematically predict RNA-Seq profiles from whole-slide images alone, without expert annotation. Through its interpretable design, HE2RNA provides virtual spatialization of gene expression, as validated by CD3- and CD20-staining on an independent dataset. The transcriptomic representation learned by HE2RNA can also be transferred on other datasets, even of small size, to increase prediction performance for specific molecular phenotypes. We illustrate the use of this approach in clinical diagnosis purposes such as the identification of tumors with microsatellite instability.
<<<
翻译
991.
Ricardo
(2022-04-30 21:13):
#paper DOI: 10.1109/WACV51458.2022.00162. Uncertainty Learning towards Unsupervised Deformable Medical Image Registration. WACV(2022) 这篇文章没啥新意,感觉有点灌水。总而言之,在前列腺MRI图像中的配准工作,加入了分割标签作为形变场的约束,同时提出了一种基于laplace分布的模型不确定度估计的方法。嗯,没了。
Abstract:
Uncertainty estimation in medical image registration enables surgeons to evaluate the operative risk based on the trustworthiness of the registered image data thus of paramount importance for practical clinical applications. …
>>>
Uncertainty estimation in medical image registration enables surgeons to evaluate the operative risk based on the trustworthiness of the registered image data thus of paramount importance for practical clinical applications. Despite the recent promising results obtained with deep unsupervised learning-based registration methods, reasoning about uncertainty of unsupervised registration models remains largely unexplored. In this work, we propose a predictive module to learn the registration and uncertainty in correspondence simultaneously. Our framework introduces empirical randomness and registration error based uncertainty prediction. We systematically assess the performances on two MRI datasets with different ensemble paradigms. Experimental results highlight that our proposed framework significantly improves the registration accuracy and uncertainty compared with the baseline.
<<<
翻译
992.
Ricardo
(2022-04-30 21:07):
#paper https://doi.org/10.1016/j.media.2020.101939 Image registration: Maximum likelihood, minimum entropy and deep learning. MIA(2021) 作者在这篇文章里给pair-wise和group-wise的配准任务提出了一个基于maximum profile likelihood (MPL)的理论框架,并利用渐进分析方法证明了基于MPL的配准过程实际上是最小化生成联合图像数据分布的联合熵(minimizes an upper bound on the joint entropy of the distribution that generates the joint image data)。通过优化闭合形式的profile likelihood,作者推导出了groupwise配准的congealing 方法。这篇文章很多看不懂的地方,后面还得慢慢读。
IF:10.700Q1
Medical image analysis,
2021-04.
DOI: 10.1016/j.media.2020.101939
PMID: 33388458
PMCID:PMC8046343
Abstract:
In this work, we propose a theoretical framework based on maximum profile likelihood for pairwise and groupwise registration. By an asymptotic analysis, we demonstrate that maximum profile likelihood registration minimizes …
>>>
In this work, we propose a theoretical framework based on maximum profile likelihood for pairwise and groupwise registration. By an asymptotic analysis, we demonstrate that maximum profile likelihood registration minimizes an upper bound on the joint entropy of the distribution that generates the joint image data. Further, we derive the congealing method for groupwise registration by optimizing the profile likelihood in closed form, and using coordinate ascent, or iterative model refinement. We also describe a method for feature based registration in the same framework and demonstrate it on groupwise tractographic registration. In the second part of the article, we propose an approach to deep metric registration that implements maximum likelihood registration using deep discriminative classifiers. We show further that this approach can be used for maximum profile likelihood registration to discharge the need for well-registered training data, using iterative model refinement. We demonstrate that the method succeeds on a challenging registration problem where the standard mutual information approach does not perform well.
<<<
翻译
993.
Ricardo
(2022-04-30 20:52):
#paper https://doi.org/10.1016/j.media.2021.102292 Deformable MR-CT image registration using an unsupervised, dual-channel network for neurosurgical guidance. MIA(2022) 微创颅内神经外科手术的精度可能会受到变形的脑组织结构的影响,例如,在神经内窥镜路径中,由于脑脊液的流出导致脑组织变形达10毫米。这篇文章提出了一种基于深度学习的无监督配准方法,用于术前MR和术中CT之间的配准。MR和CT之间的配准属于跨模态配准问题,由于难以衡量不同模态图像之间的相似性, 跨模态配准问题一直以来都比较难做。这篇文章的主要思路就是利用cyclegan将不同模态的图像转换成同模态图像,从而进行模态内的配准。另一方面,与其使用determistic cyclegan, 作者使用了probabilitic cyclegan,这样就可以输出模型对于预测的形变场的不准确度的估计,这种不准确度的估计可以进一步拿来作为形变场的约束。
IF:10.700Q1
Medical image analysis,
2022-01.
DOI: 10.1016/j.media.2021.102292
PMID: 34784539
PMCID:PMC10229200
Abstract:
PURPOSE: The accuracy of minimally invasive, intracranial neurosurgery can be challenged by deformation of brain tissue - e.g., up to 10 mm due to egress of cerebrospinal fluid during neuroendoscopic …
>>>
PURPOSE: The accuracy of minimally invasive, intracranial neurosurgery can be challenged by deformation of brain tissue - e.g., up to 10 mm due to egress of cerebrospinal fluid during neuroendoscopic approach. We report an unsupervised, deep learning-based registration framework to resolve such deformations between preoperative MR and intraoperative CT with fast runtime for neurosurgical guidance.METHOD: The framework incorporates subnetworks for MR and CT image synthesis with a dual-channel registration subnetwork (with synthesis uncertainty providing spatially varying weights on the dual-channel loss) to estimate a diffeomorphic deformation field from both the MR and CT channels. An end-to-end training is proposed that jointly optimizes both the synthesis and registration subnetworks. The proposed framework was investigated using three datasets: (1) paired MR/CT with simulated deformations; (2) paired MR/CT with real deformations; and (3) a neurosurgery dataset with real deformation. Two state-of-the-art methods (Symmetric Normalization and VoxelMorph) were implemented as a basis of comparison, and variations in the proposed dual-channel network were investigated, including single-channel registration, fusion without uncertainty weighting, and conventional sequential training of the synthesis and registration subnetworks.RESULTS: The proposed method achieved: (1) Dice coefficient = 0.82±0.07 and TRE = 1.2 ± 0.6 mm on paired MR/CT with simulated deformations; (2) Dice coefficient = 0.83 ± 0.07 and TRE = 1.4 ± 0.7 mm on paired MR/CT with real deformations; and (3) Dice = 0.79 ± 0.13 and TRE = 1.6 ± 1.0 mm on the neurosurgery dataset with real deformations. The dual-channel registration with uncertainty weighting demonstrated superior performance (e.g., TRE = 1.2 ± 0.6 mm) compared to single-channel registration (TRE = 1.6 ± 1.0 mm, p < 0.05 for CT channel and TRE = 1.3 ± 0.7 mm for MR channel) and dual-channel registration without uncertainty weighting (TRE = 1.4 ± 0.8 mm, p < 0.05). End-to-end training of the synthesis and registration subnetworks also improved performance compared to the conventional sequential training strategy (TRE = 1.3 ± 0.6 mm). Registration runtime with the proposed network was ∼3 s.CONCLUSION: The deformable registration framework based on dual-channel MR/CT registration with spatially varying weights and end-to-end training achieved geometric accuracy and runtime that was superior to state-of-the-art baseline methods and various ablations of the proposed network. The accuracy and runtime of the method may be compatible with the requirements of high-precision neurosurgery.
<<<
翻译
994.
Ricardo
(2022-04-30 20:39):
#paper https://doi.org/10.48550/arXiv.1806.09055 DARTS: differentiable architecture search ICLR(2019) Neural Architectural Search (NAS) 这个问题是出了名的消耗算力,动不动就需要消耗上千个gpu hour,基本也只能在顶级的研究机构做这类研究。这篇文章没有使用类似于进化算法或者强化学习这样的方法在离散和不可微的空间中搜索网络架构, 而是通过对神经网络的架构表征进行松弛,将NAS问题转化为一个可微分的形式,从而能够使用梯度下降法在连续空间中搜索神经网络架构。作者将这个问题建模成一个bilevel的优化问题,然后提出了一个类似于EM算法的优化方法,通过交替优化模型架构参数\alpha和模型权重w来找到较优的模型架构\alpha 。由于优化过程中涉及二阶导的计算,作者进一步对二阶导的计算做了松弛,将其转化为形式为一阶导的估计,从而进一步降低了方法的复杂度。结果也都很漂亮,相比于之前那些动辄需要上千个gpu day的计算量,darts方法只需要几个gpu day的计算,而且也能达到差不多的效果。
arXiv,
2019.
DOI: 10.48550/arXiv.1806.09055
Abstract:
This paper addresses the scalability challenge of architecture search by formulating the task in a differentiable manner. Unlike conventional approaches of applying evolution or reinforcement learning over a discrete and …
>>>
This paper addresses the scalability challenge of architecture search by formulating the task in a differentiable manner. Unlike conventional approaches of applying evolution or reinforcement learning over a discrete and non-differentiable search space, our method is based on the continuous relaxation of the architecture representation, allowing efficient search of the architecture using gradient descent. Extensive experiments on CIFAR-10, ImageNet, Penn Treebank and WikiText-2 show that our algorithm excels in discovering high-performance convolutional architectures for image classification and recurrent architectures for language modeling, while being orders of magnitude faster than state-of-the-art non-differentiable techniques. Our implementation has been made publicly available to facilitate further research on efficient architecture search algorithms.
<<<
翻译
995.
张贝
(2022-04-30 20:39):
#paper DOI: 10.1038/s41580-021-00407-0 Nat Rev Mol Cell Biol.,2021,A guide to machine learning for biologists. 近几十年来,随着生物数据集规模与复杂性的大幅增长,机器学习越来越多的用于为潜在生物过程构建信息与预测模型。然而具体的机器学习方法多种多样,令人眼花缭乱。对于不同类型的生物数据,该如何选择特定的机器学习技术?本文是一篇2021年发表在Nature Reviews Molecular Cell Biology 上的综述文章,向读者简要介绍了一些关键的机器学习技术:既包括分类、回归、聚类模型等传统机器学习方法,也包括最近开发和广泛使用的涉及深度神经网络的技术。本文描述了不同的技术如何适用于特定类型的生物学数据,并指出着手进行涉及机器学习的实验时需要考虑的要点。最后,本文还讨论了一些机器学习研究的新方向。
Abstract:
The expanding scale and inherent complexity of biological data have encouraged a growing use of machine learning in biology to build informative and predictive models of the underlying biological processes. …
>>>
The expanding scale and inherent complexity of biological data have encouraged a growing use of machine learning in biology to build informative and predictive models of the underlying biological processes. All machine learning techniques fit models to data; however, the specific methods are quite varied and can at first glance seem bewildering. In this Review, we aim to provide readers with a gentle introduction to a few key machine learning techniques, including the most recently developed and widely used techniques involving deep neural networks. We describe how different techniques may be suited to specific types of biological data, and also discuss some best practices and points to consider when one is embarking on experiments involving machine learning. Some emerging directions in machine learning methodology are also discussed.
<<<
翻译
996.
颜林林
(2022-04-30 18:41):
#paper doi:10.1016/j.ccell.2022.04.002 Cancer Cell, 2022, The translational challenges of precision oncology. 这是一篇新近发表在Cancer Cell上的关于精准肿瘤学(precision oncology)的综述。所谓精准肿瘤学,是指基于肿瘤分子特征进行肿瘤诊治决策。这篇综述回顾了与肿瘤分子特征相关的研究历史和当前研究进展,从肿瘤发生、肿瘤预防、早期检测、新辅助治疗、微小病变残留监测、药物耐受、肿瘤演化过程、肿瘤转移等诊治不同阶段环节,讨论了相应重要分子特征的发现及应用。本文对于目前在肿瘤基因检测行业中涉及到的各类应用,包括涉及的临床队列研究和相关资源,都有提及,整体上内容全面、逻辑脉络清晰。比较适合初学者,快速了解这个方向的产业应用和临床应用,并强烈建议可追溯其参考文献,对各个具体应用场景,进行深入探索和学习。
Abstract:
The translational challenges in the field of precision oncology are in part related to the biological complexity and diversity of this disease. Technological advances in genomics have facilitated large sequencing …
>>>
The translational challenges in the field of precision oncology are in part related to the biological complexity and diversity of this disease. Technological advances in genomics have facilitated large sequencing efforts and discoveries that have further supported this notion. In this review, we reflect on the impact of these discoveries on our understanding of several concepts: cancer initiation, cancer prevention, early detection, adjuvant therapy and minimal residual disease monitoring, cancer drug resistance, and cancer evolution in metastasis. We discuss key areas of focus for improving cancer outcomes, from biological insights to clinical application, and suggest where the development of these technologies will lead us. Finally, we discuss practical challenges to the wider adoption of molecular profiling in the clinic and the need for robust translational infrastructure.
<<<
翻译
997.
June
(2022-04-30 14:29):
#paper https://doi.org/10.1038/s41392-022-00936-w该研究通过将 RT 与核酸酶野生型 Cas9 (WT-PE) 融合来进行大规模基因组操作,设计了一种新的先导编辑系统。与传统的先导编辑器(PE2)不同,这种新系统同时在目标位点引入了一个 DSB 和一个 3' 延伸的瓣,然后通过内源机制将它们整合到基因组中。当它与配对的 pegRNA 结合时,WT-PE 实现了高效的大规模基因组编辑,包括大片段缺失和染色体易位。因此, WT-PE 系统可能有助于建模或治疗与大片段畸变相关的疾病。
IF:40.800Q1
Signal transduction and targeted therapy,
2022-04-20.
DOI: 10.1038/s41392-022-00936-w
PMID: 35440051
PMCID:PMC9018734
Abstract:
Large scale genomic aberrations including duplication, deletion, translocation, and other structural changes are the cause of a subtype of hereditary genetic disorders and contribute to onset or progress of cancer. …
>>>
Large scale genomic aberrations including duplication, deletion, translocation, and other structural changes are the cause of a subtype of hereditary genetic disorders and contribute to onset or progress of cancer. The current prime editor, PE2, consisting of Cas9-nickase and reverse transcriptase enables efficient editing of genomic deletion and insertion, however, at small scale. Here, we designed a novel prime editor by fusing reverse transcriptase (RT) to nuclease wild-type Cas9 (WT-PE) to edit large genomic fragment. WT-PE system simultaneously introduced a double strand break (DSB) and a single 3' extended flap in the target site. Coupled with paired prime editing guide RNAs (pegRNAs) that have complementary sequences in their 3' terminus while target different genomic regions, WT-PE produced bi-directional prime editing, which enabled efficient and versatile large-scale genome editing, including large fragment deletion up to 16.8 megabase (Mb) pairs and chromosomal translocation. Therefore, our WT-PE system has great potential to model or treat diseases related to large-fragment aberrations.
<<<
翻译
998.
大象城南
(2022-04-30 14:19):
#paper https://doi.org/10.1002/hbm.25739 推测为血管源性的脑白质高信号(WMH)常在健康老年人群的MRI上有发现。WMH还与衰老和认知能力下降有关。本文使用包含认知健康老年人MRI数据的纵向数据集(基线N=231人,年龄范围在64~87岁之间),比较并验证了FreeSurfer (T1w)、UBO Detector (T1W + FLAIR)和FSL-BIANCA(T1w+FLAIR)三种脑白质高信号提取的算法的有效性。作为参考,我们在T1w、3D (3D) FLAIR和二维(2D) FLAIR图像中手动分割WMH,并用于评估不同自动化算法的分割精度。此外,我们评估了算法提供的WMH体积与Fazekas评分和年龄的关系。FreeSurfer低估了WMH的体积,其骰子相似系数最差(DSC = 0.434),但其WMH的体积与Fazekas得分有很强的相关性(rs = 0.73)。BIANCA在3D FLAIR图像中实现了最高DSC(0.602)。然而,在2D FLAIR图像中(rs = 0.41),与Fazekas得分的关系仅为中等,在探索人体内轨迹时检测到许多异常值WMH体积(2D FLAIR: ~30%)。UBO Detector在DSC中与BIANCA在两种模式下的表现相似,在2D FLAIR(0.531)中达到了最佳DSC,无需定制训练数据集。此外,它与Fazekas评分有很高的相关性(2D FLAIR: rs = 0.80)。总之,我们的结果强调了仔细考虑选择的WMH分割算法和mr模态的重要性。
Abstract:
White matter hyperintensities (WMH) of presumed vascular origin are frequently found in MRIs of healthy older adults. WMH are also associated with aging and cognitive decline. Here, we compared and …
>>>
White matter hyperintensities (WMH) of presumed vascular origin are frequently found in MRIs of healthy older adults. WMH are also associated with aging and cognitive decline. Here, we compared and validated three algorithms for WMH extraction: FreeSurfer (T1w), UBO Detector (T1w + FLAIR), and FSL's Brain Intensity AbNormality Classification Algorithm (BIANCA; T1w + FLAIR) using a longitudinal dataset comprising MRI data of cognitively healthy older adults (baseline N = 231, age range 64-87 years). As reference we manually segmented WMH in T1w, three-dimensional (3D) FLAIR, and two-dimensional (2D) FLAIR images which were used to assess the segmentation accuracy of the different automated algorithms. Further, we assessed the relationships of WMH volumes provided by the algorithms with Fazekas scores and age. FreeSurfer underestimated the WMH volumes and scored worst in Dice Similarity Coefficient (DSC = 0.434) but its WMH volumes strongly correlated with the Fazekas scores (r = 0.73). BIANCA accomplished the highest DSC (0.602) in 3D FLAIR images. However, the relations with the Fazekas scores were only moderate, especially in the 2D FLAIR images (r = 0.41), and many outlier WMH volumes were detected when exploring within-person trajectories (2D FLAIR: ~30%). UBO Detector performed similarly to BIANCA in DSC with both modalities and reached the best DSC in 2D FLAIR (0.531) without requiring a tailored training dataset. In addition, it achieved very high associations with the Fazekas scores (2D FLAIR: r = 0.80). In summary, our results emphasize the importance of carefully contemplating the choice of the WMH segmentation algorithm and MR-modality.
<<<
翻译
999.
吴增丁
(2022-04-29 16:29):
#paper https://doi.org/10.1038/s43018-022-00356-3
Cell type and gene expression deconvolution with BayesPrism enables Bayesian integrative analysis across bulk and single-cell RNA sequencing in oncology (2022)
最近为了能从bulkRNAseq数据中分析出肿瘤一致性,所以在找一款比较好用的cellar deconvolution的软件。这方面引用量最好的是CIBERSORT及CIBERSORTx,但是这两款软件存在显示的缺点是只能online分析,不能本地化部署。看到前几天(2022年4月25日)刚在Nature Cancer上发表的BayesPrism,它可以本地化部署且提供的源码,赶紧读一读且拿来了试用。
该软件采用了贝叶斯统计模型,利用已经对cell type/ cell states注释过的single cell data作为 Reference,实现了从bulk RNAseq中推断出不同肿瘤细胞的组成及比例,而且还估计除了不同cell type的gene expression。而且从文章自己展示的性能看,已经超过了CIBERSORT/CIBERSORTx/Bisque/MUSiC 了,并且在肿瘤细胞10%以上的样本中,得到的表达谱和真实表达谱相关性大于0.9。
Abstract:
Inferring single-cell compositions and their contributions to global gene expression changes from bulk RNA sequencing (RNA-seq) datasets is a major challenge in oncology. Here we develop Bayesian cell proportion reconstruction …
>>>
Inferring single-cell compositions and their contributions to global gene expression changes from bulk RNA sequencing (RNA-seq) datasets is a major challenge in oncology. Here we develop Bayesian cell proportion reconstruction inferred using statistical marginalization (BayesPrism), a Bayesian method to predict cellular composition and gene expression in individual cell types from bulk RNA-seq, using patient-derived, scRNA-seq as prior information. We conduct integrative analyses in primary glioblastoma, head and neck squamous cell carcinoma and skin cutaneous melanoma to correlate cell type composition with clinical outcomes across tumor types, and explore spatial heterogeneity in malignant and nonmalignant cell states. We refine current cancer subtypes using gene expression annotation after exclusion of confounding nonmalignant cells. Finally, we identify genes whose expression in malignant cells correlates with macrophage infiltration, T cells, fibroblasts and endothelial cells across multiple tumor types. Our work introduces a new lens to accurately infer cellular composition and expression in large cohorts of bulk RNA-seq data.
<<<
翻译
1000.
尹志
(2022-04-28 22:10):
#paper https://doi.org/10.48550/arXiv.1503.03585 Deep Unsupervised Learning using Nonequilibrium Thermodynamics ICML (2015). 这是一篇还没完全看懂的论文,但是非常有意思。说起这篇文章的扩散模型大家一不定熟悉,但是提到最近大火的openai的工作dall-e 2,大家可能会更熟悉一点。对,Dall-E 2最早的启发就是这篇文章。本文受非平衡热力学的启发,设计了一个称之为扩散模型(diffusion model)的生成模型。我们知道,在机器学习中,对一堆数据的分布进行估计是一个极具挑战的事情。特别是要兼顾模型的灵活性(flexible)和过程的可解性(tractable)。如果把建模隐变量z到观测量x的映射作为任务,那么扩散模型的想法是,
假设整个映射是一个马尔科夫链(MC),然后数据的初始状态是由一步步不断添加高斯噪声,最终获得某种最终形态,那么反过来,可以将去噪的过程看做是生成的过程。我们针对这个MC过程进行训练,那么逆过程则可以作为生成模型生成符合分布的数据。是的,很像VAE。考虑到这类生成模型通过不断的改进,已经达到Dall-E 2的效果,值得我们深入理解背后的机制,以及是否可以在数据合成上产生更好的效果。
arXiv,
2015.
DOI: 10.48550/arXiv.1503.03585
Abstract:
A central problem in machine learning involves modeling complex data-sets using highly flexible families of probability distributions in which learning, sampling, inference, and evaluation are still analytically or computationally tractable. …
>>>
A central problem in machine learning involves modeling complex data-sets using highly flexible families of probability distributions in which learning, sampling, inference, and evaluation are still analytically or computationally tractable. Here, we develop an approach that simultaneously achieves both flexibility and tractability. The essential idea, inspired by non-equilibrium statistical physics, is to systematically and slowly destroy structure in a data distribution through an iterative forward diffusion process. We then learn a reverse diffusion process that restores structure in data, yielding a highly flexible and tractable generative model of the data. This approach allows us to rapidly learn, sample from, and evaluate probabilities in deep generative models with thousands of layers or time steps, as well as to compute conditional and posterior probabilities under the learned model. We additionally release an open source reference implementation of the algorithm.
<<<
翻译