当前共找到 1480 篇文献分享,本页显示第 41 - 60 篇。
41.
林海onrush (2026-02-17 00:50):
#paper, Coherent Ising machine based on polarization symmetry breaking in a driven Kerr resonator, DOI: 10.1038/ s41467-026-68794-6, 这篇论文提出并实验演示了一种新的光学相干伊辛机:不像传统 DOPO 型 CIM 那样用“相位”来编码自旋,而利用受驱动光纤 Kerr 环形谐振腔中的偏振自发对称性破缺来产生二元自旋,并用强度就能读出状态,从而显著降低相位稳定与本振同相探测带来的工程复杂度。具体机制是:外部泵浦沿腔的一个主偏振模 (E_1) 驱动,当扫频接近倾斜共振峰时,另一正交模 (E_2) 通过四波混频被参量产生并出现相差的两种简并相位态;再借助腔内一个“局域双折射缺陷,使 (E_2) 的相位在每个往返时间 (t_R) 后在两态之间强制交替,从而在混合偏振基 (E_pm=(E_1± iE_2)/sqrt2) 上表现为 (I_max) 与 (I_min) 的交替序列。因为初始相位由噪声随机选择,这就自然形成两种无偏的二元序列,对应自旋 (+1/-1),且只需对某一路 ( |E_+|^2 ) 做强度测量即可判别(文中图 1 给出了相位-强度映射与“跨两次往返观察时呈现 pitchfork 分岔”的解释)。更关键的是,作者利用此前发现的拓扑对称性保护工作区:该局域缺陷会把系统锁定到一个吸引子上,能在存在不完美 pi 相移或驱动偏振失配时仍消除漂移与偏置,从而实现长时间稳定、无需后选择的伊辛试验。 在实验上,他们用 57 m 标准单模光纤搭建环形腔(往返时间 273 ns、finesse≈42),以 1552 nm、4.69 GHz 重复率的 5 ps 脉冲列同步驱动,实现时间复用的自旋网络,自旋数 (N) 可通过脉冲挑选设置。耦合采用“测量-反馈”方式:读出每个脉冲在 ( |E_+|^2 ) 上的强度后,通过在驱动端给 (E_2) 分量施加相位调制来实现 J_ij 耦合。作为概念验证,作者实现了最近邻反铁磁的一维链:展示了 64 自旋单次运行中,随扫频跨过 SSB 点,自旋从噪声涨落逐步定态、网络伊辛能量下降并接近低能态;统计上 1500 次试验的末态能量分布与数值模拟高度一致,且在较弱耦合下分布在超过一小时的连续运行中保持稳定,期间无需人工调整、也不丢弃任何试验。进一步以“99% 命中基态所需时间”定义 time-to-solution,发现存在随规模变化的最优退火时间,并在 (N=10) 到 (100) 的链上测得的 time-to-solution 与 exp(sqrt{N}) 的标度更一致,暗示该平台具备扩展到更大问题规模的潜力;讨论部分也指出可通过调节腔 finesse 在速度、功耗与解质量间权衡,并且未来可借助 FPGA 等成熟方案扩展到更高连通度甚至全连接。
Liam Quinn, Yiqing Xu, Julien Fatome, Gian-Luca Oppo, Stuart G. Murdoch, Miro Erkintalo, Stéphane Coen
Abstract:
Abstract
Time-multiplexed networks of degenerate optical parametric oscillators have demonstrated remarkable success in simulating coupled Ising spins, thus providing a promising route to solving complex combinatorial optimization problems. In these systems, referred to as coherent Ising machines, spins are encoded in the oscillator phases, and measured at the system output using phase-sensitive techniques, making intricate phase stabilization necessary. Here, we introduce an optical Ising machine based on spontaneous polarization symmetry breaking in a coherently driven fiber Kerr nonlin… >>>
Abstract<br> Time-multiplexed networks of degenerate optical parametric oscillators have demonstrated remarkable success in simulating coupled Ising spins, thus providing a promising route to solving complex combinatorial optimization problems. In these systems, referred to as coherent Ising machines, spins are encoded in the oscillator phases, and measured at the system output using phase-sensitive techniques, making intricate phase stabilization necessary. Here, we introduce an optical Ising machine based on spontaneous polarization symmetry breaking in a coherently driven fiber Kerr nonlinear resonator. In our architecture, the spins are encoded in the polarization state, allowing robust, all-intensity readout with off-the-shelf telecom components. By operating in a newly-discovered regime where nonlinearity and topology lock the system’s symmetry, we eliminate drift and bias, enabling uninterrupted Ising trials at optical speeds for over an hour, without manual intervention. This all-fiber platform not only simplifies the hardware but also opens a path to more stable, high-throughput coherent optical optimization devices for applications from finance to drug design and beyond. <<<
42.
龙海晨 (2026-02-16 19:08):
#paper Chen Y, Gu Y, Hu Z, Sun X. Sample-specific perturbation of gene interactions identifies breast cancer subtypes. Brief Bioinform. 2021 Jul 20;22(4):bbaa268. doi: 10.1093/bib/bbaa268. PMID: 33126248; PMCID: PMC8293822.这是一篇通过生物信息学研究乳腺癌的文章,乳腺癌是一种高度异质性的疾病,根据基因表达谱对乳腺癌进行多种形式的分类。基因表达谱是变量,如果在不同时间点或不同条件下测量,可能会显示差异。相比之下,生物网络随着时间的推移和在不同的条件下相对稳定。在这项研究中从一个新的角度使用基因相互作用网络,根据相对基因表达值测量的个体特异性边缘扰动来探索乳腺癌的亚型。研究表明,根据个体水平上的基因相互作用扰动,乳腺癌有四种亚型。基于网络的乳腺癌新亚型在预后、体细胞突变、表型变化和丰富的通路方面表现出很强的异质性。基于网络的亚型与PAM50亚型和免疫组化指数密切相关。这项工作有助于我们从网络角度更好地理解乳腺癌的异质性和机制。
Yuanyuan Chen, Yu Gu, Zixi Hu, Xiao Sun
Abstract:
AbstractBreast cancer is a highly heterogeneous disease, and there are many forms of categorization for breast cancer based on gene expression profiles. Gene expression profiles are variables and may show differences if measured at different time points or under different conditions. In contrast, biological networks are relatively stable over time and under different conditions. In this study, we used a gene interaction network from a new point of view to explore the subtypes of breast cancer based on individual-specific edge perturbations measured by relative gene expression value. Our study… >>>
AbstractBreast cancer is a highly heterogeneous disease, and there are many forms of categorization for breast cancer based on gene expression profiles. Gene expression profiles are variables and may show differences if measured at different time points or under different conditions. In contrast, biological networks are relatively stable over time and under different conditions. In this study, we used a gene interaction network from a new point of view to explore the subtypes of breast cancer based on individual-specific edge perturbations measured by relative gene expression value. Our study reveals that there are four breast cancer subtypes based on gene interaction perturbations at the individual level. The new network-based subtypes of breast cancer show strong heterogeneity in prognosis, somatic mutations, phenotypic changes and enriched pathways. The network-based subtypes are closely related to the PAM50 subtypes and immunohistochemistry index. This work helps us to better understand the heterogeneity and mechanisms of breast cancer from a network perspective. <<<
43.
ZĒNG Yíngzhū (Zoo) 曾莹珠 (2026-02-12 14:35):
#paper doi:10.1080/20008066 European Journal of Psychotraumatology, 2026, Improving post-traumatic stress symptoms in homeless-experienced women using narrative exposure therapy: a single-arm, open pilot study. 较高的NET干预完成率,加上PTSD和共病症状在前后的显著变化,表明NET可以作为解决HEW中严重PTSD症状的可行方案。未来有望将NET转化到其他类似的经历复杂和持续创伤暴露的创伤高敏感人群。
44.
DeDe宝 (2026-02-05 02:37):
#paper, https://doi.org/10.1371/journal.pcbi.1013879 Information uncertainty influences learning strategy from sequentially delayed rewards. Plos Computational Biology. 本研究探索信息不确定性对延时奖励学习策略的影响。奖励延迟出现时,人类如何将奖励与先前事件关联?研究者操纵了奖励信息的不确定性:分离条件(即时奖励和延迟奖励的信息则分别呈现)和整合条件(奖励以即时奖励和延迟奖励的总和形式呈现),以探索信息不确定性对学习策略的影响。研究主要比较了三种模型:回顾模型(Elg,基于时间序列更新先前选择的价值)、前瞻模型(Tab,仅系统更新与奖励相关的过往选择)和混合模型(Hybrid,通过β参数调整两个模型的权重)。行为数据分析表明,被试能够掌握延迟奖励的关联规则,且低信息不确定性有助于促进学习表现,初始的低不确定性环境的学习体验会形成 “认知启动”,持续影响后续高不确定性环境中的策略使用。模型比较结果表明,两种模型都能捕捉被试核心行为特征,低信息不确定性时,前瞻模型更能解释被试行为;高不确定性时,回顾模型成为有效补充。上述结果表明人类会根据信息不确定性灵活切换混合学习策略。
Sean R. Maulhardt, Alec Solway, Caroline J. Charpentier
Abstract:
When receiving a reward after a sequence of multiple events, how do we determine which event caused the reward? This problem, known as temporal credit assignment, can be difficult for humans to solve given the temporal uncertainty in the environment. Research to date has attempted to isolate dimensions of delay and reward during decision-making, but algorithmic solutions to temporal learning problems and the effect of uncertainty on learning remain underexplored. To further our understanding, we adapted a reward learning task that creates a temporal credit assignment problem by combining sequ… >>>
When receiving a reward after a sequence of multiple events, how do we determine which event caused the reward? This problem, known as temporal credit assignment, can be difficult for humans to solve given the temporal uncertainty in the environment. Research to date has attempted to isolate dimensions of delay and reward during decision-making, but algorithmic solutions to temporal learning problems and the effect of uncertainty on learning remain underexplored. To further our understanding, we adapted a reward learning task that creates a temporal credit assignment problem by combining sequentially delayed rewards, intervening events, and varying uncertainty via the amount of information presented during feedback. Using computational modeling, two learning strategies were developed: an eligibility trace, whereby previously selected actions are updated as a function of the temporal sequence, and a tabular update, whereby only systematically related past actions (rather than unrelated intervening events) are updated. We hypothesized that reduced information uncertainty would correlate with increased use of the tabular strategy, given the model’s capacity to incorporate additional feedback information. Both models effectively learned the task, and predicted choices made by participants (N = 142) as well as specific behavioral signatures of credit assignment. Consistent with our hypothesis, the tabular model outperformed the eligibility model under low information uncertainty, as evidenced by more accurate predictions of participants’ behavior and an increase in tabular weight. These findings provide new insights into the mechanisms implemented by humans to solve temporal credit assignment and adapt their strategy in varying environments. <<<
45.
刘昊辰 (2026-02-02 09:27):
#paper Particle Builder A Board Game for the Teaching of the Standard Model of Particle Physics at a Secondary Level.《Particle Builder》是一款于2016年由国际物理教师团队研发的桌游,后推出浏览器在线版本(支持与基础AI对战),专为高中阶段教学设计,通过7个难度递增的关卡,以互动gameplay传授粒子物理学标准模型的核心知识(如夸克、轻子、反物质等),经281名澳大利亚高中生测试,225人完成前后测,平均学习增益达0.16,媲美1.5周(约7小时)传统教学效果,且94%的学生认为其比常规科学课更有趣,88%认为更具参与感,物理版和在线版均免费向教师开放。下载地址:https://arxiv.org/pdf/2511.21116
arXiv, 2025-11-26T07:02:18Z. DOI: 10.48550/arXiv.2511.21116
Lachlan McGinness, Yutong Ma, Mohammad Attar, Andrew Carse, Yeming Chen, Thomas Green, Jeong-Yeon Ha, Yanbai Jin, Amy McWilliams, Theirry Panggabean ... >>>
Lachlan McGinness, Yutong Ma, Mohammad Attar, Andrew Carse, Yeming Chen, Thomas Green, Jeong-Yeon Ha, Yanbai Jin, Amy McWilliams, Theirry Panggabean, Zhengyu Peng, Jing Ru, Jiacheng She, Lujin Sun, Jialin Wang, Zilun Wei, Jiayuan Zhu <<<
Abstract:
We present Particle Builder, an online board game which teaches students about concepts from the Standard Model of Particle Physics at a high school level. This short activity resulted in a gain of 0.16, indicating that students learned a significant amount of particle physics knowledge. Students found the activity was more engaging and less difficult than a normal classroom lesson.
46.
林海onrush (2026-01-31 23:55):
#paper,DOI: arXiv:2406.03816,ReST-MCTS: LLM Self-Training via Process Reward Guided Tree Search,本文提出ReST-MCTS,一种将过程奖励(Process Reward)与改进的蒙特卡洛树搜索(MCTS)相结合的大语言模型自训练框架,旨在解决现有自训练方法仅依赖最终正确答案、却容易引入低质量中间推理的问题。该方法在仅已知最终正确答案的情况下,通过树搜索中的多次 rollout 自动推断每一步中间推理对通向正确解的贡献概率,从而生成高质量的过程奖励信号,用于同时训练策略模型和过程奖励模型。实验结果表明,在相同搜索预算下,ReST-MCTS*在推理准确率上优于 Best-of-N、Tree-of-Thought 等方法,并在多轮自训练中持续提升模型性能,显著超过 ReSTEM、Self-Rewarding 等已有自训练范式,验证了其在高质量推理轨迹获取和稳定自提升方面的有效性
arXiv, 2024-06-06T07:40:00Z. DOI: 10.48550/arXiv.2406.03816
Dan Zhang, Sining Zhoubian, Ziniu Hu, Yisong Yue, Yuxiao Dong, Jie Tang
Abstract:
Recent methodologies in LLM self-training mostly rely on LLM generating responses and filtering those with correct output answers as training data. This approach often yields a low-quality fine-tuning training set (e.g., incorrect plans or intermediate reasoning). In this paper, we develop a reinforced self-training approach, called ReST-MCTS*, based on integrating process reward guidance with tree search MCTS* for collecting higher-quality reasoning traces as well as per-step value to train policy and reward models. ReST-MCTS* circumvents the per-step manual annotation typically used to trai… >>>
Recent methodologies in LLM self-training mostly rely on LLM generating responses and filtering those with correct output answers as training data. This approach often yields a low-quality fine-tuning training set (e.g., incorrect plans or intermediate reasoning). In this paper, we develop a reinforced self-training approach, called ReST-MCTS*, based on integrating process reward guidance with tree search MCTS* for collecting higher-quality reasoning traces as well as per-step value to train policy and reward models. ReST-MCTS* circumvents the per-step manual annotation typically used to train process rewards by tree-search-based reinforcement learning: Given oracle final correct answers, ReST-MCTS* is able to infer the correct process rewards by estimating the probability this step can help lead to the correct answer. These inferred rewards serve dual purposes: they act as value targets for further refining the process reward model and also facilitate the selection of high-quality traces for policy model self-training. We first show that the tree-search policy in ReST-MCTS* achieves higher accuracy compared with prior LLM reasoning baselines such as Best-of-N and Tree-of-Thought, within the same search budget. We then show that by using traces searched by this tree-search policy as training data, we can continuously enhance the three language models for multiple iterations, and outperform other self-training algorithms such as ReST$^\text{EM}$ and Self-Rewarding LM. We release all code at https://github.com/THUDM/ReST-MCTS. <<<
47.
尹志 (2026-01-31 23:53):
#paper https://arxiv.org/abs/2601.21571. arxiv 2026. Shaping capabilities with token-level data filtering。文档级过滤过渡到Token 级过滤确实是很直接的想法,但用良好的工程实现获得洞见,确实是alec的风格。
arXiv, 2026-01-29T11:34:01Z. DOI: 10.48550/arXiv.2601.21571
Neil Rathi, Alec Radford
Abstract:
Current approaches to reducing undesired capabilities in language models are largely post hoc, and can thus be easily bypassed by adversaries. A natural alternative is to shape capabilities during pretraining itself. On the proxy task of removing medical capabilities, we show that the simple intervention of filtering pretraining data is highly effective, robust, and inexpensive at scale. Inspired by work on data attribution, we show that filtering tokens is more effective than filtering documents, achieving the same hit to undesired capabilities at a lower cost to benign ones. Training models… >>>
Current approaches to reducing undesired capabilities in language models are largely post hoc, and can thus be easily bypassed by adversaries. A natural alternative is to shape capabilities during pretraining itself. On the proxy task of removing medical capabilities, we show that the simple intervention of filtering pretraining data is highly effective, robust, and inexpensive at scale. Inspired by work on data attribution, we show that filtering tokens is more effective than filtering documents, achieving the same hit to undesired capabilities at a lower cost to benign ones. Training models spanning two orders of magnitude, we then demonstrate that filtering gets more effective with scale: for our largest models, token filtering leads to a 7000x compute slowdown on the forget domain. We also show that models trained with token filtering can still be aligned on the forget domain. Along the way, we introduce a methodology for labeling tokens with sparse autoencoders and distilling cheap, high-quality classifiers. We also demonstrate that filtering can be robust to noisy labels with sufficient pretraining compute. <<<
48.
钟鸣 (2026-01-31 23:42):
#paper doi:10.1038/s41598-024-54874-4 Python farming as a flexible and efficient form of agricultural food security 本研究旨在评估两种大型蟒蛇(网纹蟒和缅甸蟒)在养殖场中的生长效率,探究其作为新型畜牧业的潜力。方法是定期测量12个月内蟒蛇的吻肛长和体重、摄食情况,并记录蟒蛇禁食20天的体重变化。结果发现,缅甸蟒日均增重远大于网纹蟒(42.6克vs19.7克),雌性生长快于雄性,早期生长速率(前2个月)和总摄食量是预测12个月生长的关键因素。禁食实验表明禁食期间日均体重损失仅0.004%,且恢复摄食后可继续快速生长,受饲料波动影响小。蟒的能量效率远高于恒温动物或因其是变温动物,鲑鱼、蟋蟀等变温动物能量效率与蟒蛇相当但稍高。较高的能量效率和抗禁食能力突出了蟒蛇养殖的潜力,但仍需考虑蛇文化的影响,且考虑其食物(雏鸡、鼠)的大规模供应。
D. Natusch, P. W. Aust, C. Caraguel, P. L. Taggart, V. T. Ngo, G. J. Alexander, R. Shine, T. Coulson
Abstract:
AbstractDiminishing natural resources and increasing climatic volatility are impacting agri-food systems, prompting the need for sustainable and resilient alternatives. Python farming is well established in Asia but has received little attention from mainstream agricultural scientists. We measured growth rates in two species of large pythons (Malayopython reticulatus and Python bivittatus) in farms in Thailand and Vietnam and conducted feeding experiments to examine production efficiencies. Pythons grew rapidly over a 12-month period, and females grew faster than males. Food int… >>>
AbstractDiminishing natural resources and increasing climatic volatility are impacting agri-food systems, prompting the need for sustainable and resilient alternatives. Python farming is well established in Asia but has received little attention from mainstream agricultural scientists. We measured growth rates in two species of large pythons (<i>Malayopython reticulatus</i> and <i>Python bivittatus</i>) in farms in Thailand and Vietnam and conducted feeding experiments to examine production efficiencies. Pythons grew rapidly over a 12-month period, and females grew faster than males. Food intake and growth rates early in life were strong predictors of total lifetime growth, with daily mass increments ranging from 0.24 to 19.7 g/day for <i>M. reticulatus</i> and 0.24 to 42.6 g/day for <i>P. bivittatus</i>, depending on food intake. Pythons that fasted for up to 4.2 months lost an average of 0.004% of their body mass per day, and resumed rapid growth as soon as feeding recommenced. Mean food conversion rate for dressed carcasses was 4.1%, with useable products (dressed carcass, skin, fat, gall bladder) comprising 82% of the mass of live animals. In terms of food and protein conversion ratios, pythons outperform all mainstream agricultural species studied to date. The ability of fasting pythons to regulate metabolic processes and maintain body condition enhances food security in volatile environments, suggesting that python farming may offer a flexible and efficient response to global food insecurity. <<<
49.
李翛然 (2026-01-31 23:04):
#paper doi:10.1038/s41586-025-10014-0 Nature Advancing regulatory variant effect prediction with AlphaGenome。 AlphaGenome,这是一个能够统一解读 DNA 非编码“暗物质”的深度学习模型。该模型可直接输入长达 1 兆碱基对(1 Mb)‍ 的 DNA 序列,以单碱基分辨率同时预测数千种基因组功能信号(如染色质可及性、转录因子结合、剪接等)在性能上,AlphaGenome 在 24 项基因组轨迹预测任务中的 22 项,以及 26 项变异效应预测任务中的 24 项 上达到了最先进水平。它能够准确预测非编码变异如何影响基因调控,例如成功解析了白血病相关癌基因 TAL1 附近变异的作用机制。我觉得他倒是和 kegg没有特别让我眼前一亮的。
IF:50.500Q1 Nature, 2026-01. DOI: 10.1038/s41586-025-10014-0 PMID: 41606153 PMCID:PMC12851941
Žiga Avsec, Natasha Latysheva, Jun Cheng, Guido Novati, Kyle R Taylor, Tom Ward, Clare Bycroft, Lauren Nicolaisen, Eirini Arvaniti, Joshua Pan ... >>>
Žiga Avsec, Natasha Latysheva, Jun Cheng, Guido Novati, Kyle R Taylor, Tom Ward, Clare Bycroft, Lauren Nicolaisen, Eirini Arvaniti, Joshua Pan, Raina Thomas, Vincent Dutordoir, Matteo Perino, Soham De, Alexander Karollus, Adam Gayoso, Toby Sargeant, Anne Mottram, Lai Hong Wong, Pavol Drotár, Adam Kosiorek, Andrew Senior, Richard Tanburn, Taylor Applebaum, Souradeep Basu, Demis Hassabis, Pushmeet Kohli <<<
Abstract:
Deep learning models that predict functional genomic measurements from DNA sequences are powerful tools for deciphering the genetic regulatory code. Existing methods involve a trade-off between input sequence length and prediction resolution, thereby limiting their modality scope and performance. We present AlphaGenome, a unified DNA sequence model, which takes as input 1 Mb of DNA sequence and predicts thousands of functional genomic tracks up to single-base-pair resolution across diverse modalities. The modalities include gene expression, transcription initiation, chromatin accessibility, h… >>>
Deep learning models that predict functional genomic measurements from DNA sequences are powerful tools for deciphering the genetic regulatory code. Existing methods involve a trade-off between input sequence length and prediction resolution, thereby limiting their modality scope and performance. We present AlphaGenome, a unified DNA sequence model, which takes as input 1 Mb of DNA sequence and predicts thousands of functional genomic tracks up to single-base-pair resolution across diverse modalities. The modalities include gene expression, transcription initiation, chromatin accessibility, histone modifications, transcription factor binding, chromatin contact maps, splice site usage and splice junction coordinates and strength. Trained on human and mouse genomes, AlphaGenome matches or exceeds the strongest available external models in 25 of 26 evaluations of variant effect prediction. The ability of AlphaGenome to simultaneously score variant effects across all modalities accurately recapitulates the mechanisms of clinically relevant variants near the TAL1 oncogene. To facilitate broader use, we provide tools for making genome track and variant effect predictions from sequence. <<<
50.
半面阳光 (2026-01-31 22:47):
#paper doi: https://doi.org/10.1038/s41467-025-67218-1. Nature Communications. 2025. Flexible read-aware genotype imputation from sequence using biobank sized reference panels. 这篇文章在先前的QUILT的基因型填充(Genotype Imputation)方法基础上开发了一个新的QUILT2方法。这个方法能够基于大规模单倍型参考panel数据,对低深度全基因组测序reads及游离DNA进行快速的单体型推断与基因型填充。此外,QUILT2还包含一个方法学上的创新,旨在通过NIPT数据实现对母体的和胎儿的基因组填充。
Zilong Li, Anders Albrechtsen, Robert W. Davies
Abstract:
Abstract
Inexpensive and accurate genotyping methods are essential to modern genomics and health risk prediction. Here we introduce QUILT2, a scalable and read-aware imputation method that can efficiently use biobank scale haplotype reference panels. This allows for fast and accurate imputation using short reads, as well as long reads (e.g. Oxford Nanopore Technologies (ONT) 1X, r2 = 0.937 at common SNPs), linked-reads and ancient DNA. In addition, QUILT2 contains a methodological innovation that is designed to enable imputation of the maternal and fetal genome using cell free non-invasiv… >>>
Abstract<br> Inexpensive and accurate genotyping methods are essential to modern genomics and health risk prediction. Here we introduce QUILT2, a scalable and read-aware imputation method that can efficiently use biobank scale haplotype reference panels. This allows for fast and accurate imputation using short reads, as well as long reads (e.g. Oxford Nanopore Technologies (ONT) 1X, r2 = 0.937 at common SNPs), linked-reads and ancient DNA. In addition, QUILT2 contains a methodological innovation that is designed to enable imputation of the maternal and fetal genome using cell free non-invasive prenatal testing (NIPT) data. Using a UK Biobank reference panel and simulated NIPT data, we see accurate imputation of the mother (0.25X, r2 = 0.966, common SNPs) and modest imputation of the fetus (0.25X, r2 = 0.465, fetal fraction of 10%) at low coverage, with fetal imputation accuracy rising with coverage (4.0X, fetal r2 = 0.894). We show using simulated data that this could enable both GWAS and PRS for the mother and fetus, which could create clinical opportunities, and if phenotypes can be collected alongside clinical NIPT, the potential for large GWAS. <<<
51.
哪有情可长 (2026-01-31 21:29):
#paper Multi-omics identifies key genetic and metabolic networks regulating spike organ development in wheat. Plant Cell,  18 October 2025 doi.org/10.1093/plcell/koaf250. 小麦是全球重要的粮食作物,穗部发育是决定穗粒数、籽粒大小等关键产量性状的核心过程,但其复杂的基因与代谢物互作调控机制尚不明确。以“陇春35”为研究材料,针对小穗、穗轴、小花(含子房、花药)、芒等组织,覆盖12个关键发育阶段,结合LC-MS/MS代谢组学与转录组测序技术,构建了小麦穗发育的高时空分辨率多组学图谱。研究发现代谢物在不同组织中的富集特异性,揭示了激素时空分布对穗型发育的影响。鉴定出调控籽粒大小的关键基因TaOPR3、GL1和 GL2,并证实其优异单倍型在现代育种过程中被利用。该图谱深刻解析了代谢物与基因表达网络的互作机制,为理解小麦产量的分子基础提供了全新视角
Yangyang Liu, Lili Zhang, Anting Zhu, Liping Shen, Jiaqi Zhang, Jun Chen, Guowei Chang, Changbin Yin, Ziying Wang, Zhiwen Sun ... >>>
Yangyang Liu, Lili Zhang, Anting Zhu, Liping Shen, Jiaqi Zhang, Jun Chen, Guowei Chang, Changbin Yin, Ziying Wang, Zhiwen Sun, Kuocheng Shen, Xiaowan Xu, Mengjing Sun, Mingming Xin, Jianhui Wu, Zefu Lu, Yiping Tong, Zhonghu He, Fei Lu, Yuanfeng Hao, Wei Chen, Zifeng Guo <<<
Abstract:
Abstract
Wheat (Triticum aestivum L.) spike development is tightly regulated by genetic and metabolic programs that drive organ growth and morphological changes. However, the dynamic interplay between metabolic shifts, gene expression patterns, and their regulatory roles during spike development, remains poorly characterized. To address this knowledge gap, we performed integrated metabolomic and transcriptomic profiling across 12 stages of wheat spike organ development. Our analysis detected 1,105 metabolites in 233 spike, spikelet, and floret samples, uncovering an uneven distribution of… >>>
Abstract<br> Wheat (Triticum aestivum L.) spike development is tightly regulated by genetic and metabolic programs that drive organ growth and morphological changes. However, the dynamic interplay between metabolic shifts, gene expression patterns, and their regulatory roles during spike development, remains poorly characterized. To address this knowledge gap, we performed integrated metabolomic and transcriptomic profiling across 12 stages of wheat spike organ development. Our analysis detected 1,105 metabolites in 233 spike, spikelet, and floret samples, uncovering an uneven distribution of phytohormone-related metabolites. The exogenous phytohormone treatments validated the regulatory roles of phytohormones in spike morphogenesis. High-resolution spatiotemporal data from carpel organs enabled the reconstruction of a regulatory network, identifying key genes (including 12-oxo-phytodienoic acid reductase3 (TaOPR3), Grain Length1 (GL1), and Grain Length2 (GL2)) as critical determinants of grain size. Genomic analyses revealed geographical differentiation in gene haplotypes and their selective retention during breeding, with superior alleles associated with increased grain size. This comprehensive dataset provides a valuable resource for understanding the molecular basis of wheat grain yield and offers potential targets for crop improvement. <<<
52.
徐炳祥 (2026-01-31 21:09):
#paper doi: 10.1038/s41592-018-0033-z Nature methods, 2018, SAVER: gene expression recovery for single-cell RNA sequencing。本文是单细胞RNA-seq缺失值插补方面的经典论文之一。作者给出了一种能借助单细胞文库中其他细胞的基因表达水平填补单个细胞数据缺失的算法。算法利用泊松分布建模基因表达计数,用Gamma分布对其均值建模。利用细胞间基因表达水平的相互回归估计此Gamma分布的均值。通过假定变异率在细胞群体中恒定来估计Gamma分布的形状参数。最终实现确实表达水平的填补和测得表达计数的纠偏。在模拟和真实单细胞RNA-seq数据集中算法性能均得到了验证。本文为单细胞数据的缺失值插补提供了一个可行的理论框架,是后续众多研究的基础。
Mo Huang, Jingshu Wang, Eduardo Torre, Hannah Dueck, Sydney Shaffer, Roberto Bonasio, John I. Murray, Arjun Raj, Mingyao Li, Nancy R. Zhang
53.
Vincent (2026-01-31 17:31):
#paper https://arxiv.org/abs/2201.11903 arxiv 2022. Chain-of-Thought Prompting Elicits Reasoning in Large Language Models. 这篇文首次提出了Chain-of-Thought(CoT)的思路,通过在少样本提示中显式提供中间自然语言推理步骤,可以显著提升大语言模型在复杂推理任务上的表现。作者在多种推理任务基准测试上展示了 CoT 的显著增益,尤其在 100B+ 参数规模模型上表现为一种随规模涌现(emergent)的能力。消融实验表明,性能提升并非仅来自“多算一步”,而是顺序化、可读的推理过程本身在发挥作用。该方法无需额外训练或微调,仅通过提示即可实现,因而得以广泛运用,为大模型的可解释推理研究开辟了新方向
arXiv, 2022-01-28T02:33:07Z. DOI: 10.48550/arXiv.2201.11903
Jason Wei, Xuezhi Wang, Dale Schuurmans, Maarten Bosma, Brian Ichter, Fei Xia, Ed Chi, Quoc Le, Denny Zhou
Abstract:
We explore how generating a chain of thought -- a series of intermediate reasoning steps -- significantly improves the ability of large language models to perform complex reasoning. In particular, we show how such reasoning abilities emerge naturally in sufficiently large language models via a simple method called chain of thought prompting, where a few chain of thought demonstrations are provided as exemplars in prompting. Experiments on three large language models show that chain of thought prompting improves performance on a range of arithmetic, commonsense, and symbolic reasoning tasks. T… >>>
We explore how generating a chain of thought -- a series of intermediate reasoning steps -- significantly improves the ability of large language models to perform complex reasoning. In particular, we show how such reasoning abilities emerge naturally in sufficiently large language models via a simple method called chain of thought prompting, where a few chain of thought demonstrations are provided as exemplars in prompting. Experiments on three large language models show that chain of thought prompting improves performance on a range of arithmetic, commonsense, and symbolic reasoning tasks. The empirical gains can be striking. For instance, prompting a 540B-parameter language model with just eight chain of thought exemplars achieves state of the art accuracy on the GSM8K benchmark of math word problems, surpassing even finetuned GPT-3 with a verifier. <<<
54.
符毓 (2026-01-31 13:25):
#paper doi:10.1109/ECCE47101.2021.9595683 IEEE, 2021, Application of Flat Rectangular Wire Concentrated Winding for AC loss Reduction in Electrical Machines. 随着电机功率和转速要求的提高,高速运行时产生的损耗,特别是绕组交流损耗,已成为一个亟待解决的问题。扁线可以最大限度地减少由趋肤效应和邻近效应引起的绕组交流损耗。本文分析了传统圆形导线对交流损耗的影响,并将其与所提出的扁平矩形导线结构进行了比较。同时,对开槽和半闭槽定子结构进行了评估 结果表明采用扁平矩形导线结构可以显著降低交流损耗。分析和仿真结果表明,与开放式槽定子相比,半封闭式槽定子绕组交流损耗较低,但铁芯损耗和永磁体损耗较高。由于开放式槽定子可以容纳预成型的扁平矩形导线绕组,并且易于实现自动化绕线工艺
55.
小年 (2026-01-30 11:57):
#paper doi:10.1038/s41565-025-02080-2,Goerzen D, Heller DA, et al. Machine perception liquid biopsy identifies brain tumours via systemic immune and tumour microenvironment signature(Nature Nanotechnology, 2025) 该研究开发了一种“机器感知液体活检(MPLB)”的新技术。研究团队构建含 21 种量子阱缺陷修饰单壁碳纳米管的传感器阵列,结合近红外荧光光谱与 CatBoost 机器学习算法,对 739 例血浆样本(含胶质瘤、脑膜瘤等 4 类脑肿瘤及非肿瘤样本)进行分析,实现 98% 的脑肿瘤检测准确率和 71% 的肿瘤分型准确率,外部队列验证准确率达 89.8%,且能检出 WHO 1-2 级低级别肿瘤。通过定量蛋白质组学解析传感器表面 “蛋白冠”,鉴定出 2017 种富集蛋白,包括 ENPP2、S100A 家族等新型标志物,这些标志物来源于肿瘤细胞、肿瘤微环境及全身免疫系统。该技术突破血脑屏障导致的生物标志物稀缺限制,无需特殊样本处理,为脑肿瘤无创早期诊断、亚型区分及个性化治疗靶点发现提供了全新方案,也为其他缺乏有效生物标志物的疾病检测奠定技术基础。
Dana Goerzen, Mijin Kim, Chanel Schroff, Margaret Ngoc Hoang, Jaina Sarris Wollowitz, August Kolb, Jordain P. Walshon, Kathleen McCortney, Craig Horbinski, Kristyn Galbraith ... >>>
Dana Goerzen, Mijin Kim, Chanel Schroff, Margaret Ngoc Hoang, Jaina Sarris Wollowitz, August Kolb, Jordain P. Walshon, Kathleen McCortney, Craig Horbinski, Kristyn Galbraith, Sana Raoof, Matija Snuderl, Alban Ordureau, Daniel A. Heller <<<
56.
cellsarts (2026-01-30 00:13):
# Paper DOI:10.3389/fmicb.2019.008492019-04-24 The Biogeochemical Sulfur Cycle of Marine Sediments 海洋沉积物的生物地球化学硫循环 摘要:微生物通过异化硫酸盐还原作用将硫酸盐转化为硫化物,是缺氧海底有机质矿化的主要终末途径。生成的硫化物经化学或微生物氧化后,在硫循环中形成复杂的途径网络,进而产生多种中间态硫化合物,并部分重新转化为硫酸盐。这些中间产物包括单质硫、多硫化物、连四硫酸盐和亚硫酸盐,它们均可作为进一步微生物氧化、还原或歧化反应的底物。近年来,一些新的微生物发现,例如通过硫化物氧化电缆细菌实现长距离电子传递的现象,进一步增加了这一过程的复杂性。同位素交换反应在稳定同位素地球化学以及利用放射性示踪剂研究硫转化的实验中发挥着重要作用。微生物催化的这些过程部分具有可逆性,其逆反应会影响我们对放射性示踪实验的解释,并为同位素分馏提供了一种机制。本文综述了我们在理解海底硫循环方面所取得的进展及当前的研究现状,涵盖其微生物生态学、生物地球化学和同位素地球化学等多个领域。
Bo Barker Jørgensen, Alyssa J. Findlay, André Pellerin
57.
白鸟 (2026-01-29 16:02):
#paper DOI:10.1126/science.ads9530 文献名称:Deep contrastive learning enables genome-wide virtual screening 发表期刊:Science, 2026 文章概要:DrugCLIP模型,基于深度对比学习的框架,用于实现超大规模、超快速的全基因组虚拟筛选。核心问题:传统分子对接计算量巨大,无法高效处理全人类基因组(约10,000+个蛋白靶点)× 海量化合物库(如500百万分子)的组合(万亿级交互)。 算法思路:通过对比学习,将蛋白pocket(结合位点)和小分子嵌入到一个共享的潜在空间中。在这个空间里,相似度直接编码蛋白-分子结合的可能性,实现了开创性的万亿级全基因组筛选,是后AlphaFold时代的新范式,推动从“靶点-化合物”的一对一筛选转向“全基因组-全化学空间”的系统性探索。 亮点 速度极快:DrugCLIP可在一天内完成万亿级交互,真正实现全基因组规模。 准确性强:在多种基准上表现出色,EF1%(top 1%富集因子)等指标领先;支持多靶点筛选和泛化。 可解释性:嵌入空间可视化(t-SNE等)能直观展示蛋白-分子匹配模式。 开放性高:作者公开了大规模筛选数据库,研究者可直接查询/下载结果;早期版本代码已开源(NeurIPS 2023 DrugCLIP仓库)。 部署和应用 1.在线版本:提交输入文件,即可生成结果 2.GitHub开源版本:早期版本开源,可python调用; 局限性:结构依赖、计算资源和实验验证
Yinjun Jia, Bowen Gao, Jiaxin Tan, Jiqing Zheng, Xin Hong, Wenyu Zhu, Haichuan Tan, Yuan Xiao, Liping Tan, Hongyi Cai ... >>>
Yinjun Jia, Bowen Gao, Jiaxin Tan, Jiqing Zheng, Xin Hong, Wenyu Zhu, Haichuan Tan, Yuan Xiao, Liping Tan, Hongyi Cai, Yanwen Huang, Zhiheng Deng, Xiangwei Wu, Yue Jin, Yafei Yuan, Jiekang Tian, Wei He, Weiying Ma, Yaqin Zhang, Lei Liu, Chuangye Yan, Wei Zhang, Yanyan Lan <<<
Abstract:
Recent breakthroughs in protein structure prediction have opened new avenues for genome-wide drug discovery, yet existing virtual screening methods remain computationally prohibitive. We present DrugCLIP, a contrastive learning framework that achieves ultrafast and accurate virtual screening, up to 10 million times faster than docking, while consistently outperforming various baselines on in silico benchmarks. In wet-lab validations, DrugCLIP achieved a 15% hit rate for norepinephrine transporter, and structures of two identified inhibitors were determined in complex with the target protein. … >>>
Recent breakthroughs in protein structure prediction have opened new avenues for genome-wide drug discovery, yet existing virtual screening methods remain computationally prohibitive. We present DrugCLIP, a contrastive learning framework that achieves ultrafast and accurate virtual screening, up to 10 million times faster than docking, while consistently outperforming various baselines on in silico benchmarks. In wet-lab validations, DrugCLIP achieved a 15% hit rate for norepinephrine transporter, and structures of two identified inhibitors were determined in complex with the target protein. For thyroid hormone receptor interactor 12, a target that lacks holo structures and small-molecule binders, DrugCLIP achieved a 17.5% hit rate using only AlphaFold2-predicted structures. Finally, we released GenomeScreenDB, an open-access database providing precomputed results for ~10,000 human proteins screened against 500 million compounds, pioneering a drug discovery paradigm in the post-AlphaFold era. <<<
58.
惊鸿 (2026-01-29 10:07):
#paper DOI: 10.1056/NEJMoa2504747 英文标题: Customized Base-Editing Therapy for CPS1 Deficiency(基于案例内容概括) 发表时间: 2025年(《新英格兰医学杂志》正式发表) 核心突破 本研究首次为一名患有致死性氨甲酰磷酸合成酶1(CPS1)缺乏症的6月龄男婴(KJ Muldoon)量身定制了体内碱基编辑疗法,并在7个月内完成从设计到临床施用的全过程,实现了“N-of-1”个性化基因治疗的里程碑。患儿在接受两次静脉输注(0.1 mg/kg、0.3 mg/kg)后,血氨水平显著下降,能够耐受更高蛋白质摄入,且未再发生高血氨危象。 技术亮点 1. 平台化快速开发:采用腺嘌呤碱基编辑器(ABE)+ 脂质纳米颗粒(LNP) 递送系统,仅需针对患者特定突变定制向导RNA(gRNA),极大缩短研发周期。 2. 精准无创编辑:ABE直接化学修饰DNA碱基(A·T→G·C),无需切割双链,避免传统CRISPR-Cas9的脱靶和插入/缺失风险。 3. 可重复给药优势:LNP递送允许剂量调整(二次输注),克服了AAV载体免疫原性高、仅能单次给药的局限。 局限与展望 - 随访时间短:目前仅报道7周临床数据,长期安全性与持久性需进一步验证。 - 成本与可及性:个性化疗法开发成本高昂,需推动平台化生产以降低费用。 - 拓展潜力:该“平台化+定制化”模式可推广至其他由点突变引起的肝脏代谢遗传病,为数百万罕见病患者提供新希望。 总结 本研究不仅成功挽救一名危重患儿,更验证了个性化基因编辑疗法在极短时间内从概念到临床的可行性,标志着基因治疗从“一刀切”迈向“量身定制”的新纪元。未来需聚焦长期监测、成本优化及适应症拓展,让精准医疗惠及更多罕见病患者。 原文链接:"https://doi.org/10.1056/NEJMoa2504747" (https://doi.org/10.1056/NEJMoa2504747)
Kiran Musunuru, Sarah A. Grandinette, Xiao Wang, Taylor R. Hudson, Kevin Briseno, Anne Marie Berry, Julia L. Hacker, Alvin Hsu, Rachel A. Silverstein, Logan T. Hille ... >>>
Kiran Musunuru, Sarah A. Grandinette, Xiao Wang, Taylor R. Hudson, Kevin Briseno, Anne Marie Berry, Julia L. Hacker, Alvin Hsu, Rachel A. Silverstein, Logan T. Hille, Aysel N. Ogul, Nancy A. Robinson-Garvin, Juliana C. Small, Sarah McCague, Samantha M. Burke, Christina M. Wright, Sarah Bick, Venkata Indurthi, Shweta Sharma, Michael Jepperson, Christopher A. Vakulskas, Michael Collingwood, Katie Keogh, Ashley Jacobi, Morgan Sturgeon, Christian Brommel, Ellen Schmaljohn, Gavin Kurgan, Thomas Osborne, He Zhang, Kyle Kinney, Garrett Rettig, Christopher J. Barbosa, Sean C. Semple, Ying K. Tam, Cathleen Lutz, Lindsey A. George, Benjamin P. Kleinstiver, David R. Liu, Kim Ng, Sadik H. Kassim, Petros Giannikopoulos, Mohamad-Gabriel Alameh, Fyodor D. Urnov, Rebecca C. Ahrens-Nicklas <<<
59.
颜林林 (2026-01-29 01:05):
#paper doi:10.1016/j.jbi.2025.104971, Journal of Biomedical Informatics, 2026, Augmented intelligence for multimodal virtual biopsy in breast cancer using generative artificial intelligence. 这是一篇应用生成式AI来帮助提升诊断准确度的文章。在乳腺癌诊断中,活检(biopsy)虽是金标准,但由于其侵入性和滞后性,实践中还是需要仅基于影像学的诊断方法。当前使用的标准图像是全场数字乳腺摄影(FFDM),但它往往难以在致密型乳腺中准确辨识病灶,于是需要补充另一种图像,增强光谱乳腺摄影(CESM),来通过造影剂显著提升病灶可见度,但这种图像因辐射剂量和造影剂过敏风险,难以在所有患者中普及。这篇论文针对这一痛点,提出了一种解决方案:利用生成式人工智能(CycleGAN)在只有普通FFDM图像的情况下,合成出高质量的“虚拟”CESM图像,使诊断可以同时基于两种图像进行,从而提高准确度,并将其取名为“虚拟活检(virtual biopsy)”。虽然文章评估结果证实这种“虚构图像”的引入,相比只有标准图像的情况,的确能提升诊断的准确度。然而,在我看来,大概是由于只依赖标准图像的模型,并未充分把标准图像的信息利用起来,才为这篇文章的方法留下了提升空间。这种不直接改进原模型,而通过增加生成式虚构图像来补充信息的方法,让我想到那个关于数学家救火队员的段子:如果发现着火了怎么办?取出高压水枪灭火;如果发现没着火怎么办?先点火,然后取出高压水枪灭火。
60.
孤舟蓑笠翁 (2026-01-21 22:40):
paper 【doi】10.1038/s41588-025-02449-y;【发表年份】2026年;【期刊】Nature Genetics;【标题】Protein-protein interactions shape trans-regulatory impact of genetic variation on protein expression and complex traits。【内容总结】这篇论文想弄明白我们的基因差异(遗传变异)是如何通过影响蛋白质之间的相互作用,最终决定我们血液里蛋白质的多少(表达水平),并影响身高、疾病风险等复杂特征的;为此,研究人员重新分析了英国生物银行血浆蛋白质组计划(UK Biobank Pharma Proteomics Project, UKB-PPP)这个大型数据库,重点比较了影响蛋白质水平的遗传位点(pQTL)和影响信使RNA水平的位点(eQTL)的效应强弱,并开发了一种叫trans-PCO的新方法来找出那些能“远程调控”(trans调控)蛋白质表达的遗传变异,同时严格控制了数据分析过程以避免假信号。他们发现,那些“远程调控”蛋白质的遗传效应,其强度远超“远程调控”RNA的效应(trans-to-cis z-score比率更高),并且蛋白质表达水平的遗传差异有89-90%是由这种“远程调控”贡献的,而RNA只有60-79%;这些执行“远程调控”的基因往往更重要、更受进化约束(高pLI评分基因占27.2%),并且与36%的复杂疾病GWAS位点(例如类风湿性关节炎RA的位点)存在关联,而直接调控的基因仅关联10.1%;最关键的是,他们证实蛋白质相互作用网络是这种“远程调控”的主要途径,相关信号在相互作用网络中富集了5.4倍,并据此新发现了17,662个“远程调控”位点,这些位点能将多个看似不相关的疾病位点(如6个RA风险位点)联系到同一个蛋白质相互作用系统(如BAFF/APRIL系统)上,从而解释了疾病发生的可能通路。
回到顶部