当前共找到 4 篇文献分享。
1.
刘昊辰 (2025-06-03 16:34):
#paper AlphaZero-Edu Making AlphaZero Accessible to Everyone. AlphaZero-Edu 是基于 AlphaZero 数学框架开发的轻量化强化学习框架,专为教育场景和五子棋设计,具有模块化架构(解耦蒙特卡洛树搜索、自我对弈训练、策略价值网络)、资源高效训练(单块 NVIDIA RTX 3090 GPU 即可运行)和高度并行自我对弈数据生成(8 进程实现 3.2 倍加速)等特点。其状态特征采用 21 层张量(含当前状态和 20 层历史状态),输出包含策略概率分布和价值评估标量,并通过旋转 / 翻转数据增强提升泛化能力。训练中结合循环学习率调度器,使策略损失和价值损失均收敛,且在与 4 名人类玩家的对战中实现最高 100% 胜率,最低 60% 胜率(含 20% 平局)。该框架已开源,为学术研究和工业应用提供了可访问的基准。下载地址:https://arxiv.org/pdf/2504.14636
arXiv, 2025-04-20T14:29:39Z. DOI: 10.48550/arXiv.2504.14636
Abstract:
Recent years have witnessed significant progress in reinforcement learning,especially with Zero-like paradigms, which have greatly boosted thegeneralization and reasoning abilities of large-scale language models.Nevertheless, existing frameworks are often plagued by … >>>
Recent years have witnessed significant progress in reinforcement learning,especially with Zero-like paradigms, which have greatly boosted thegeneralization and reasoning abilities of large-scale language models.Nevertheless, existing frameworks are often plagued by high implementationcomplexity and poor reproducibility. To tackle these challenges, we presentAlphaZero-Edu, a lightweight, education-focused implementation built upon themathematical framework of AlphaZero. It boasts a modular architecture thatdisentangles key components, enabling transparent visualization of thealgorithmic processes. Additionally, it is optimized for resource-efficienttraining on a single NVIDIA RTX 3090 GPU and features highly parallelizedself-play data generation, achieving a 3.2-fold speedup with 8 processes. InGomoku matches, the framework has demonstrated exceptional performance,achieving a consistently high win rate against human opponents. AlphaZero-Eduhas been open-sourced at https://github.com/StarLight1212/AlphaZero_Edu,providing an accessible and practical benchmark for both academic research andindustrial applications. <<<
翻译
2.
少颖-focus reverse aging (2025-06-02 00:22):
#paper Cardiovascular aging: from cellular and molecular changes to therapeutic interventions doi: 10.20517/jca.2023.09 论文网址:https://pmc.ncbi.nlm.nih.gov/articles/PMC10238104/#S16 公众号阅读笔记链接:https://mp.weixin.qq.com/s/O9IqSEJpAwSGIlaAMap3mQ
Abstract:
Progressive age-induced deterioration in the structure and function of the cardiovascular system involves cardiac hypertrophy, diastolic dysfunction, myocardial fibrosis, arterial stiffness, and endothelial dysfunction. These changes are driven by complex … >>>
Progressive age-induced deterioration in the structure and function of the cardiovascular system involves cardiac hypertrophy, diastolic dysfunction, myocardial fibrosis, arterial stiffness, and endothelial dysfunction. These changes are driven by complex processes that are interconnected, such as oxidative stress, mitochondrial dysfunction, autophagy, inflammation, fibrosis, and telomere dysfunction. In recent years, the advances in research of cardiovascular aging, including the wide use of animal models of cardiovascular aging, elucidated an abundance of cell signaling pathways involved in these processes and brought into sight possible interventions, which span from pharmacological agents, such as metformin, sodium-glucose cotransporter 2-inhibitors, rapamycin, dasatinib and quercetin, to lifestyle changes. <<<
翻译
3.
DeDe宝 (2025-06-01 17:15):
#paper https://doi.org/10.1101/2024.07.02.601741 bioRxiv.2025. 3D directional tuning in the orofacial sensorimotor cortex during natural feeding and drinking 这篇文献研究了在自然摄食和饮水行为中,口腔面部感觉运动皮层(OSMCx)对舌头三维方向的编码特性,以及口腔触觉感觉丧失对这些编码特性的影响。实验对象为两只成年雄性恒河猴,实验任务分为摄食任务和饮水任务,目标脑区是OSMCx(OSMCx是大脑皮层中负责口腔面部感觉和运动功能的区域,包括初级运动皮层(MIo)、初级体感皮层(SIo)和皮层咀嚼区(CMA))。为了研究感觉反馈在控制舌头运动中的作用,研究者使用局部注射布比卡因(Bupivacaine HCL)和肾上腺素(Epinephrine)混合液,阻断三叉神经的感觉分支,以消除口腔触觉感觉。研究者在猴子的颅骨、下颌骨和舌头上植入直径为1毫米的钽珠,用于标记,并使用双平面视频放射照相术(biplanar videoradiography)以200 Hz的频率记录标记物的三维位置数据。同时,通过慢性植入的微电极阵列记录OSMCx中初级运动皮层(MIo)和初级体感皮层(SIo)的神经元放电活动。 实验结果表明,在摄食和饮水任务中,大多数MIo和SIo神经元表现出对三维舌运动方向的调谐,其中MIo神经元的方向调谐更为显著(尤其是在摄食任务中)。感觉神经阻断后,MIo和SIo中方向调谐的神经元比例显著下降,表明触觉反馈在控制舌运动方向中起着关键作用。。
Abstract:
AbstractDirectional tongue movements are crucial for feeding and speech, ensuring proper food positioning for chewing and swallowing, as well as accurate sound production. While directional tuning in the arm region … >>>
AbstractDirectional tongue movements are crucial for feeding and speech, ensuring proper food positioning for chewing and swallowing, as well as accurate sound production. While directional tuning in the arm region of the sensorimotor cortex during reaching tasks is well-studied, little is known about how 3D tongue direction is encoded in the orofacial sensorimotor cortex (OSMCx) during natural behaviors. Understanding this neural representation has important implications for rehabilitating individuals with orolingual dysfunctions. This study examines the directional tuning and population dynamics in OSMCx during naturalistic feeding and drinking, and how these are affected by sensory loss. Using biplanar videoradiography, we tracked implanted tongue markers in behaving rhesus macaques (Macaca mulatta) and simultaneously recorded 3D positional data with spiking activity from chronically implanted microelectrode arrays in primary motor (MIo) and somatosensory (SIo) areas of the orofacial cortex. In some sessions, tasks were preceded by bilateral nerve block injections to the sensory branches of the trigeminal nerve. Modulation to 3D tongue direction during feeding and drinking was found in most MIo and SIo neurons. Directional information in both individual- and population-level was higher in feeding and was more robust in MIo. Following sensory loss, alterations in tongue kinematics were accompanied by changes in directional information in MIo and SIo, manifesting as modifications in both individual neuron tuning characteristics and the broader dynamics of population-level neural activity. Overall, this study advances our understanding of how OSMCx contributes to complex, coordinated control of naturalistic tongue movements. It expands our current knowledge of orofacial control to three dimensions and demonstrates the specificity and adaptability of population activity in MIo and SIo in response to different behavioral contexts. <<<
翻译
4.
龙海晨 (2025-06-01 11:58):
#paper Xu Y, Lan F, Bi Q, Li X, Wang Z, Li Y, Li P, Long H, Du L. Comprehensive analysis of the prognosis and tumor immune microenvironment of ubiquitin-conjugating enzyme transport-related gene UBE2C in hepatocellular carcinoma. Discov Oncol. 2025 May 23;16(1):884. doi: 10.1007/s12672-025-02675-0. PMID: 40410642; PMCID: PMC12102447.这是我发的第一篇纯生物信息的sci,是研究肝细胞癌中泛素结合酶E2 C(UBE2C)预后意义和肿瘤免疫反应的机制的。研究发现细胞周期相关蛋白与UBE2C基因表达之间存在很高的相关性,肝癌样品中的UBE2C基因表达水平明显高于正常样本,UBE2C表达高的肝癌患者的存活率低于低表达患者。高表达的UBE2C基因与免疫抑制分子数量增加有关。 以上研究结果因为实验条件的限制,基于纯生信分析,未能进行分子生物学实验有一定的局限性。
回到顶部