来自杂志 PLOS Computational Biology 的文献。
当前共找到 2 篇文献分享。
1.
DeDe宝
(2026-02-05 02:37):
#paper, https://doi.org/10.1371/journal.pcbi.1013879 Information uncertainty influences learning strategy from sequentially delayed rewards. Plos Computational Biology.
本研究探索信息不确定性对延时奖励学习策略的影响。奖励延迟出现时,人类如何将奖励与先前事件关联?研究者操纵了奖励信息的不确定性:分离条件(即时奖励和延迟奖励的信息则分别呈现)和整合条件(奖励以即时奖励和延迟奖励的总和形式呈现),以探索信息不确定性对学习策略的影响。研究主要比较了三种模型:回顾模型(Elg,基于时间序列更新先前选择的价值)、前瞻模型(Tab,仅系统更新与奖励相关的过往选择)和混合模型(Hybrid,通过β参数调整两个模型的权重)。行为数据分析表明,被试能够掌握延迟奖励的关联规则,且低信息不确定性有助于促进学习表现,初始的低不确定性环境的学习体验会形成 “认知启动”,持续影响后续高不确定性环境中的策略使用。模型比较结果表明,两种模型都能捕捉被试核心行为特征,低信息不确定性时,前瞻模型更能解释被试行为;高不确定性时,回顾模型成为有效补充。上述结果表明人类会根据信息不确定性灵活切换混合学习策略。
PLOS Computational Biology,
2026-2-2.
DOI: 10.1371/journal.pcbi.1013879
Abstract:
When receiving a reward after a sequence of multiple events, how do we determine which event caused the reward? This problem, known as temporal credit assignment, can be difficult for …
>>>
When receiving a reward after a sequence of multiple events, how do we determine which event caused the reward? This problem, known as temporal credit assignment, can be difficult for humans to solve given the temporal uncertainty in the environment. Research to date has attempted to isolate dimensions of delay and reward during decision-making, but algorithmic solutions to temporal learning problems and the effect of uncertainty on learning remain underexplored. To further our understanding, we adapted a reward learning task that creates a temporal credit assignment problem by combining sequentially delayed rewards, intervening events, and varying uncertainty via the amount of information presented during feedback. Using computational modeling, two learning strategies were developed: an eligibility trace, whereby previously selected actions are updated as a function of the temporal sequence, and a tabular update, whereby only systematically related past actions (rather than unrelated intervening events) are updated. We hypothesized that reduced information uncertainty would correlate with increased use of the tabular strategy, given the model’s capacity to incorporate additional feedback information. Both models effectively learned the task, and predicted choices made by participants (N = 142) as well as specific behavioral signatures of credit assignment. Consistent with our hypothesis, the tabular model outperformed the eligibility model under low information uncertainty, as evidenced by more accurate predictions of participants’ behavior and an increase in tabular weight. These findings provide new insights into the mechanisms implemented by humans to solve temporal credit assignment and adapt their strategy in varying environments.
<<<
翻译
2.
DeDe宝
(2026-01-05 05:19):
#paper doi: https://doi.org/10.1371/journal.pcbi.1013826 Long-term perceptual priors drive confidence bias that favors prior-congruent evidence. Plos Computational Biology
该研究探究了长期感知先验(long-term perceptual priors)如何影响人类的感知决策与信心判断,并揭示了信心判断中存在偏向先验一致证据的偏差机制。基于贝叶斯框架的模型认为感知决策(及其置信度)基于先验和似然的基于精度的加权,然而,一些研究发现先验对置信度影响更大。研究使用Confidence Forced-Choice Task以探究感知任务重长期先验的影响,在该任务中,被试需要连续两次判断刺激的运动方向并判断在哪一次判断的置信度更高。长期感知先验可能与判断边界垂直(不提供额外偏向)或者落在其中一个方向的区域(提供额外偏向)。研究结果表明,被试更倾向于认为与先验一致的判断置信度更高,说明长期认知先验对置信度的影响存在额外的确认性偏差。研究者提出WPPCE 模型(加权后验与先验一致证据)解释观察到的信心偏差。
PLOS Computational Biology,
2025-12-22.
DOI: 10.1371/journal.pcbi.1013826
Abstract:
According to the Bayesian framework, both our perceptual decisions and confidence about those decisions are based on the precision-weighted integration of prior expectations and incoming sensory information. While it is …
>>>
According to the Bayesian framework, both our perceptual decisions and confidence about those decisions are based on the precision-weighted integration of prior expectations and incoming sensory information. While it is generally assumed that priors influence both decisions and confidence in the same way, previous work has found priors to have a stronger impact at the confidence level, challenging this assumption. However, these patterns were found for high-level probabilistic expectations that are flexibly induced in the task context. It remains unclear whether this generalizes to low-level perceptual priors that are naturally formed through long term exposure. Here we investigated human participants’ confidence in decisions made under the influence of a long-term perceptual prior: the slow-motion prior. Participants viewed tilted moving-line stimuli for which the slow-motion prior biases the perceived motion direction. On each trial, they made two consecutive motion direction decisions followed by a confidence decision. We contrasted two conditions – one in which the prior impacted discrimination performance, and one in which it did not. We found a confidence bias favoring the condition in which the prior influenced discrimination decisions, even after accounting for performance differences. Computational modeling revealed this effect to be best explained by confidence using the prior-congruent evidence as an additional cue, beyond the posterior evidence used in the perceptual decision. This is in agreement with a confirmatory confidence bias favoring evidence congruent with low-level perceptual priors, revealing that, in line with high-level expectations, even long-term priors have a greater influence on the metacognitive level than on perceptual decisions.
<<<
翻译