Literature shared by user 林海onrush.
31 shared papers found in total; this page shows items 21 - 31.
21.
林海onrush
(2023-03-31 23:17):
#paper, BloombergGPT: A Large Language Model for Finance, doi:10.48550/arXiv.2303.17564. The AI wave set off by ChatGPT has spread to the financial world: Bloomberg has released BloombergGPT, a large language model (LLM) built specifically for finance. According to the report Bloomberg published on March 30, the team assembled the largest domain-specific dataset to date and trained a 50-billion-parameter LLM specialized for the financial domain. Drawing on Bloomberg's extensive financial data sources, the model was trained on a 363-billion-token dataset and supports a wide range of tasks across the financial industry. It far outperforms existing models on financial tasks while remaining competitive in general-purpose settings. In the reported tests, BloombergGPT performs best on four of five tasks (ConvFinQA, FiQA SA, FPB, and Headline) and ranks second on NER (Named Entity Recognition), giving it a clear edge.
arXiv,
2023.
DOI: 10.48550/arXiv.2303.17564
Abstract:
>>>
The use of NLP in the realm of financial technology is broad and complex, with applications ranging from sentiment analysis and named entity recognition to question answering. Large Language Models (LLMs) have been shown to be effective on a variety of tasks; however, no LLM specialized for the financial domain has been reported in literature. In this work, we present BloombergGPT, a 50 billion parameter language model that is trained on a wide range of financial data. We construct a 363 billion token dataset based on Bloomberg's extensive data sources, perhaps the largest domain-specific dataset yet, augmented with 345 billion tokens from general purpose datasets. We validate BloombergGPT on standard LLM benchmarks, open financial benchmarks, and a suite of internal benchmarks that most accurately reflect our intended usage. Our mixed dataset training leads to a model that outperforms existing models on financial tasks by significant margins without sacrificing performance on general LLM benchmarks. Additionally, we explain our modeling choices, training process, and evaluation methodology. As a next step, we plan to release training logs (Chronicles) detailing our experience in training BloombergGPT.
<<<
22.
林海onrush
(2023-02-28 21:45):
#paper, doi: https://doi.org/10.1038/s41586-023-05859-2, Continuous Symmetry Breaking in a Two-dimensional Rydberg Array. Spontaneous symmetry breaking underlies the classification of phases of matter and their associated transitions. The nature of the broken symmetry determines many of a phase's qualitative properties, as the contrast between discrete and continuous symmetry breaking illustrates: unlike the discrete case, breaking a continuous symmetry produces gapless Goldstone modes, which, for instance, govern the thermodynamic stability of the ordered phase. Using a programmable Rydberg quantum simulator, the authors realize a two-dimensional dipolar XY model and demonstrate adiabatic preparation of correlated low-temperature states of both the XY ferromagnet and the XY antiferromagnet. In the ferromagnetic case they characterize the presence of long-range XY order. The work contributes to the many-body physics of XY interactions, complementing recent experiments that used the Rydberg-blockade mechanism to realize Ising-type interactions with discrete spin-rotation symmetry. The paper was recently accepted by Nature, attesting to the rigor and novelty of the work.
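For orientation, a minimal sketch of the dipolar XY Hamiltonian realized in such arrays (notation assumed here rather than copied from the paper; the sign of J selects the ferromagnet or antiferromagnet):
>>>
% Dipolar XY model on a 2D array: spin exchange decaying as 1/r^3
H = \frac{J}{2} \sum_{i \neq j} \frac{a^{3}}{r_{ij}^{3}}
    \left( \sigma_{i}^{x}\sigma_{j}^{x} + \sigma_{i}^{y}\sigma_{j}^{y} \right)
<<<
Here r_{ij} is the distance between sites i and j, a is the lattice spacing, and \sigma^{x,y} are Pauli matrices; the slow 1/r^3 tail of the dipolar interaction is what permits long-range XY order in two dimensions.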
Abstract:
>>>
Spontaneous symmetry breaking underlies much of our classification of phases of matter and their associated transitions. The nature of the underlying symmetry being broken determines many of the qualitative properties of the phase; this is illustrated by the case of discrete versus continuous symmetry breaking. Indeed, in contrast to the discrete case, the breaking of a continuous symmetry leads to the emergence of gapless Goldstone modes controlling, for instance, the thermodynamic stability of the ordered phase. Here, we realize a two-dimensional dipolar XY model that shows a continuous spin-rotational symmetry using a programmable Rydberg quantum simulator. We demonstrate the adiabatic preparation of correlated low-temperature states of both the XY ferromagnet and the XY antiferromagnet. In the ferromagnetic case, we characterize the presence of a long-range XY order, a feature prohibited in the absence of long-range dipolar interaction. Our exploration of the many-body physics of XY interactions complements recent works using the Rydberg-blockade mechanism to realize Ising-type interactions showing discrete spin rotation symmetry.
<<<
23.
林海onrush
(2023-01-31 22:08):
#paper https://www.nature.com/articles/s42256-022-00569-2, Deep transfer operator learning for partial differential equations under conditional shift ("transfer learning to 'solve' PDEs: deep transfer operator learning for PDEs under conditional shift"). Researchers from Brown University and Johns Hopkins University (JHU) propose a new transfer learning framework for task-specific learning under conditional shift (functional regression in partial differential equations) based on the deep operator network (DeepONet). They demonstrate the advantages of the method across a variety of transfer learning scenarios involving nonlinear PDEs under different conditions arising from changes in geometric domain and model dynamics. Despite considerable differences between source and target domains, the proposed framework learns heterogeneous tasks quickly and efficiently. The study was published in Nature Machine Intelligence.
Deep learning has been successfully applied to emulate computationally expensive physical processes described by PDEs, achieving excellent performance and thereby accelerating tasks such as uncertainty quantification, risk modeling, and design optimization. However, the predictive performance of such models is usually limited by the availability of labeled training data, and in many cases collecting a large enough labeled dataset is computationally intractable. Moreover, isolated learning, i.e., training a separate predictive model for each distinct but related task, can be very expensive. To address this bottleneck, knowledge can be shared between related domains within a framework known as transfer learning (TL): information from a model trained on a domain with sufficient labeled data (the source) is transferred to a different but closely related domain (the target) for which only a small amount of training data is available. Given the lack of TL methods for task-specific operator learning and uncertainty quantification, the researchers propose a new framework for efficient TL under conditional shift using neural operators.
In this work the researchers adopt the more general deep operator network (DeepONet), which makes it possible to learn the full operator and thus perform real-time prediction for arbitrary new inputs and complex domains. Importantly, the proposed transfer learning framework can identify PDE operators in domains with very limited labeled data. The main contributions can be summarized as follows (a sketch of the hybrid loss follows the list):
A new framework is proposed for transfer learning problems under conditional shift with deep neural operators.
The proposed framework enables fast and efficient task-specific PDE learning and uncertainty quantification.
Principles from RKHS theory and conditional embedding operators are used to construct a new hybrid loss function for fine-tuning the target model.
The advantages and limitations of the proposed framework are demonstrated on a variety of transfer learning problems, including distribution shifts caused by changes in domain geometry, model dynamics, material properties, and nonlinearity.
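To make the third contribution concrete, here is a minimal Python sketch of such a hybrid loss (assumed interfaces, not the authors' code): a pointwise regression term on the few labelled target samples plus an RKHS distance (a Gaussian-kernel MMD, standing in for the conditional-embedding statistical distance) between labelled target outputs and the surrogate's predictions on unlabelled target inputs.
>>>
import torch

def gaussian_kernel(x, y, sigma=1.0):
    # x: (n, d), y: (m, d) sample matrices -> (n, m) RBF kernel matrix.
    d2 = torch.cdist(x, y) ** 2
    return torch.exp(-d2 / (2 * sigma ** 2))

def mmd2(x, y, sigma=1.0):
    # Squared maximum mean discrepancy: distance between the two
    # empirical distributions after embedding into the kernel's RKHS.
    return (gaussian_kernel(x, x, sigma).mean()
            + gaussian_kernel(y, y, sigma).mean()
            - 2 * gaussian_kernel(x, y, sigma).mean())

def hybrid_loss(model, u_lab, y_lab, s_lab, u_unlab, y_unlab, lam=0.1):
    # Pointwise matching on the few labelled target samples ...
    mse = torch.mean((model(u_lab, y_lab) - s_lab) ** 2)
    # ... plus a distributional term pulling the surrogate's predictions
    # on unlabelled target inputs toward the labelled target outputs.
    return mse + lam * mmd2(s_lab, model(u_unlab, y_unlab))
<<<
Fine-tuning only the task-specific layers of the target DeepONet with such a loss is what lets the framework match individual samples while preserving the global shape of the target conditional distribution.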
Abstract:
>>>
Transfer learning enables the transfer of knowledge gained while learning to perform one task (source) to a related but different task (target), hence addressing the expense of data acquisition and labelling, potential computational power limitations and dataset distribution mismatches. We propose a new transfer learning framework for task-specific learning (functional regression in partial differential equations) under conditional shift based on the deep operator network (DeepONet). Task-specific operator learning is accomplished by fine-tuning task-specific layers of the target DeepONet using a hybrid loss function that allows for the matching of individual target samples while also preserving the global properties of the conditional distribution of the target data. Inspired by conditional embedding operator theory, we minimize the statistical distance between labelled target data and the surrogate prediction on unlabelled target data by embedding conditional distributions onto a reproducing kernel Hilbert space. We demonstrate the advantages of our approach for various transfer learning scenarios involving nonlinear partial differential equations under diverse conditions due to shifts in the geometric domain and model dynamics. Our transfer learning framework enables fast and efficient learning of heterogeneous tasks despite considerable differences between the source and target domains.
<<<
24.
林海onrush
(2023-01-27 01:30):
#paper, Twist: Sound Reasoning for Purity and Entanglement in Quantum Programs, DOI: 10.48550/arXiv.2205.02287
The authors introduce the notion of expression purity as a basis for reasoning about entanglement in quantum programs. Qubits are handled much like pointers into classical memory and are manipulated by applying operations called gates. Entanglement, the peculiarly quantum phenomenon in which the measurement outcomes of qubits become correlated, can determine both the correctness of algorithms and the applicability of programming patterns. Formalizing purity gives a core tool for automatically reasoning about entanglement: a pure expression is one whose evaluation is unaffected by the measurement outcomes of qubits it does not own. The paper's main contribution is Twist, the first language with a type system for sound reasoning about purity, which lets developers identify pure expressions with type annotations. The authors show that Twist can express quantum algorithms, catch programming errors in them, and support programs that several other languages disallow, all with a runtime verification overhead below 3.5%. Overall, this is foundational and meaningful work; a toy illustration of the underlying entanglement check follows.
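Purity in Twist is a language-level property, but the entanglement question underneath has a small linear-algebra core. A hypothetical Python sketch (illustration only, not Twist): decide whether a two-qubit state factors into a product of single-qubit states by checking its Schmidt rank.
>>>
import numpy as np

def is_unentangled(state, tol=1e-9):
    # Reshape the 4-component state vector into a 2x2 matrix and look
    # at its singular values: exactly one nonzero singular value
    # (Schmidt rank 1) means |psi> = |a> (x) |b>, i.e. no entanglement.
    m = np.asarray(state, dtype=complex).reshape(2, 2)
    s = np.linalg.svd(m, compute_uv=False)
    return int(np.sum(s > tol)) == 1

print(is_unentangled([1, 0, 0, 0]))              # |00>: True
print(is_unentangled([2**-0.5, 0, 0, 2**-0.5]))  # Bell state: False
<<<
Twist's contribution is to establish such facts statically through types and purity assertions, falling back on runtime verification only where static analysis is insufficient.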
arXiv,
2022.
DOI: 10.48550/arXiv.2205.02287
Abstract:
>>>
Quantum programming languages enable developers to implement algorithms for quantum computers that promise computational breakthroughs in classically intractable tasks. Programming quantum computers requires awareness of entanglement, the phenomenon in which measurement outcomes of qubits are correlated. Entanglement can determine the correctness of algorithms and suitability of programming patterns. In this work, we formalize purity as a central tool for automating reasoning about entanglement in quantum programs. A pure expression is one whose evaluation is unaffected by the measurement outcomes of qubits that it does not own, implying freedom from entanglement with any other expression in the computation. We present Twist, the first language that features a type system for sound reasoning about purity. The type system enables the developer to identify pure expressions using type annotations. Twist also features purity assertion operators that state the absence of entanglement in the output of quantum gates. To soundly check these assertions, Twist uses a combination of static analysis and runtime verification. We evaluate Twist's type system and analyses on a benchmark suite of quantum programs in simulation, demonstrating that Twist can express quantum algorithms, catch programming errors in them, and support programs that several languages disallow, while incurring runtime verification overhead of less than 3.5%.
<<<
25.
林海onrush
(2022-12-31 23:26):
#paper, A Data-driven Sequential Localization Framework for Big Telco Data, IEEE Transactions on Knowledge and Data Engineering (2021), DOI: 10.1109/TKDE.2019.2961657
The rapid growth of telecommunication infrastructure has produced an enormous accumulation of measurement report (MR) data, generated by mobile objects and stored whenever they connect to data services. Geo-tagging, or localizing, such MR data is believed to have a profound effect on the optimization of telco and traffic networks. To handle the data-intensive workloads in the learning process, the Huawei Noah's Ark team uses materialized views for efficient online localization and lightweight indexing techniques for periodic parameter tuning, improving efficiency and scalability. Results on real data show that, compared with state-of-the-art solutions, this framework improves median localization error by 58.8%.
Highlights: the paper briefly introduces the hidden Markov model (HMM), which captures the link between two kinds of stochastic processes: an unobserved state-transition process and an observation process consisting of observable variables emitted from each hidden state. Several experiments validate the effectiveness of the machine-learned single-point localization model, the effectiveness of the emission- and transition-probability solutions, and the performance of the sequential localization system against the latest baselines; further experiments demonstrate the efficiency of the proposed indexing techniques and the effect of parameter tuning on system performance. The authors propose a data-driven framework for sequential localization of telco data, equipped with a comprehensive suite of machine learning and data management techniques. Compared with the latest sequential localization methods, the framework achieves a 58.8% improvement in median error, making it superior in accuracy and adoptability, and effective data access and indexing methods are proposed to support the data-intensive computations involved in learning. A Viterbi-style decoding sketch for the HMM formulation follows.
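A minimal Python sketch of Viterbi decoding for such an HMM formulation (names and arrays are hypothetical; in the paper the emission and transition probabilities are learned from MR and GPS data):
>>>
import numpy as np

def viterbi(log_init, log_trans, log_emit):
    # log_init:  (S,)   log prior over S grid-cell states
    # log_trans: (S, S) log transition probabilities between cells
    # log_emit:  (T, S) log emission score of each MR record per cell
    T, S = log_emit.shape
    dp = log_init + log_emit[0]          # best log-prob ending in each state
    back = np.zeros((T, S), dtype=int)   # backpointers
    for t in range(1, T):
        cand = dp[:, None] + log_trans   # (previous state, current state)
        back[t] = cand.argmax(axis=0)
        dp = cand.max(axis=0) + log_emit[t]
    path = [int(dp.argmax())]            # best final state, then walk back
    for t in range(T - 1, 0, -1):
        path.append(int(back[t, path[-1]]))
    return path[::-1]                    # most probable cell sequence
<<<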
IF: 8.900 (Q1)
IEEE Transactions on Knowledge and Data Engineering,
2019.
DOI: 10.1109/TKDE.2019.2961657
Abstract:
>>>
The proliferation of telco networks and mobile terminals brings the accumulation of tremendous amounts of measurement report (MR) data at a rapid pace. The MR data is generated by mobile objects while connecting to data services and is stored in backend data centers. To geo-tag or localize such MR data is believed to have a profound effect on the analytics and optimizations of telco and traffic networks. However, MR records are noisy and partial observations of mobile objects' geo-locations and hence pose challenges to accurate telco data localization. There have been quite a few attempts. Single-point localization methods map an MR record to a location, but come out with limited accuracies due to the ignorance of the spatiotemporal coherence of successive MR records. Recent efforts on sequential localization techniques alleviate this by mapping a sequence of MR records to a trajectory. However, existing solutions often rest on assumptions about specific models, e.g., mobility and signal strength distributions, or prior knowledge of the topology space, e.g., road networks, limiting deployment in practice. To this end, we propose a data-driven framework to tackle the challenges in sequential telco localization. We solely use raw MR records and a public third-party GPS dataset for the learning of the correlations between mobile objects' locations and MR records, requiring no model assumptions or prior knowledge. To handle the data-intensive workloads during the learning process, we use materialized views for efficient online localization and lightweight indexing techniques for periodic parameter tuning, in order to improve efficiency and scalability. Results on real data show that our solution achieves a 58.8 percent improvement in median localization errors compared with state-of-the-art sequential localization techniques that require hypothesis models and prior knowledge, making our solution superior in terms of effectiveness, efficiency, and employability.
<<<
26.
林海onrush
(2022-11-30 21:51):
#paper, https://doi.org/10.48550/arXiv.2211.16197, FJMP: Factorized Joint Multi-Agent Motion Prediction over Learned Directed Acyclic Interaction Graphs. Targeting trajectory prediction for autonomous driving, this work proposes FJMP, a factorized joint multi-agent motion prediction framework that learns directed acyclic interaction graphs. Future scene interaction dynamics are modeled as a sparse directed interaction graph whose edges denote explicit interactions between agents; the graph is pruned into a directed acyclic graph (DAG), and the joint prediction task is decomposed, according to the partial ordering of the DAG, into a sequence of marginal and conditional predictions whose joint future trajectories are decoded with a directed acyclic graph neural network (DAGNN). On the INTERACTION and Argoverse 2 datasets, FJMP is shown to produce more accurate and scene-consistent joint trajectory predictions than non-factorized approaches, and it achieves state of the art on the interactive multi-agent INTERACTION benchmark. A sketch of the DAG-ordered factorized decoding follows.
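A minimal Python sketch of the factorization idea (interfaces are hypothetical, not the FJMP code): predict agents in topological order over the pruned DAG, conditioning each agent's decoder on the already-predicted trajectories of its parents.
>>>
from graphlib import TopologicalSorter

def factorized_predict(dag_parents, decoders, scene_feats):
    # dag_parents: dict agent -> list of parent (influencer) agents;
    # graphlib expects exactly this node -> predecessors mapping.
    order = TopologicalSorter(dag_parents).static_order()
    trajs = {}
    for agent in order:
        parent_trajs = [trajs[p] for p in dag_parents.get(agent, [])]
        # Roots get a marginal prediction; the rest are conditional
        # on their parents, so the product forms a joint prediction.
        trajs[agent] = decoders[agent](scene_feats, parent_trajs)
    return trajs
<<<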
arXiv,
2022.
DOI: 10.48550/arXiv.2211.16197
Abstract:
>>>
Predicting the future motion of road agents is a critical task in an autonomous driving pipeline. In this work, we address the problem of generating a set of scene-level, or joint, future trajectory predictions in multi-agent driving scenarios. To this end, we propose FJMP, a Factorized Joint Motion Prediction framework for multi-agent interactive driving scenarios. FJMP models the future scene interaction dynamics as a sparse directed interaction graph, where edges denote explicit interactions between agents. We then prune the graph into a directed acyclic graph (DAG) and decompose the joint prediction task into a sequence of marginal and conditional predictions according to the partial ordering of the DAG, where joint future trajectories are decoded using a directed acyclic graph neural network (DAGNN). We conduct experiments on the INTERACTION and Argoverse 2 datasets and demonstrate that FJMP produces more accurate and scene-consistent joint trajectory predictions than non-factorized approaches, especially on the most interactive and kinematically interesting agents. FJMP ranks 1st on the multi-agent test leaderboard of the INTERACTION dataset.
<<<
27.
林海onrush
(2022-10-29 13:58):
#paper, Model Evaluation, Model Selection, and Algorithm Selection in Machine Learning, url: https://arxiv.org/abs/1811.12808#
This paper reviews the techniques used for three tasks: model evaluation, model selection, and algorithm selection, and discusses the main strengths and weaknesses of each technique with reference to theoretical and empirical studies. It then gives recommendations to promote best practices in machine learning research and applications. A detailed analysis of the paper is in the PDF below; a nested cross-validation sketch follows.
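One of the paper's headline recommendations for small datasets is nested cross-validation for algorithm comparison. A minimal scikit-learn sketch (dataset and model chosen here purely for illustration):
>>>
from sklearn.datasets import load_breast_cancer
from sklearn.model_selection import GridSearchCV, cross_val_score
from sklearn.pipeline import make_pipeline
from sklearn.preprocessing import StandardScaler
from sklearn.svm import SVC

X, y = load_breast_cancer(return_X_y=True)

# Inner loop: hyperparameter selection. Outer loop: near-unbiased
# estimate of the tuned algorithm's generalization performance.
inner = GridSearchCV(
    make_pipeline(StandardScaler(), SVC()),
    param_grid={"svc__C": [0.1, 1, 10]},
    cv=5,
)
outer_scores = cross_val_score(inner, X, y, cv=5)
print(outer_scores.mean(), outer_scores.std())
<<<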
arXiv,
2018.
DOI: 10.48550/arXiv.1811.12808
Abstract:
>>>
The correct use of model evaluation, model selection, and algorithm selection techniques is vital in academic machine learning research as well as in many industrial settings. This article reviews different techniques that can be used for each of these three subtasks and discusses the main advantages and disadvantages of each technique with references to theoretical and empirical studies. Further, recommendations are given to encourage best yet feasible practices in research and applications of machine learning. Common methods such as the holdout method for model evaluation and selection are covered, which are not recommended when working with small datasets. Different flavors of the bootstrap technique are introduced for estimating the uncertainty of performance estimates, as an alternative to confidence intervals via normal approximation if bootstrapping is computationally feasible. Common cross-validation techniques such as leave-one-out cross-validation and k-fold cross-validation are reviewed, the bias-variance trade-off for choosing k is discussed, and practical tips for the optimal choice of k are given based on empirical evidence. Different statistical tests for algorithm comparisons are presented, and strategies for dealing with multiple comparisons such as omnibus tests and multiple-comparison corrections are discussed. Finally, alternative methods for algorithm selection, such as the combined F-test 5x2 cross-validation and nested cross-validation, are recommended for comparing machine learning algorithms when datasets are small.
<<<
28.
林海onrush
(2022-10-29 13:51):
#paper, Formal Algorithms for Transformers, url: https://arxiv.org/pdf/2207.09238.pdf. Over the past five-plus years, transformers have shown astonishing results across many fields, yet descriptions of transformer algorithms have mostly relied on diagrams, prose, or explanations of particular optimizations; no paper had given reasonably complete algorithmic pseudocode. DeepMind here provides formal pseudocode for the transformer family. A detailed walkthrough is in the PDF below; a minimal attention sketch follows.
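As a taste of what the paper formalizes, a minimal Python sketch of single-head scaled dot-product attention (my notation, not the paper's exact pseudocode):
>>>
import numpy as np

def attention(Q, K, V):
    # Q: (n_q, d_k) queries, K: (n_k, d_k) keys, V: (n_k, d_v) values.
    # Each query returns a softmax-weighted mixture of the values.
    scores = Q @ K.T / np.sqrt(Q.shape[-1])        # scaled similarities
    scores -= scores.max(axis=-1, keepdims=True)   # numerical stability
    weights = np.exp(scores)
    weights /= weights.sum(axis=-1, keepdims=True) # rows sum to one
    return weights @ V                             # (n_q, d_v)
<<<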
arXiv,
2022.
DOI: 10.48550/arXiv.2207.09238
Abstract:
>>>
This document aims to be a self-contained, mathematically precise overview of transformer architectures and algorithms (*not* results). It covers what transformers are, how they are trained, what they are used for, their key architectural components, and a preview of the most prominent models. The reader is assumed to be familiar with basic ML terminology and simpler neural network architectures such as MLPs.
<<<
29.
林海onrush
(2022-10-29 13:25):
#paper, Causal Discovery with Reinforcement Learning, paper: https://arxiv.org/pdf/1906.04477.pdf, official video: https://iclr.cc/virtual_2020/poster_S1g2skStPB.html
Causal research, a likely next hot topic, has attracted broad attention in the machine learning / deep learning community. A classic problem in the field is causal discovery: recovering the underlying causal graph structure from passively observed data.
This paper from Huawei's Noah's Ark Lab was accepted at ICLR 2020 with full marks from reviewers. The lab's causality team applies reinforcement learning to score-based causal discovery: an encoder-decoder network built on self-attention explores the relationships in the data, the acyclicity condition on causal structures is incorporated, and the network parameters are trained with a policy-gradient RL algorithm, ultimately yielding a causal graph. On data models commonly used in academia, the method outperforms alternatives on medium-sized graphs, including both traditional causal discovery algorithms and recent gradient-based ones. It is also very flexible and can be combined with an arbitrary score function; a sketch of the reward follows.
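A minimal Python sketch of the reward shape described above (score function and penalty weights are placeholders; the smooth acyclicity measure follows the NOTEARS characterization h(A) = tr(e^{A∘A}) − d, which is zero exactly on DAGs):
>>>
import numpy as np
from scipy.linalg import expm

def acyclicity(A):
    # h(A) = tr(exp(A * A)) - d is 0 iff the adjacency matrix A is acyclic.
    return np.trace(expm(A * A)) - A.shape[0]

def reward(A, score_fn, lam1=1.0, lam2=10.0):
    # Predefined graph score (e.g. BIC) minus two penalty terms that
    # push generated adjacency matrices toward acyclicity, mirroring
    # the paper's score-plus-acyclicity-penalty reward.
    h = acyclicity(A)
    return score_fn(A) - lam1 * float(h > 1e-8) - lam2 * h
<<<
The RL agent then searches over adjacency matrices for the best-rewarded graph rather than learning a reusable policy.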
arXiv,
2019.
DOI: 10.48550/arXiv.1906.04477
Abstract:
>>>
Discovering causal structure among a set of variables is a fundamental problem in many empirical sciences. Traditional score-based causal discovery methods rely on various local heuristics to search for a Directed Acyclic Graph (DAG) according to a predefined score function. While these methods, e.g., greedy equivalence search, may have attractive results with infinite samples and certain model assumptions, they are usually less satisfactory in practice due to finite data and possible violation of assumptions. Motivated by recent advances in neural combinatorial optimization, we propose to use Reinforcement Learning (RL) to search for the DAG with the best scoring. Our encoder-decoder model takes observable data as input and generates graph adjacency matrices that are used to compute rewards. The reward incorporates both the predefined score function and two penalty terms for enforcing acyclicity. In contrast with typical RL applications where the goal is to learn a policy, we use RL as a search strategy and our final output would be the graph, among all graphs generated during training, that achieves the best reward. We conduct experiments on both synthetic and real datasets, and show that the proposed approach not only has an improved search ability but also allows a flexible score function under the acyclicity constraint.
<<<
30.
林海onrush
(2022-09-30 22:25):
#paper arXiv, 2209.00796 (2022), Diffusion Models: A Comprehensive Survey of Methods and Applications. Diffusion models perform impressively in many domains, and because applications in different fields have produced different variants, this paper systematically surveys diffusion-model research across computer vision, NLP, waveform signal processing, multi-modal modeling, molecular graph modeling, time-series modeling, and adversarial purification. The main contributions: a new, systematic taxonomy that divides diffusion models into three classes (sampling-speed enhancement, maximum-likelihood enhancement, and data-generalization enhancement) and their applications into the seven areas above; and a comprehensive overview of modern diffusion models and their applications, presenting the main improvements of each variant, comparing against the original model where necessary, and summarizing the corresponding papers. The basic idea of a diffusion model is a forward diffusion process that systematically perturbs the data distribution and a learned reverse diffusion process that restores it, which yields a highly flexible and tractable generative model; a sketch of the forward process and training loss follows.
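A minimal Python sketch of the DDPM instantiation of that idea (schedule values and the model are placeholders): the closed-form forward noising q(x_t | x_0) and the noise-prediction training loss.
>>>
import torch

T = 1000
betas = torch.linspace(1e-4, 0.02, T)            # variance schedule
alphas_bar = torch.cumprod(1.0 - betas, dim=0)   # cumulative signal level

def forward_noise(x0, t):
    # q(x_t | x0) = N(sqrt(abar_t) * x0, (1 - abar_t) I), sampled directly.
    noise = torch.randn_like(x0)
    ab = alphas_bar[t].view(-1, *([1] * (x0.dim() - 1)))
    return ab.sqrt() * x0 + (1 - ab).sqrt() * noise, noise

def simple_loss(model, x0):
    # Train the network to predict the injected noise at a random step.
    t = torch.randint(0, T, (x0.shape[0],))
    xt, noise = forward_noise(x0, t)
    return torch.mean((model(xt, t) - noise) ** 2)
<<<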
arXiv,
2022.
DOI: 10.48550/arXiv.2209.00796
Abstract:
>>>
Diffusion models are a class of deep generative models that have shown impressive results on various tasks with a solid theoretical foundation. Despite their demonstrated success over state-of-the-art approaches, diffusion models often entail costly sampling procedures and sub-optimal likelihood estimation. Significant efforts have been made to improve the performance of diffusion models in various aspects. In this article, we present a comprehensive review of existing variants of diffusion models. Specifically, we provide the taxonomy of diffusion models and categorize them into three types: sampling-acceleration enhancement, likelihood-maximization enhancement, and data-generalization enhancement. We also introduce the other generative models (i.e., variational autoencoders, generative adversarial networks, normalizing flow, autoregressive models, and energy-based models) and discuss the connections between diffusion models and these generative models. Then we review the applications of diffusion models, including computer vision, natural language processing, waveform signal processing, multi-modal modeling, molecular graph generation, time series modeling, and adversarial purification. Furthermore, we propose new perspectives pertaining to the development of generative models. Github: this https URL.
<<<
31.
林海onrush
(2022-08-07 22:47):
#paper arXiv:2207.03530v1 [cs.RO] 7 Jul 2022, VMAS: A Vectorized Multi-Agent Simulator for Collective Robot Learning, https://deepai.org/publication/vmas-a-vectorized-multi-agent-simulator-for-collective-robot-learning
The University of Cambridge proposes VMAS, a vectorized framework for multi-agent reinforcement learning.
While many multi-robot coordination problems can be solved optimally by exact algorithms, the solutions often do not scale with the number of robots. Multi-agent reinforcement learning (MARL) is gaining increasing attention in the robotics community as a promising approach to such problems, yet tools that can quickly and efficiently find solutions to large-scale collective learning tasks are still lacking. This work introduces VMAS, an open-source framework designed for efficient MARL benchmarking. It comprises a vectorized 2D physics engine written in PyTorch and a suite of twelve challenging multi-robot scenarios; additional scenarios can be implemented through a simple modular interface.
The paper shows how vectorization enables parallel simulation on accelerated hardware without added complexity, and a comparison with the current standard framework, OpenAI MPE, shows VMAS to be more than 100x faster. The authors also use VMAS to run a range of benchmarks that expose challenges for existing algorithms.
VMAS can execute 30,000 parallel simulations in under 10 seconds, a speedup of more than 100x. Using VMAS's RLlib interface, the authors benchmark the multi-robot scenarios with several MARL algorithms based on Proximal Policy Optimization (PPO); the scenarios prove challenging in orthogonal ways for state-of-the-art MARL algorithms. The VMAS framework is available for reproduction at https://github.com/proroklab/VectorizedMultiAgentSimulator. A sketch of the vectorization idea follows.
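A minimal PyTorch sketch of the vectorization idea (not the VMAS engine itself): give every state tensor a leading environment dimension, and one integration step advances thousands of independent 2D worlds in a single set of tensor operations.
>>>
import torch

num_envs, num_agents, dt = 30_000, 4, 0.1
device = "cuda" if torch.cuda.is_available() else "cpu"

# State of all environments at once: (envs, agents, 2) for x/y.
pos = torch.zeros(num_envs, num_agents, 2, device=device)
vel = torch.zeros(num_envs, num_agents, 2, device=device)

def step(actions, drag=0.05):
    # actions: (envs, agents, 2) force commands from the policies.
    # Semi-implicit Euler: update velocity, then position, batched.
    global pos, vel
    vel = (1 - drag) * vel + actions * dt
    pos = pos + vel * dt
    return pos

# Random policies acting in all 30,000 worlds in one call.
obs = step(torch.randn(num_envs, num_agents, 2, device=device))
<<<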
arXiv,
2022.
DOI: 10.48550/arXiv.2207.03530
Abstract:
>>>
While many multi-robot coordination problems can be solved optimally by exact algorithms, solutions are often not scalable in the number of robots. Multi-Agent Reinforcement Learning (MARL) is gaining increasing attention in the robotics community as a promising solution to tackle such problems. Nevertheless, we still lack the tools that allow us to quickly and efficiently find solutions to large-scale collective learning tasks. In this work, we introduce the Vectorized Multi-Agent Simulator (VMAS). VMAS is an open-source framework designed for efficient MARL benchmarking. It is comprised of a vectorized 2D physics engine written in PyTorch and a set of twelve challenging multi-robot scenarios. Additional scenarios can be implemented through a simple and modular interface. We demonstrate how vectorization enables parallel simulation on accelerated hardware without added complexity. When comparing VMAS to OpenAI MPE, we show how MPE's execution time increases linearly in the number of simulations while VMAS is able to execute 30,000 parallel simulations in under 10s, proving more than 100x faster. Using VMAS's RLlib interface, we benchmark our multi-robot scenarios using various Proximal Policy Optimization (PPO)-based MARL algorithms. VMAS's scenarios prove challenging in orthogonal ways for state-of-the-art MARL algorithms. The VMAS framework is available at this https URL. A video of VMAS scenarios and experiments is available at this https URL.
<<<