神经模仿线性系统 - 弹性和学习

论文标题

神经模仿线性系统 - 弹性和学习

Neuromimetic Linear Systems -- Resilience and Learning

论文作者

Sun, Zexin, Baillieul, John

论文摘要

在我们最近关于{\ em神经元素控制理论}的工作的基础上，报道了有关弹性和神经启发的量化的新结果。一词神经映射是指具有生物运动控制神经生物学特征的特征的模型。与以前的工作一样，重点是我们所说的{\ em completeplete}线性系统，其特征在于状态的尺寸比更多的输入和输出通道。本文的具体贡献包括提出的{\ em弹性}观察者，其操作可容忍输出通道间歇性甚至完整的辍学。将这些想法与我们先前关于弹性稳定性的工作联系起来，建立了弹性的分离原则。我们还提出了一个{\ em原理量化}，其中控制信号被编码为简单的离散输入，这些输入是通过多个输入通道共同起作用的，这些输入是过度模型的标志。与神经模拟范式保持一致，提出了一个{\ em仿真}问题，这又定义了最佳量化问题。讨论了几种可能的解决方案，包括直接组合优化，HEBBIAN类迭代学习算法以及深度Q学习方法（DQN）方法。对于所考虑的问题，机器学习方法的优化方法提供了有关最佳和附近次优的解决方案之间比较的宝贵见解。这些对于理解对间歇性和渠道辍学的韧性类型很有用，这些弹性和渠道辍学者早些时候证明了连续系统。

Building on our recent work on {\em neuromimetic control theory}, new results on resilience and neuro-inspired quantization are reported. The term neuromimetic refers to the models having features that are characteristic of the neurobiology of biological motor control. As in previous work, the focus is on what we call {\em overcomplete} linear systems that are characterized by larger numbers of input and output channels than the dimensions of the state. The specific contributions of the present paper include a proposed {\em resilient} observer whose operation tolerates output channel intermittency and even complete dropouts. Tying these ideas together with our previous work on resilient stability, a resilient separation principle is established. We also propose a {\em principled quantization} in which control signals are encoded as simple discrete inputs which act collectively through the many channels of input that are the hallmarks of the overcomplete models. Aligned with the neuromimetic paradigm, an {\em emulation} problem is proposed and this in turn defines an optimal quantization problem. Several possible solutions are discussed including direct combinatorial optimization, a Hebbian-like iterative learning algorithm, and a deep Q-learning (DQN) approach. For the problems being considered, machine learning approaches to optimization provide valuable insights regarding comparisons between optimal and nearby suboptimal solutions. These are useful in understanding the kinds of resilience to intermittency and channel dropouts that were earlier demonstrated for continuous systems.

下载PDF全文

下载文献需遵守相关版权规定

论文标题