For reinforcement learning control in continuous Spaces, a Q-learning method based on a self-organizing fuzzy RBF (radial basis function) network is proposed.
针对连续空间下的强化学习控制问题,提出了一种基于自组织模糊rbf网络的Q学习方法。
According to the problem of mobile robot navigation in the unknown environment, a hybrid control method based on hierarchical reinforcement learning (HRL) is proposed.
针对未知环境下的移动机器人导航问题,本文提出了一种基于分层式强化学习的混合式控制方法。
An average reward reinforcement learning algorithm for control Markov chains is presented.
讨论平均准则控制马氏链的强化学习算法。
应用推荐