【markov_decision_process】什么意思_英语markov_decision_process的翻译_音标_读音_用法_例句_在线翻译

markov decision process

马尔可夫决策过程

马尔可夫决策过程(MDP ,Markov Decision Processes) 是强化学习的数学模型,因此,通常顺序型任务中的强化学习问题可以通过马尔可夫决策过程建模 [5]...

基于282个网页-相关网页

决策过程

其实这是一个典型的马尔科夫决策过程(Markov decision process,MDP)。马尔科夫决策过程(Markov decision process,MDP)：Agent 可感知到其环境的不同状态集合，并且有它可执行的动作集合。

基于78个网页-相关网页

马尔科夫决策过程

在最后，我们对马尔科夫决策过程（MarKOv Decision Process）进行一个简单的介绍，这一过程是所有增强学习的基础，并且人们认为，一切增强学习的问题都可以转化为一个马尔科夫决策过程。

基于26个网页-相关网页

Markov决策过程

...一个Agent（通常是一个机器人）选择菜个动作来改变状态，那么决策问题可以描述为一个Markov决策过程（Markov Decision Process，MDP）。MDP的优点在于可以采用决策论在行动不确定上进行量化决策。

基于12个网页-相关网页

短语

Partially Observable Markov Decision Process 马尔可夫决策过程 ; 部分可观测马尔可夫决策过程 ; 夫决策过程 ; 夫判决过程

Semi-Markov Decision Process 半马尔可夫决策过程 ; 半Markov决策过程

Partial Observable Markov Decision Process 部分可观测的马尔 ; 部分可观测马氏决策过程

factored markov decision process 可分解马尔可夫决策过程

Bayesian Markov decision process 贝叶斯马尔可夫决策过程

更多收起网络短语

计算机科学技术 | 电子、通信与自动控制技术 | 经济学

·2,447,543篇论文数据，部分数据来源于NoteExpress

abstract: Markov decision processes (MDPs), named after Andrey Markov, provide a mathematical framework for modeling decision making in situations where outcomes are partly random and partly under the control of a decision maker. MDPs are useful for studying a wide range of optimization problems solved via dynamic programming and reinforcement learning.

以上来源于: WordNet

The scheme is formulated by Constrained Markov Decision Process (CMDP), which is solved by Linearly Programming (LP).

该方案被建模为约束马尔可夫决策过程(CMDP)，并采用线性规划(LP)求解此CMDP。

youdao
The optimal model of inspection and maintenance for the deteriorating system is presented with the semi-Markov decision process.

提出了一类基于半马氏决策过程的劣化失效系统检测与维修优化模型。

youdao
Reinforcement learning based on Markov decision process is a way of on-line learning, which can be applied to single agent environment.

基于马尔科夫过程的强化学习作为一种在线学习方式，能够很好地应用于单智能体环境中。

youdao

更多双语例句

应用推荐

$firstVoiceSent

- 来自原声例句