Details

Application of Bayesian Networks and Reinforcement Learning in Intelligent Control Systems in Uncertain Environments (EI indexed)

Document type: Journal article

English title: Application of Bayesian Networks and Reinforcement Learning in Intelligent Control Systems in Uncertain Environments

Authors: Zhu, Liefeng[1]; Luo, Yongbiao[2]

Affiliations: [1] School of Mechanical and Electrical Engineering, Shaoxing University, Zhejiang, Shaoxing, 312000, China; [2] College of Yuanpei, Shaoxing University, Zhejiang, Shaoxing, 31200, China

Year: 2024

Volume: 35

Issue: 2

Pages: 1

Journal: Journal of Computers (Taiwan)

Indexed in: EI (accession number: 20242016075731)

Language: English

Keywords: Control systems - Intelligent control - Learning algorithms - Learning systems - Reinforcement learning - Uncertainty analysis

Abstract: Reinforcement learning is a machine learning paradigm that focuses on how an agent can perform actions in an environment to achieve a certain goal. The agent learns through interaction with the environment, observing the state and making decisions to maximize its reward. Reinforcement learning has wide applications in intelligent control systems. However, one limitation of reinforcement learning is the uncertainty in handling the environment model. Usually, reinforcement learning is performed without a clear model, which requires estimating environmental uncertainty and state transitions. Bayesian Networks are effective in modeling uncertainty, which can aid in establishing a probabilistic model of environmental dynamics. This allows for the integration of uncertainty information into the environmental model, leading to a more accurate understanding of the dynamic characteristics of the environment. In this study, we propose a reinforcement learning algorithm based on Bayesian Networks. We utilize optimal generalized residual differentiation, parallel integration causal directional reasoning, and other modeling techniques to address reinforcement learning tasks. The main idea is to utilize the prior distribution to estimate the uncertainty of unknown parameters. Then, the obtained observation information is used to calculate the posterior distribution in order to acquire knowledge. Experiments demonstrate that this approach is feasible in intelligent control systems operating in uncertain environments. © 2024 Codon Publications. All rights reserved.
