
全感知条件下基于奖励塑形的Q-learning算法及仿真
陈嘉楠, 彭军海, 黄华
Q-learning Algorithm and Simulation Based on Reward Shaping Under Comprehensive Recognition