Command Control and Simulation >
Multi-target Point-Track Association Method Based on Reinforcement Learning
Received date: 2021-11-18
Request revised date: 2021-12-29
Online published: 2022-04-28
Copyright
Aiminging at the problem of multi-target point-track association in dense clutter environment, based on the reinforcement learning(RL) method, a multi-target point-track association method based on Q-learning is proposed. First, according to the movement state of the target in the whole process, a Markov decision process(MDP) model is established. Secondly, the paper uses the degree of correlation between the states to form a strategy function, selects the correct action, and sets the corresponding reward function. Finally, considering that false measurements are difficult to distinguish when the clutter is dense, combined with the prior information of the target, the Q-meter re-learning link is added to further optimize the correlation accuracy. The simulation results show that in both non-maneuvering and strong maneuvering environments, the method in this paper can accurately correlate to the measurement of the target, and has a better point-track-track correlation performance.
DING Guo-sheng , CAI Min-jie . Multi-target Point-Track Association Method Based on Reinforcement Learning[J]. Command Control and Simulation, 2022 , 44(2) : 43 -48 . DOI: 10.3969/j.issn.1673-3819.2022.02.009
| [1] |
黄小平, 王岩. 卡尔曼滤波原理及应用—MATLAB 仿真[M]. 北京: 电子工业出版社, 2015.
|
| [2] |
|
| [3] |
|
| [4] |
|
| [5] |
孙启臣, 郭伟震, 闫倩倩, 等. 一种基于灰关联分析的多目标跟踪算法[J]. 鲁东大学学报(自然科学版), 2017, 33(1): 20-25,97.
|
| [6] |
|
| [7] |
周志华. 机器学习[M]. 北京: 清华大学出版社, 2016.
|
| [8] |
|
| [9] |
|
| [10] |
|
| [11] |
|
| [12] |
|
| [13] |
|
| [14] |
董豪, 丁子涵. Deep Reinforcement Learaing[M]. 北京: 电子工业出版社, 2021.
|
/
| 〈 |
|
〉 |