基于强化学习的游戏赛车控制算法的实践任务书

 2021-08-20 01:08

1. 毕业设计(论文)主要目标:

本项目主要的研究目标是实现在模拟器环境中用强化学习训练赛车,使其跑得一个尽可能好的成绩。

2. 毕业设计(论文)主要内容:

学习一个合适的模拟器环境的代码及相关知识,学习强化学习的算法,采用一种合适的强化学习算法来训练赛车,直到赛车在游戏中跑到一个较好的成绩。

剩余内容已隐藏,您需要先支付后才能查看该篇文章全部内容!

3. 主要参考文献

[1]Richard Sutton.Reinforcement learning: an introduction 2017,second edition [2]MorganClaypool.Algorithm for reinforcement learning [3]Anschel, O., Baram, N., and Shimkin, N. (2017). Averaged-DQN: Variance reduction and stabilization for deep reinforcement learning. In the International Conference on Machine Learning (ICML).[4]郭宪,《深入浅出强化学习:原理入门》,2017[5] Anschel, O., Baram, N., and Shimkin, N. (2017). Averaged-DQN: Variance reduction and stabilization for deep reinforcement learning. In the International Conference on Machine Learning (ICML).[6] Audiffren, J., Valko, M., Lazaric, A., and Ghavamzadeh, M. (2015). Maximum entropy semisupervised inverse reinforcement learning. In the International Joint Conference on Artificial Intelligence (IJCAI).[7] Bahdanau, D., Brakel, P., Xu, K., Goyal, A., Lowe, R., Pineau, J., Courville, A., and Bengio, Y. (2017). An actor-critic algorithm for sequence prediction. In the International Conference on Learning Representations (ICLR).[8] Baker, B., Gupta, O., Naik, N., and Raskar, R. (2017). Designing neural network architectures using reinforcement learning. In the International Conference on Learning Representations (ICLR).

剩余内容已隐藏,您需要先支付 10元 才能查看该篇文章全部内容!立即支付

以上是毕业论文任务书,课题毕业论文、开题报告、外文翻译、程序设计、图纸设计等资料可联系客服协助查找。