基于强化学习的游戏赛车控制算法的实践任务书-任务书网

1. 毕业设计（论文）主要目标：

本项目主要的研究目标是实现在模拟器环境中用强化学习训练赛车，使其跑得一个尽可能好的成绩。

2. 毕业设计（论文）主要内容：

学习一个合适的模拟器环境的代码及相关知识，学习强化学习的算法，采用一种合适的强化学习算法来训练赛车，直到赛车在游戏中跑到一个较好的成绩。

剩余内容已隐藏，您需要先支付后才能查看该篇文章全部内容！

3. 主要参考文献

[1]Richard Sutton.Reinforcement learning: an introduction 2017,second edition [2]MorganClaypool.Algorithm for reinforcement learning [3]Anschel, O., Baram, N., and Shimkin, N. (2017). Averaged-DQN: Variance reduction and stabilization for deep reinforcement learning. In the International Conference on Machine Learning (ICML).[4]郭宪，《深入浅出强化学习：原理入门》，2017[5] Anschel, O., Baram, N., and Shimkin, N. (2017). Averaged-DQN: Variance reduction and stabilization for deep reinforcement learning. In the International Conference on Machine Learning (ICML).[6] Audiffren, J., Valko, M., Lazaric, A., and Ghavamzadeh, M. (2015). Maximum entropy semisupervised inverse reinforcement learning. In the International Joint Conference on Artificial Intelligence (IJCAI).[7] Bahdanau, D., Brakel, P., Xu, K., Goyal, A., Lowe, R., Pineau, J., Courville, A., and Bengio, Y. (2017). An actor-critic algorithm for sequence prediction. In the International Conference on Learning Representations (ICLR).[8] Baker, B., Gupta, O., Naik, N., and Raskar, R. (2017). Designing neural network architectures using reinforcement learning. In the International Conference on Learning Representations (ICLR).

剩余内容已隐藏，您需要先支付 10元 才能查看该篇文章全部内容！立即支付

免费ai写开题、写任务书：免费Ai开题 | 免费Ai任务书 | 免费降AI率 | 免费降重复率 | 论文免费排版

注册

找回密码

基于强化学习的游戏赛车控制算法的实践任务书

1. 毕业设计（论文）主要目标：

2. 毕业设计（论文）主要内容：

3. 主要参考文献

您可能感兴趣的文章

登录

注册

找回密码

1. 毕业设计（论文）主要目标：

2. 毕业设计（论文）主要内容：

3. 主要参考文献

您可能感兴趣的文章