
 2021-08-20 01:08

1. Main objectives of the graduation project (thesis):


2. Main content of the graduation project (thesis):



3. Main references

[1] Sutton, R. S. and Barto, A. G. (2017). Reinforcement Learning: An Introduction, second edition.
[2] Szepesvári, C. Algorithms for Reinforcement Learning. Morgan & Claypool.
[3] Anschel, O., Baram, N., and Shimkin, N. (2017). Averaged-DQN: Variance reduction and stabilization for deep reinforcement learning. In the International Conference on Machine Learning (ICML).
[4] Guo Xian (郭宪) (2017). 《深入浅出强化学习:原理入门》 (an introduction to the principles of reinforcement learning, in Chinese).
[5] Audiffren, J., Valko, M., Lazaric, A., and Ghavamzadeh, M. (2015). Maximum entropy semi-supervised inverse reinforcement learning. In the International Joint Conference on Artificial Intelligence (IJCAI).
[6] Bahdanau, D., Brakel, P., Xu, K., Goyal, A., Lowe, R., Pineau, J., Courville, A., and Bengio, Y. (2017). An actor-critic algorithm for sequence prediction. In the International Conference on Learning Representations (ICLR).
[7] Baker, B., Gupta, O., Naik, N., and Raskar, R. (2017). Designing neural network architectures using reinforcement learning. In the International Conference on Learning Representations (ICLR).

