A3C 算法资料收集

2019-07-26 21:37:55

Paperhttps://arxiv.org/pdf/1602.01783.pdf

Code:

1. 超级马里奥:https://github.com/vietnguyen91/Super-mario-bros-A3C-pytorch

2. https://github.com/ikostrikov/pytorch-a3c

3. https://github.com/vy007vikas/PyTorch-ActorCriticRL

4. https://github.com/higgsfield/RL-Adventure-2

5. https://github.com/haarnoja/sac

6. Deep Reinforcement Learning, Summer 2019 (Samsung) https://github.com/dongminlee94/Samsung-DRL-Code/tree/master/3_A2C_DDPG

7.

Blog:

1. Using Keras and Deep Deterministic Policy Gradient to play TORCS: https://yanpanlau.github.io/2016/10/11/Torcs-Keras.html

2. https://www.cnblogs.com/wangxiaocvpr/p/5681483.html

Slides:

1. https://github.com/dongminlee94/Samsung-DRL-Code/blob/master/3_A2C_DDPG/A2C.pdf

Video Tutorial

1. Deep RL Bootcamp Lecture 4A: Policy Gradients: https://www.youtube.com/watch?v=S_gwYj1Q-44

 

=============   Super-mario-bros-A3C-pytorch  https://github.com/vietnguyen91/Super-mario-bros-A3C-pytorch

1. 在尝试运行超级马里奥的时候,提示如下的错误:

# python3 train.py
Traceback (most recent call last):
File "train.py", line 16, in <module>
from env import create_train_env
File "/home/wangxiao/data/Super-mario-bros-A3C-pytorch/src/env.py", line 8, in <module>
from nes_py.wrappers import BinarySpaceToDiscreteSpaceEnv
ImportError: cannot import name 'BinarySpaceToDiscreteSpaceEnv' from 'nes_py.wrappers' (/root/miniconda3/lib/python3.7/site-packages/nes_py/wrappers/__init__.py)

Solution can be found at this Link: http://melonicedlatte.com/programming/2019/06/09/202000.html

2. 遇到的第二个 bug 是:

# python3 train.py
Traceback (most recent call last):
File "train.py", line 92, in <module>
train(opt)
File "train.py", line 81, in train
process.start()
File "/root/miniconda3/lib/python3.7/multiprocessing/process.py", line 112, in start
self._popen = self._Popen(self)
File "/root/miniconda3/lib/python3.7/multiprocessing/context.py", line 284, in _Popen
return Popen(process_obj)
File "/root/miniconda3/lib/python3.7/multiprocessing/popen_spawn_posix.py", line 32, in __init__
super().__init__(process_obj)
File "/root/miniconda3/lib/python3.7/multiprocessing/popen_fork.py", line 20, in __init__
self._launch(process_obj)
File "/root/miniconda3/lib/python3.7/multiprocessing/popen_spawn_posix.py", line 47, in _launch
reduction.dump(process_obj, fp)
File "/root/miniconda3/lib/python3.7/multiprocessing/reduction.py", line 60, in dump
ForkingPickler(file, protocol).dump(obj)
File "/root/miniconda3/lib/python3.7/site-packages/torch/multiprocessing/reductions.py", line 179, in reduce_storage
raise RuntimeError("Cannot pickle CUDA storage; try pickling a CUDA tensor instead")
RuntimeError: Cannot pickle CUDA storage; try pickling a CUDA tensor instead 

Solution: do not use PyTorch 0.4.1 version, please use 0.4.0 version or maybe later version like 1.0.1 with: conda install pytorch=0.4.0 -c pytorch

3. 成功运行:

python3 train.py
Process 0. Episode 0
Process 0. Episode 1
Process 0. Episode 2
Process 0. Episode 3
。 。 。

==

最新文章

  1. 深入理解CSS动画animation
  2. HttpWebRequest请求时无法发送具有此谓词类型的内容正文。
  3. Z - Fighting 和 Depth-bias
  4. 深入理解Java内存模型(六)——final
  5. Oracle按用户进行统计信息更新
  6. 我是这样发现ISP劫持HTTP请求的
  7. 所不为人知的Python装饰器
  8. 我的第一个python web开发框架(33)——接口代码重构
  9. css3选择器和伪类
  10. sqlserver常用存储过程基本语法
  11. Java高阶回调,回调函数的另一种玩法
  12. JavaScript 日期与时间戳互转
  13. 数组Array的一些方法
  14. vue相关操作命令
  15. PHP和MySQL实现消息队列
  16. Disable Maven Nature和disable workspace resolution
  17. PTA (Advanced Level) 1015 Reversible Primes
  18. jdk8 流操作
  19. FFF at Valentine(强连通分量缩点+拓扑排序)
  20. sort--Linux下文本处理五大神器之三

热门文章

  1. 在pycharm中右键运行,只有unnitest,HtmltTestRunner不生成报告
  2. ffmpeg 把视频转换为图片
  3. github hooks 配置教程 钩子搭建(实测通过,手把手教程)
  4. docker 安装prometheus
  5. pinpoint的只是总结
  6. 浅谈Linux下傻瓜式磁盘分区工具cfdisk的使用
  7. 离线安装docker(RedHat7.4)
  8. javascript数据结构与算法——栈
  9. 数据库索引数据结构总结——ART树就是前缀树
  10. vscode——常用插件记录