DEEP LEARNING 大满贯课程表
Reinforcement Learning
post by ISH GIRWAN
Courses/Tutorials
- Deep Reinforcement Learning, Spring 2017, by UC Berkeley: http://rll.berkeley.edu/deeprlcours...
- Reinforcement Learning, 2015, by UCL (David Siver): http://www0.cs.ucl.ac.uk/staff/d.si...
- https://github.com/yandexdataschool...
- Lecture notes by Andrew Ng: http://cs229.stanford.edu/notes/cs2...
- https://medium.com/emergent-future/...
Books
- Reinforcement Learning: An Introduction by Richard S. Sutton and Andrew G. Barto: http://webdocs.cs.ualberta.ca/~sutt...
Blogs
I think you can take the UC Berkeley course instead of David Silver's course as it's more up to date. Additionally you can check Arthur Juliani's blog series, it's really good.
相关课程
Calculus One, Coursera, Jim Fowler
Calculus Two, Coursera, Jim Fowler
Multivariable Calculus, Khan Academy, Grant Sanderson
Linear Algebra, MIT, Prof. Gilbert Strang (so mechanical..)
Coding the Matrix, Brown University, Philip Klein
Introduction to Probability, The Science of Uncertainty Edx, MIT, Joh Tsitsiklis
微积分, coursera, 吉姆·福勒
微积分, coursera, 吉姆·福勒
多元微积分, 汗学院, grant sanderson
线性代数, 麻省理工学院教授 吉尔伯特·斯特朗(所以机械..)
编码矩阵, 布朗大学, 菲利普·克莱因
介绍概率, 不确定的科学, 麻省理工学院, joh tsitsiklis
以下是比较旧的RL Course by David Silver
UCL Course on RL
http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching.html
Advanced Topics 2015 (COMPM050/COMPGI13)
Reinforcement Learning
Contact: d.silver@cs.ucl.ac.uk
Video-lectures available here
Lecture 1: Introduction to Reinforcement Learning
Lecture 2: Markov Decision Processes
Lecture 3: Planning by Dynamic Programming
Lecture 4: Model-Free Prediction
Lecture 5: Model-Free Control
Lecture 6: Value Function Approximation
Lecture 7: Policy Gradient Methods
Lecture 8: Integrating Learning and Planning
Lecture 9: Exploration and Exploitation
Lecture 10: Case Study: RL in Classic Games
Easy21 assignment
Discussion and announcements: http://groups.google.com/group/csml-advanced-topics
最新文章
- 在asp.net WebForms中使用路由Route
- word20161205
- 做HDU1010 带出来一个小问题
- VS2010遇到_WIN32_WINNT宏定义问题
- iOS异步图片加载优化与常用开源库分析
- Linux下Oracle11G RAC报错:在安装oracle软件时报file not found一例
- Qt ImageProvider 的使用
- UVA699 dfs and map
- C++ 头文件系列(forward_list)
- Java多线程推荐使用的停止方法和暂停方法
- sql group句子
- 基于Spring Cloud的微服务入门教程
- Linux新增用户过程详解
- 怎么在父窗口调用它页面的iframe里面数据,进行操作?
- R基本图形示例及代码(持续收集)
- MySQL主从复制备份
- Binary Search-483. Smallest Good Base
- iOS swift项目IM实现,从长连接到数据流解析分析之Socket
- AI逻辑实现-取舍行为树还是状态机
- python https协议和InsecurePlatformWarning问题