当前位置:网站首页>How to get started quickly and strengthen learning?
How to get started quickly and strengthen learning?
2022-07-27 05:28:00 【Charleslc's blog】
How to get started quickly and strengthen learning ?
Understand the concept of reinforcement learning :
- Books 《Reinforcement Learning-An Introduction》 author :Richard Sutton Read address :http://incompleteideas.net/book/the-book-2nd.html
- David Silver Teaching courses , Video address :http://www0.cs.ucl.ac.uk/staff/d.silver/web/Teaching.html
- Pieter Abbeel and John Schulman (Open AI/ Berkeley AI Research Lab). Address :https://people.eecs.berkeley.edu/~pabbeel/nips-tutorial-policy-optimization-Schulman-Abbeel.pdf
- Start testing reinforcement learning projects
- Blog : How to use strategy gradient training ATARI Pong agent Address :https://karpathy.github.io/2016/05/31/rl/
- DeepMind Lab, Open source 3D Game platform , Based on agent Of AI Research creation , Rich simulation environment . DeepMind It is definitely a big guy team in reinforcement learning research , alphago It was invented . Address :https://deepmind.com/blog/open-sourcing-deepmind-lab/
- Project Malmo, An artificial intelligence experimental platform supporting the basic research of artificial intelligence . Address :https://www.microsoft.com/en-us/research/project/project-malmo/
- OpenAI gym, A toolbox for constructing and comparing reinforcement learning algorithms . Address :https://gym.openai.com/
Related:https://www.kdnuggets.com/2018/03/5-things-reinforcement-learning.html
边栏推荐
猜你喜欢

李宏毅机器学习组队学习打卡活动day02---回归

JVM Part 1: memory and garbage collection part 11 -- execution engine

Pinball games
![[CSAPP] Application of bit vectors | encoding and byte ordering](/img/96/344936abad90ea156533ff49e74f59.gif)
[CSAPP] Application of bit vectors | encoding and byte ordering

Li Hongyi machine learning team learning punch in activity day02 --- return

JVM Part 1: memory and garbage collection -- runtime data area 4 - program counter

Prime number screening (Ehrlich sieve method, interval sieve method, Euler sieve method)

素数筛选(埃氏筛法,区间筛法,欧拉筛法)

Graph cuts learning

2021 OWASP top 5: security configuration error
随机推荐
Integrate SSM
素数筛选(埃氏筛法,区间筛法,欧拉筛法)
Prime number screening (Ehrlich sieve method, interval sieve method, Euler sieve method)
数据库设计——关系数据理论(超详细)
Three waiting methods of selenium and three processing methods of alert pop-up
redis发布订阅模式
Gradio quickly builds ml/dl Web Services
稀疏数组→五子棋的存盘续盘等操作
B1024 科学计数法
redis事务
笔记系列k8s编排MySQL容器-有状态的容器创建过程
The interface can automatically generate E and other asynchronous access or restart,
实用小工具: Kotlin 代码片段
B1024 scientific counting method
数据库迁移报错解决
牛客剑指offer--JZ12 矩阵中的路径
JDBC API details
Li Hongyi machine learning team learning punch in activity day03 --- error and gradient decline
Pytorch data type and numpy data are mutually transformed
JVM上篇:内存与垃圾回收篇三--运行时数据区-概述及线程