
Commit c111b4d

Update doc (#202)
* add Laura to code of conduct contact
* update benchmarks
* update benchmark
1 parent b4a0538 commit c111b4d

2 files changed, with 11 additions and 11 deletions

BENCHMARK.md

+10 -10
@@ -6,17 +6,17 @@ All the results below link to their respective PRs with the full experiment repo
 - [the `result` PRs](https://github.com/kengz/SLM-Lab/pulls?utf8=%E2%9C%93&q=is%3Apr+label%3Aresult+).
 - the full experiment datas contributed are [public on Dropbox ](https://www.dropbox.com/sh/y738zvzj3nxthn1/AAAg1e6TxXVf3krD81TD5V0Ra?dl=0)
 
-| Algorithm / *Owner* | [DQN](https://arxiv.org/abs/1312.5602) / *[Keng](https://github.com/kengz)* | DQN + RNN | [DDQN](https://arxiv.org/abs/1509.06461) | DDQN + RNN | [Dueling DQN](https://arxiv.org/abs/1511.06581) | DQN + [PER](https://arxiv.org/abs/1511.05952) | DDQN + [PER](https://arxiv.org/abs/1511.05952) | DQN + [CER](https://arxiv.org/abs/1712.01275) | DDQN + [CER](https://arxiv.org/abs/1712.01275) | [DIST DQN](https://arxiv.org/abs/1602.01783) | REINFORCE | A2C | A2C + [GAE](https://arxiv.org/abs/1506.02438) / *[Laura](https://github.com/lgraesser)* | A2C + [GAE](https://arxiv.org/abs/1506.02438) + [SIL](https://arxiv.org/abs/1806.05635) | [A3C](https://arxiv.org/abs/1602.01783) | [A3C](https://arxiv.org/abs/1602.01783) + [GAE](https://arxiv.org/abs/1506.02438) | [PPO](https://arxiv.org/abs/1707.06347) | [PPO](https://arxiv.org/abs/1707.06347) + [SIL](https://arxiv.org/abs/1806.05635) | [DPPO](https://arxiv.org/pdf/1707.02286.pdf) |
+| Algorithm / *Owner* | [DQN](https://arxiv.org/abs/1312.5602) / *[Keng](https://github.com/kengz)* | [DDQN](https://arxiv.org/abs/1509.06461) | [Dueling DQN](https://arxiv.org/abs/1511.06581) | DQN + [PER](https://arxiv.org/abs/1511.05952) | DDQN + [PER](https://arxiv.org/abs/1511.05952) | DQN + [CER](https://arxiv.org/abs/1712.01275) | DDQN + [CER](https://arxiv.org/abs/1712.01275) | [DIST DQN](https://arxiv.org/abs/1602.01783) | REINFORCE | A2C | A2C + [GAE](https://arxiv.org/abs/1506.02438) / *[Laura](https://github.com/lgraesser)* | A2C + [GAE](https://arxiv.org/abs/1506.02438) + [SIL](https://arxiv.org/abs/1806.05635) | [A3C](https://arxiv.org/abs/1602.01783) | [A3C](https://arxiv.org/abs/1602.01783) + [GAE](https://arxiv.org/abs/1506.02438) | [PPO](https://arxiv.org/abs/1707.06347) | [PPO](https://arxiv.org/abs/1707.06347) + [SIL](https://arxiv.org/abs/1806.05635) | [DPPO](https://arxiv.org/pdf/1707.02286.pdf) |
 |------------|-|-|-|-|-|-|-|-|-|-|-|-|-|-|-|-|-|-|-|
-|[CartPole-v0](https://gym.openai.com/envs/CartPole-v0/)|[4.79](https://github.com/kengz/SLM-Lab/pull/184) | | | | | | | | | | |[44.7](https://github.com/kengz/SLM-Lab/pull/185) | [1.20](https://github.com/kengz/SLM-Lab/pull/180) | | | | | | |
-|[3dball](https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Learning-Environment-Examples.md#3dball-3d-balance-ball)| | | | | | | | | | | | | | | | | | | |
-|[LunarLander-v2](https://gym.openai.com/envs/LunarLander-v2/)|[1.14](https://github.com/kengz/SLM-Lab/pull/191)| |[1.15](https://github.com/kengz/SLM-Lab/pull/203)| | | | | | | | | | | | | | | | |
-|[gridworld](https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Learning-Environment-Examples.md#gridworld)| | | | | | | | | | | | | | | | | | | |
-|[BeamRider-v0](https://gym.openai.com/envs/BeamRider-v0/)| | | | | | | | | | | | | | | | | | | |
-|[Pendulum-v0](https://gym.openai.com/envs/Pendulum-v0/)| n/a | n/a | n/a | n/a| n/a | n/a | n/a | n/a | n/a | n/a | | | | | | | | | |
-|[Acrobot-v1](https://gym.openai.com/envs/Acrobot-v1/)| n/a | n/a | n/a | n/a| n/a | n/a | n/a | n/a | n/a | n/a | | | | | | | | | |
-|[BipedalWalker-v2](https://gym.openai.com/envs/BipedalWalker-v2/)| n/a | n/a | n/a | n/a| n/a | n/a | n/a | n/a | n/a | n/a | | | | | | | | | |
-|[CarRacing-v0](https://gym.openai.com/envs/CarRacing-v0/)| n/a | n/a | n/a | n/a| n/a | n/a | n/a | n/a | n/a | n/a | | | | | | | | | |
+|[CartPole-v0](https://gym.openai.com/envs/CartPole-v0/)| | | | | |[4.79](https://github.com/kengz/SLM-Lab/pull/184) |[5.65](https://github.com/kengz/SLM-Lab/pull/195) | |[1.21](https://github.com/kengz/SLM-Lab/pull/200) |[7.10](https://github.com/kengz/SLM-Lab/pull/185) | [1.20](https://github.com/kengz/SLM-Lab/pull/180) |[6.26](https://github.com/kengz/SLM-Lab/pull/201) | | | | | |
+|[3dball](https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Learning-Environment-Examples.md#3dball-3d-balance-ball)| | | | | | | | | | | | | | | | | |
+|[LunarLander-v2](https://gym.openai.com/envs/LunarLander-v2/)|[1.14](https://github.com/kengz/SLM-Lab/pull/191)|[1.15](https://github.com/kengz/SLM-Lab/pull/203)| | | | | | | | | | | | | | | |
+|[gridworld](https://github.com/Unity-Technologies/ml-agents/blob/master/docs/Learning-Environment-Examples.md#gridworld)| | | | | | | | | | | | | | | | | |
+|[BeamRider-v0](https://gym.openai.com/envs/BeamRider-v0/)| | | | | | | | | | | | | | | | | |
+|[Pendulum-v0](https://gym.openai.com/envs/Pendulum-v0/)| n/a | n/a| n/a | n/a | n/a | n/a | n/a | n/a | | | | | | | | | |
+|[Acrobot-v1](https://gym.openai.com/envs/Acrobot-v1/)| n/a | n/a| n/a | n/a | n/a | n/a | n/a | n/a | | | | | | | | | |
+|[BipedalWalker-v2](https://gym.openai.com/envs/BipedalWalker-v2/)| n/a | n/a| n/a | n/a | n/a | n/a | n/a | n/a | | | | | | | | | |
+|[CarRacing-v0](https://gym.openai.com/envs/CarRacing-v0/)| n/a | n/a| n/a | n/a | n/a | n/a | n/a | n/a | | | | | | | | | |
 
 ### Terminology
 - DQN: Deep Q-learning

CODE_OF_CONDUCT.md

+1 -1
@@ -34,7 +34,7 @@ This Code of Conduct applies both within project spaces and in public spaces whe
 
 ## Enforcement
 
-Instances of abusive, harassing, or otherwise unacceptable behavior may be reported by contacting the project team at [email protected]. The project team will review and investigate all complaints, and will respond in a way that it deems appropriate to the circumstances. The project team is obligated to maintain confidentiality with regard to the reporter of an incident. Further details of specific enforcement policies may be posted separately.
+Instances of abusive, harassing, or otherwise unacceptable behavior may be reported by contacting the project team at Wah Loon Keng ([email protected]) or Laura Graesser ([email protected]). The project team will review and investigate all complaints, and will respond in a way that it deems appropriate to the circumstances. The project team is obligated to maintain confidentiality with regard to the reporter of an incident. Further details of specific enforcement policies may be posted separately.
 
 Project maintainers who do not follow or enforce the Code of Conduct in good faith may face temporary or permanent repercussions as determined by other members of the project's leadership.
