Skip to content

Commit 58b416e

Browse files
committed
2 parents e15ed90 + 3355be8 commit 58b416e

File tree

1 file changed

+13
-1
lines changed

1 file changed

+13
-1
lines changed

README.md

+13-1
Original file line numberDiff line numberDiff line change
@@ -46,7 +46,7 @@ This repo only contains **PyTorch** Implementation.
4646

4747
* Two versions of **QT-Opt** are implemented [here](https://github.com/quantumiracle/QT_Opt).
4848

49-
* **PointNet** for landmarks generation from images with unsupervised learning is implemented [here](https://github.com/quantumiracle/PointNet_Landmarks_from_Image/tree/master). This method is also used for image-based reinforcement learning as a STOA algorithm, called **Transporter**.
49+
* **PointNet** for landmarks generation from images with unsupervised learning is implemented [here](https://github.com/quantumiracle/PointNet_Landmarks_from_Image/tree/master). This method is also used for image-based reinforcement learning as a SOTA algorithm, called **Transporter**.
5050

5151
original paper: [Unsupervised Learning of Object Landmarksthrough Conditional Image Generation](https://papers.nips.cc/paper/7657-unsupervised-learning-of-object-landmarks-through-conditional-image-generation.pdf)
5252

@@ -65,6 +65,18 @@ This repo only contains **PyTorch** Implementation.
6565
[Memory-based control with recurrent neural networks](https://arxiv.org/abs/1512.04455)
6666

6767
[Sim-to-Real Transfer of Robotic Control with Dynamics Randomization](https://arxiv.org/abs/1710.06537)
68+
69+
* **Maximum a Posteriori Policy Optimisation (MPO)**:
70+
71+
todo
72+
73+
paper: [Maximum a Posteriori Policy Optimisation](https://arxiv.org/abs/1806.06920)
74+
75+
* **Advantage-Weighted Regression (AWR)**:
76+
77+
todo
78+
79+
paper: [Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning](https://arxiv.org/pdf/1910.00177.pdf)
6880

6981
## Usage:
7082
`python ***.py --train`

0 commit comments

Comments
 (0)