You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+13-1
Original file line number
Diff line number
Diff line change
@@ -46,7 +46,7 @@ This repo only contains **PyTorch** Implementation.
46
46
47
47
* Two versions of **QT-Opt** are implemented [here](https://github.com/quantumiracle/QT_Opt).
48
48
49
-
***PointNet** for landmarks generation from images with unsupervised learning is implemented [here](https://github.com/quantumiracle/PointNet_Landmarks_from_Image/tree/master). This method is also used for image-based reinforcement learning as a STOA algorithm, called **Transporter**.
49
+
***PointNet** for landmarks generation from images with unsupervised learning is implemented [here](https://github.com/quantumiracle/PointNet_Landmarks_from_Image/tree/master). This method is also used for image-based reinforcement learning as a SOTA algorithm, called **Transporter**.
50
50
51
51
original paper: [Unsupervised Learning of Object Landmarksthrough Conditional Image Generation](https://papers.nips.cc/paper/7657-unsupervised-learning-of-object-landmarks-through-conditional-image-generation.pdf)
52
52
@@ -65,6 +65,18 @@ This repo only contains **PyTorch** Implementation.
65
65
[Memory-based control with recurrent neural networks](https://arxiv.org/abs/1512.04455)
66
66
67
67
[Sim-to-Real Transfer of Robotic Control with Dynamics Randomization](https://arxiv.org/abs/1710.06537)
68
+
69
+
***Maximum a Posteriori Policy Optimisation (MPO)**:
70
+
71
+
todo
72
+
73
+
paper: [Maximum a Posteriori Policy Optimisation](https://arxiv.org/abs/1806.06920)
74
+
75
+
***Advantage-Weighted Regression (AWR)**:
76
+
77
+
todo
78
+
79
+
paper: [Advantage-Weighted Regression: Simple and Scalable Off-Policy Reinforcement Learning](https://arxiv.org/pdf/1910.00177.pdf)
0 commit comments