My Experiments on Discor

This repository is based on discor-pytorch and [1]. I am going to add on-policy reweight[2] and probably some other components.

Acknowledgements

We would like to thank Samarth Sinha for kindly providing the source code of [2].

References

[1] Kumar, Aviral, Abhishek Gupta, and Sergey Levine. "Discor: Corrective feedback in reinforcement learning via distribution correction." arXiv preprint arXiv:2003.07305 (2020).

[2] Sinha, Samarth, et al. "Experience Replay with Likelihood-free Importance Weights." arXiv preprint arXiv:2006.13169 (2020).

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

My Experiments on Discor

Acknowledgements

References

Files

README.md

Latest commit

History

README.md

File metadata and controls

My Experiments on Discor

Acknowledgements

References