ranked reward alternatives? #24

Open
pstjohn opened this issue Oct 4, 2020 · 0 comments

Collaborator

pstjohn commented Oct 4, 2020

Might be worth exploring our own versions of ranked rewards. I don't expect that paper is a canonical standard at this point, so there might be further optimizations we could look into.

Do loss rewards need to be -1? Might be easier to make a loss 0 and win 1.
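To make the 0/1 question concrete, here is a minimal sketch of a thresholded ranked reward with configurable win and loss values. The function name `ranked_reward`, the 75th-percentile default, the nearest-rank threshold, and the random tie-break are all assumptions for illustration, not the paper's exact procedure or anything already in this repo.

```python
import random
from typing import Sequence


def ranked_reward(
    score: float,
    buffer: Sequence[float],
    percentile: float = 75.0,  # assumed default, not taken from the paper
    win: float = 1.0,
    loss: float = 0.0,         # set to -1.0 for a symmetric win/loss reward
    rng: random.Random = random.Random(0),
) -> float:
    """Return `win` if `score` beats the buffer's percentile threshold, else `loss`."""
    if not buffer:
        # Arbitrary choice for an empty buffer: treat the first game as a win.
        return win
    ordered = sorted(buffer)
    # Nearest-rank threshold at the requested percentile.
    idx = min(len(ordered) - 1, int(len(ordered) * percentile / 100.0))
    threshold = ordered[idx]
    if score > threshold:
        return win
    if score < threshold:
        return loss
    # Exact tie with the threshold: pick win or loss at random.
    return rng.choice([win, loss])
```

Switching between the two conventions is then just `loss=0.0` versus `loss=-1.0`, so both can be compared without touching the rest of the pipeline.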

A couple ideas:

  • Pareto front optimization
    Here, we'd just check whether a molecule is Pareto-optimal, i.e., that it isn't dominated by any other recent solution currently in the game buffer. The query below returns true when a dominating solution exists:
    select count(*) > 0 from game_table where buried_vol > %s and max_spin < %s and atom_type = %s

  • percentile scoring
    Similar to ranked rewards but perhaps less sparse: return a reward between 0 and 1 equal to the game's percentile score w.r.t. a game buffer

  • anything else from the literature?
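The first two ideas could be sketched in Python roughly as follows. This is only an illustration under stated assumptions: the buffer is an in-memory list already filtered to one `atom_type`, larger `buried_vol` and smaller `max_spin` are treated as better (inferred from the inequalities in the query), and the names `is_dominated`, `pareto_reward`, and `percentile_reward` are hypothetical.

```python
from bisect import bisect_left, bisect_right
from typing import Iterable, Sequence, Tuple

# (buried_vol, max_spin) for one game; assumed already filtered by atom_type.
Game = Tuple[float, float]


def is_dominated(candidate: Game, buffer: Iterable[Game]) -> bool:
    """True if some buffered game has larger buried_vol AND smaller max_spin.

    Mirrors the strict inequalities of the SQL query.
    """
    vol, spin = candidate
    return any(v > vol and s < spin for v, s in buffer)


def pareto_reward(candidate: Game, buffer: Iterable[Game],
                  win: float = 1.0, loss: float = 0.0) -> float:
    """Binary reward: `win` if the candidate is not dominated, else `loss`."""
    return loss if is_dominated(candidate, buffer) else win


def percentile_reward(score: float, buffer_scores: Sequence[float]) -> float:
    """Reward in [0, 1] given by the score's percentile rank in the buffer."""
    if not buffer_scores:
        # Arbitrary neutral value when there is nothing to compare against.
        return 0.5
    ordered = sorted(buffer_scores)
    below = bisect_left(ordered, score)         # entries strictly below score
    at_or_below = bisect_right(ordered, score)  # entries at or below score
    # Mid-rank convention: tied entries count as half.
    return (below + at_or_below) / (2 * len(ordered))
```

For example, `percentile_reward(3, [1, 2, 3, 4])` gives 0.625, whereas a thresholded ranked reward would collapse the same game to a single win or loss.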

@pstjohn pstjohn created this issue from a note in alpha zero refactor (To do) Oct 4, 2020