
Model improvements6 #206
Merged: 21 commits merged into main on Dec 13, 2021

Conversation

detlefarend (Member) commented Dec 10, 2021

Description

Background

Types of changes

  • Bug fix
  • New Example, please read the detailed guideline here
  • New Feature
  • Documentation

New Feature

  • Model (Improvement on the Core Model)
  • Environment (New Environment), please read the detailed guideline here
  • Policy (New Policy), please read the detailed guideline here
  • .....

Checklist:

  • Read the CONTRIBUTION guide.
  • Update the history on the source code (required).
  • This change requires a change to the documentation.
  • Update the tests accordingly.
  • Update the documentation accordingly.

detlefarend added the labels enhancement (New feature or request) and v0.8.0 (In scope of Release 0.8.0) on Dec 10, 2021
detlefarend linked an issue on Dec 12, 2021 that may be closed by this pull request
detlefarend marked this pull request as ready for review on December 12, 2021, 23:31
detlefarend (Member Author)

Hi Rizky and William, the classes Training and RLTraining have been reworked and enhanced as described in #161 and are demonstrated in the new Howto 17. Please take a look. Thx!

rizkydiprasetya (Contributor)

@detlefarend could you please check? There are still some errors.

detlefarend (Member Author) commented Dec 13, 2021

Hi @rizkydiprasetya, I guess the problem is caused by this code:

[screenshot of the code in question]

rew is not a scalar but a NumPy array, while the method Reward.set_overall_reward() expects a scalar.
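For illustration, a minimal sketch of the mismatch; the shape and value of rew are assumptions, only Reward.set_overall_reward() comes from the discussion:

```python
import numpy as np

# rew as it typically comes out of a NumPy-based environment: a 1-element array
rew = np.array([0.75])

# Reward.set_overall_reward() expects a plain scalar, so the array has to be
# reduced first. .item() raises a ValueError if rew unexpectedly holds more
# than one element, which is safer than silently picking rew[0].
overall_reward = rew.item()                    # -> 0.75 as a Python float

# reward.set_overall_reward(overall_reward)    # 'reward' being the Reward instance
```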

In addition, a parameter of class RLTraining changed its name. I have already fixed and pushed it:

[screenshot of the corresponding fix]

rizkydiprasetya (Contributor)

Why should it be a scalar? It used to work with a NumPy array.

detlefarend (Member Author)

Why should it be a scalar? It used to work with a NumPy array.

A reward is a scalar value, not an array. That's why both methods Reward.set_overall_reward() and Reward.add_agent_reward() expect scalars.

FYI: I edited my last comment (regarding the parameter of class RLTraining).

rizkydiprasetya (Contributor)

OK, and why was it working before the stagnation, evaluation, and other stuff that you added?

detlefarend (Member Author)

OK, and why was it working before the stagnation, evaluation, and other stuff that you added?

It is a simple thing: you use the method beyond its specification. If you do so, the behavior is undefined. The new evaluation logic expects scalar values but gets arrays. We can surely improve both methods by adding try/except logic, but it's not a bug. Please adjust your code or add the logic to Reward yourself.
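One possible shape of that try/except logic, sketched on a reduced stand-in for the real Reward class; the attribute name and return value are assumptions:

```python
import numpy as np

class Reward:
    """Reduced stand-in for the real Reward class, just to illustrate the idea."""

    def __init__(self):
        self._overall_reward = 0.0

    def set_overall_reward(self, p_reward) -> bool:
        # Accept plain scalars as well as single-element NumPy arrays
        try:
            self._overall_reward = float(np.asarray(p_reward).item())
            return True
        except (TypeError, ValueError):
            # p_reward is neither a scalar nor a one-element array
            return False


# Both calls would then be accepted:
r = Reward()
r.set_overall_reward(0.75)
r.set_overall_reward(np.array([0.75]))
```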

rizkydiprasetya (Contributor) commented Dec 13, 2021

OK, then we need to change all of the environments, since all of them are based on NumPy. You can check them.

rizkydiprasetya (Contributor)

OK, and why was it working before the stagnation, evaluation, and other stuff that you added?

It is a simple thing: you use the method beyond its specification. If you do so, the behavior is undefined. The new evaluation logic expects scalar values but gets arrays. We can surely improve both methods by adding try/except logic, but it's not a bug. Please adjust your code or add the logic to Reward yourself.

It's not that far beyond the specification. It is just a question of whether NumPy is used or not.

detlefarend (Member Author)

I have no problem solving this in Reward, except for the time. Feel free to do it if you prefer this solution.

rizkydiprasetya (Contributor)

done

rizkydiprasetya (Contributor)

[Screenshot 2021-12-13 at 11 20 50]

@detlefarend why?

detlefarend (Member Author)

[Screenshot 2021-12-13 at 11 20 50]

@detlefarend why?

What is the problem? Give me a chance to understand.

rizkydiprasetya (Contributor)

[Screenshot 2021-12-13 at 11 20 50] @detlefarend why?

What is the problem? Give me a chance to understand.

Why are you using try and except when there is no error that you want to catch?

detlefarend (Member Author)

Because you can use an env as one (or all) of the afcts in an envmodel. But an env is not adaptive and raises an exception if you try to switch its adaptivity. See EnvBase.switch_adaptivity().
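A minimal sketch of the pattern being discussed; the surrounding function and its parameter names are hypothetical, only switch_adaptivity() and the env-as-adaptive-function scenario come from the comment above:

```python
def switch_adaptivity_of_afcts(p_afcts, p_adaptivity: bool):
    """Switch adaptivity for all adaptive functions of an environment model."""
    for afct in p_afcts:
        try:
            afct.switch_adaptivity(p_adaptivity)
        except Exception:
            # afct is a plain environment acting as an adaptive function.
            # Environments are not adaptive, so EnvBase.switch_adaptivity()
            # raises; the exact exception type is not specified here.
            pass
```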

rizkydiprasetya (Contributor) commented Dec 13, 2021

I see, OK. It is fine like that, but I would prefer to check the object's inheritance to decide about adaptivity.

Oh, I just checked again: EnvBase also inherits from the Model class. That is why you put the raise in the function.
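A stripped-down illustration of why an isinstance check cannot separate the two cases here; the class bodies are placeholders, only the names Model, EnvBase, and switch_adaptivity() are taken from the discussion:

```python
class Model:
    """Placeholder for the adaptive base class."""

    def switch_adaptivity(self, p_ada: bool):
        self._adaptive = p_ada


class EnvBase(Model):
    """Placeholder for the environment base class; environments are not adaptive."""

    def switch_adaptivity(self, p_ada: bool):
        raise NotImplementedError('Environments are not adaptive')


env = EnvBase()

# EnvBase is a Model, so an isinstance check cannot distinguish adaptive models
# from environments used as adaptive functions:
print(isinstance(env, Model))       # True

# Hence the exception handling around the call:
try:
    env.switch_adaptivity(True)
except NotImplementedError:
    pass
```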

@detlefarend
Copy link
Member Author

What is your problem? Does something behave differently from your expectation?

detlefarend merged commit a3de2a8 into main on Dec 13, 2021
detlefarend deleted the model_improvements6 branch on December 14, 2021, 07:33
Labels: enhancement (New feature or request), v0.8.0 (In scope of Release 0.8.0)
Linked issue that may be closed by this pull request: Class RLTraining - Completion
3 participants