
Reinforcement Learning 30: Colaboratory+Acrobot+ChainerRL

Reinforcement Learning
OpenAIGym
chainerRL
colaboratory

Posted at 2019-12-06

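This post assumes you have completed everything up through Reinforcement Learning 28.
Training Acrobot with the standard DQN does not go well.
Even when learning starts out fine, it goes wrong partway through.
After a lot of tinkering, reward-scale-factor appears to be what causes the trouble.
reward-scale-factor seems to be a coefficient that normalizes the reward, and I set it to 1.0.
That seems to mean doing nothing at all, though...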
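
To make concrete what a reward scale factor of 1.0 means, here is a minimal sketch in plain Python. It is not the ChainerRL code: `ScaleRewardSketch`, `ToyPenaltyEnv`, and `episode_return` are hypothetical names made up for this illustration, and the toy environment only imitates Acrobot's per-step penalty of -1 under the assumption of a fixed episode length.

```python
class ToyPenaltyEnv:
    """Hypothetical toy env: reward -1 on every step, ends after `horizon` steps."""

    def __init__(self, horizon=5):
        self.horizon = horizon
        self.t = 0

    def reset(self):
        self.t = 0
        return self.t

    def step(self, action):
        self.t += 1
        done = self.t >= self.horizon
        return self.t, -1.0, done, {}


class ScaleRewardSketch:
    """Hypothetical wrapper: multiplies every reward by a constant factor."""

    def __init__(self, env, scale):
        self.env = env
        self.scale = scale

    def reset(self):
        return self.env.reset()

    def step(self, action):
        obs, reward, done, info = self.env.step(action)
        return obs, reward * self.scale, done, info


def episode_return(env):
    """Run one episode with a fixed dummy action and sum the rewards."""
    env.reset()
    total, done = 0.0, False
    while not done:
        _, reward, done, _ = env.step(0)
        total += reward
    return total


unscaled = episode_return(ScaleRewardSketch(ToyPenaltyEnv(5), 1.0))
scaled = episode_return(ScaleRewardSketch(ToyPenaltyEnv(5), 0.01))
print(unscaled, scaled)
```

With a factor of 1.0 the agent sees the raw rewards, so the wrapper is effectively a no-op. With a smaller factor every reward shrinks by the same ratio, which changes the magnitude of the targets the Q-function has to fit.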

I put the notebook in chokozainerRL.
It is acrobot_dqn_chokozainer.ipynb.
