Search result of “user:ku2482”

21 search resultsShowing 1~20 results

Stocked

@ku2482(Toshiki Watanabe)

2020/01/23

[論文解説] BCQ: Off-Policy Deep Reinforcement Learning without Exploration

この記事は，以下の論文の解説です． Off-Policy Deep Reinforcement Learning without Exploration (ICLR 2018) 記事内容では，強...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2020/02/01

[論文解説] Deterministic Policy Gradient Algorithms

この記事は，以下の論文の解説です． Deterministic Policy Gradient Algorithms (ICML 2014) ただし，この記事は「DDPGが(Importance...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2020/05/08

[論文解説] IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Architectures

この記事は，以下の論文の解説です． IMPALA: Scalable Distributed Deep-RL with Importance Weighted Actor-Learner Arc...

ReinforcementLearning

Comment2

@ku2482(Toshiki Watanabe)

2020/06/03

[論文解説] DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correction

この記事は，以下の論文の解説です． DisCor: Corrective Feedback in Reinforcement Learning via Distribution Correcti...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2019/10/11

[論文解説] HIRO: Data-Efficient Hierarchical Reinforcement Learning

以下の論文の解説(まとめ)になります． Data-Efficient Hierarchical Reinforcement Learning この論文は，Google Brainが出した論文でN...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2019/06/07

[論文解説] Soft Actor-Critic

Soft Actor-Criticの論文を説明します．以下の2つの論文を参考にしていて，本文中の図は全て論文からの引用になります． Soft Actor-Critic: Off-Policy ...

ReinforcementLearning

Comment1

@ku2482(Toshiki Watanabe)

2019/10/01

[論文解説] Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning?

以下の論文に関する解説(まとめ)になります． Why Does Hierarchy (Sometimes) Work So Well in Reinforcement Learning? この論...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2019/10/17

[論文解説] SAC-Discrete: Soft Actor-Critic for Discrete Action Settings

以下の論文の解説(まとめ)になります． Soft Actor-Critic for Discrete Action Settings この論文は，Imperial College Londonの...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2020/02/16

[論文解説] PCL: Bridging the Gap Between Value and Policy Based Reinforcement Learning

この記事は，以下の論文の解説です． Bridging the Gap Between Value and Policy Based Reinforcement Learning (NIPS 20...

ReinforcementLearning

Comment1

@ku2482(Toshiki Watanabe)

2020/01/28

[論文解説] QR-DQN: Distributional Reinforcement Learning with Quantile Regression

この記事は，以下の論文の解説です． Distributional Reinforcement Learning with Quantile Regression (AAAI 2018) 記事内容...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2020/02/03

[論文解説] IQN: Implicit Quantile Networks for Distributional Reinforcement Learning

この記事は，以下の論文の解説です． Implicit Quantile Networks for Distributional Reinforcement Learning (2018) 記事内...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2020/02/08

[論文解説] FQF: Fully Parameterized Quantile Function for Distributional Reinforcement Learning

この記事は，以下の論文の解説です． Fully Parameterized Quantile Function for Distributional Reinforcement Learning...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2020/01/27

[論文解説] C51: A Distributional Perspective on Reinforcement Learning

この記事は，以下の論文の解説です． A Distributional Perspective on Reinforcement Learning (ICML 2017) 記事内容では，強化学習の...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2020/02/06

[チュートリアル] Amazon SageMakerでの学習・デプロイ

本記事では，Amazon SageMakerを用いて機械学習モデルの学習・デプロイを行うための必要最低限の知識を説明します．普段，仕事や学業で機械学習プロジェクトに携わっているけどAWSにあまり...

Comment0

@ku2482(Toshiki Watanabe)

2020/01/26

[論文解説] Upside-Down RL: Training Agents using Upside-Down Reinforcement Learning

この記事は，以下の論文の解説です． Training Agents using Upside-Down Reinforcement Learning (2019) 記事内容では，強化学習の基礎的...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2020/01/24

[論文解説] AQL: Q-Learning in enormous action spaces via amortized approximate maximization

この記事は，以下の論文の解説です． Q-Learning in enormous action spaces via amortized approximate maximization (20...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2020/01/25

[論文解説] NAC: Reinforcement Learning from Imperfect Demonstrations

この記事は，以下の論文の解説です． Reinforcement Learning from Imperfect Demonstrations (2018) 記事内容では，強化学習の基礎的な知識を...

ReinforcementLearning

Comment0

@ku2482(Toshiki Watanabe)

2019/10/02

[論文解説] DRAW: A Recurrent Neural Network For Image Generation

以下の論文の解説(まとめ)になります． DRAW: A Recurrent Neural Network For Image Generation この論文はDeep Mindの方によるもので，...

GenerativeModels

Comment0

@ku2482(Toshiki Watanabe)

2019/10/03

[論文解説] CVAE: Semi-supervised Learning with Deep Generative Models

以下の論文の解説(まとめ)になります． Semi-supervised Learning with Deep Generative Models この論文はDeep MindのKingmaさん(...

GenerativeModels

Comment0

@ku2482(Toshiki Watanabe)

2019/10/10

[論文解説] TD3: Addressing Function Approximation Error in Actor-Critic Methods

以下の論文の解説(まとめ)になります． Addressing Function Approximation Error in Actor-Critic Methods この論文はICML 201...

ReinforcementLearning

Comment0

21 search resultsShowing 1~20 results

Qiita is a knowledge sharing service for engineers.

You can follow users and tags
You can stock useful information
You can make edit suggestions for articles

Functions that can be used after logging in

Search article