強化学習による意思決定モデル

Decision-Making Model by Reinforcement Learning

11.1 強化学習の概要 Abstract of Reinforcement Learning

11.1.1 定義 Definition

11.1.1 定義 Definition

11.1.1 定義 Definition

11.1.2 一般式 General function

$$ a∈A(=a_{1}, a_{2},...) $$

11.1.2 一般式 General function

$$ s∈S(=s_{1}, s_{2},...) $$

11.1.2 一般式 General function

$$ T(s,a,s')=Pr(s'|s,a) $$

11.1.2 一般式 General function

$$ R(s,a) $$

11.1.2 一般式　General function

$$ p(0)=Pr(S(0)) $$

11.1.3 Q学習 Q learning

$$ Q(s,a) <- (1 - α)Q(s,a) + α(r + γmax_{p}Q(s',p)) $$

11.2.1 1人の意思決定 Decision-Making by one person

11.2.2 例題：ミントタブレット問題 Example: Mint tablet problem

11.2.2 例題：ミントタブレット問題 Example: Mint tablet problem

11.2.2 例題：ミントタブレット問題 Example: Mint tablet problem

EX1 迷路探索 Maze exploration

11.3 2人の意思決定 Decision-Making by two people

11.3.1 二つのエージェントの意思決定の手順

11.3.1 Process of Decision-Making by two agents

11.4 深層強化学習の概要

11.4 Abstract of Deep Reinforcement Learning

11.4 深層強化学習の概要

11.4 Abstract of Deep Reinforcement Learning

11.4 深層強化学習の概要

11.4 Abstract of Deep Reinforcement Learning

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

強化学習による意思決定モデル

Decision-Making Model by Reinforcement Learning

11.1 強化学習の概要 Abstract of Reinforcement Learning

11.1 強化学習の概要 Abstract of Reinforcement Learning

11.1.1 定義 Definition

11.1.1 定義 Definition

11.1.1 定義 Definition

11.1.1 定義 Definition

11.1.2 一般式 General function

11.1.2 一般式 General function

11.1.2 一般式 General function

11.1.2 一般式 General function

11.1.2 一般式　General function

11.1.3 Q学習 Q learning

11.2.1 1人の意思決定 Decision-Making by one person

11.2.2 例題：ミントタブレット問題 Example: Mint tablet problem

11.2.2 例題：ミントタブレット問題 Example: Mint tablet problem

11.2.2 例題：ミントタブレット問題 Example: Mint tablet problem

EX1 迷路探索 Maze exploration

11.3 2人の意思決定 Decision-Making by two people

11.3.1 二つのエージェントの意思決定の手順

11.3.1 Process of Decision-Making by two agents

11.4 深層強化学習の概要

11.4 Abstract of Deep Reinforcement Learning

11.4 深層強化学習の概要

11.4 Abstract of Deep Reinforcement Learning

11.4 深層強化学習の概要

11.4 Abstract of Deep Reinforcement Learning

FilesExpand file tree

slide11.md

Latest commit

History

slide11.md

File metadata and controls

強化学習による意思決定モデル

Decision-Making Model by Reinforcement Learning

11.1 強化学習の概要 Abstract of Reinforcement Learning

11.1 強化学習の概要 Abstract of Reinforcement Learning

11.1.1 定義 Definition

11.1.1 定義 Definition

11.1.1 定義 Definition

11.1.1 定義 Definition

11.1.2 一般式 General function

11.1.2 一般式 General function

11.1.2 一般式 General function

11.1.2 一般式 General function

11.1.2 一般式 General function

11.1.3 Q学習 Q learning

11.2.1 1人の意思決定 Decision-Making by one person

11.2.2 例題：ミントタブレット問題 Example: Mint tablet problem

11.2.2 例題：ミントタブレット問題 Example: Mint tablet problem

11.2.2 例題：ミントタブレット問題 Example: Mint tablet problem

EX1 迷路探索 Maze exploration

11.3 2人の意思決定 Decision-Making by two people

11.3.1 二つのエージェントの意思決定の手順

11.3.1 Process of Decision-Making by two agents

11.4 深層強化学習の概要

11.4 Abstract of Deep Reinforcement Learning

11.4 深層強化学習の概要

11.4 Abstract of Deep Reinforcement Learning

11.4 深層強化学習の概要

11.4 Abstract of Deep Reinforcement Learning

11.1.2 一般式　General function