Technology reports of the Yamaguchi University

Back to Top

Technology reports of the Yamaguchi University Volume 3 Issue 2
published_at 1983-12

Learning behavior of variable-structure stochastic automata in a three person zero-sum game

Learning behavior of variable-structure stochastic automata in a three person zero-sum game
Okamura Kenshiro
Kanaoka Taiho
Okada Toshihiko
Tomita Shingo
fulltext
662 KB
KJ00004351049.pdf
Descriptions
This paper investigates the learning behavior of variable-structure stochastic automata in a three person zero-sum game. The game has three variable-structure stochastic automata and a random environment. In the game the players do not possess prior information concerning the payoff matrix and at the end of every play all the players update their own strategies on the basis of the response from the random environment. Under such situations if a payoff matrix satisfies some conditions, it can be shown that the learning behavior of the automata converges to the optimal strategies.