Fuzzy Q-learning using Distributed Eligibility

Journal of the Korean Institute of Intelligent Systems (한국지능시스템학회논문지)

Volume 11 Issue 5 / Pages 388-394 / 2001 / 1976-9172 (pISSN) / 2288-2324 (eISSN)

Korean Institute of Intelligent Systems (한국지능시스템학회)

분포 기여도를 이용한 퍼지 Q-learning

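정석일 (School of Electronic and Electrical Engineering, Kyungpook National University);
이연정 (School of Electronic and Electrical Engineering, Kyungpook National University)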

Published : 2001.10.01

Abstract

Reinforcement learning is a kind of unsupervised learning method in which an agent learns control rules from experience acquired through interaction with its environment. Eligibility is used to resolve the credit-assignment problem, which is one of the important problems in reinforcement learning. Conventional eligibilities, such as the accumulating eligibility and the replacing eligibility, make ineffective use of the rewards acquired during learning, since only the one executed action in a visited state is learned. In this paper, we propose a new eligibility, called the distributed eligibility, with which not only the executed action but also neighboring actions in a visited state are learned. The fuzzy Q-learning algorithm using the proposed eligibility is applied to a cart-pole balancing problem, and the results show that the proposed method is superior to conventional methods in terms of learning speed.

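(Translated from the Korean abstract) Reinforcement learning is a method in which an agent learns control rules from experience acquired through interaction with the environment. Eligibility is used to solve the credit-assignment problem, one of the important problems in reinforcement learning, but methods using conventional eligibilities such as the accumulating or replacing eligibility train only the action executed in a visited state and therefore cannot make effective use of the reward signals obtained during learning. This paper proposes the distributed eligibility, a new eligibility that allows not only the action executed in a visited state but also its neighboring actions to be learned. A fuzzy Q-learning algorithm using the proposed eligibility is applied to an inverted pendulum system and is shown to outperform conventional methods in learning speed.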
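The contrast the abstract draws between conventional eligibilities and the distributed eligibility can be illustrated with a small sketch. The accumulating and replacing updates below follow their standard textbook definitions. The `mark_distributed` function is only an assumption about what "neighboring actions are also learned" might look like: the abstract does not give the paper's formula, so the `spread` and `radius` parameters, the geometric weighting, and the tabular (non-fuzzy) state and action indexing are all hypothetical choices made for illustration.

```python
def make_table(n_states, n_actions):
    """Create an n_states x n_actions table of zeros (used for Q-values and traces)."""
    return [[0.0] * n_actions for _ in range(n_states)]


def decay(traces, gamma, lam):
    """Decay every trace by gamma * lambda, as in standard eligibility-trace methods."""
    for row in traces:
        for a in range(len(row)):
            row[a] *= gamma * lam


def mark_accumulating(traces, state, action):
    """Accumulating eligibility: add 1 to the executed state-action pair only."""
    traces[state][action] += 1.0


def mark_replacing(traces, state, action):
    """Replacing eligibility: reset the executed state-action pair to 1."""
    traces[state][action] = 1.0


def mark_distributed(traces, state, action, spread=0.5, radius=1):
    """Hypothetical distributed eligibility (NOT the paper's formula).

    The executed action gets credit 1, and each action within `radius`
    index steps gets credit spread ** distance. Existing larger traces
    are kept, in the style of a replacing update.
    """
    row = traces[state]
    lo = max(0, action - radius)
    hi = min(len(row) - 1, action + radius)
    for a in range(lo, hi + 1):
        credit = spread ** abs(a - action)
        row[a] = max(row[a], credit)


def apply_td_error(q, traces, alpha, td_error):
    """Move every Q-value in proportion to its eligibility trace."""
    for s, row in enumerate(traces):
        for a, e in enumerate(row):
            q[s][a] += alpha * td_error * e


# Usage: one state with five discretized actions; action 2 was executed.
q = make_table(1, 5)
traces = make_table(1, 5)
mark_distributed(traces, state=0, action=2)
apply_td_error(q, traces, alpha=0.1, td_error=2.0)
print(q[0])
```

With `mark_replacing` in place of `mark_distributed`, the same temporal-difference error would change only the Q-value of the executed action. Under the assumed distributed rule, one reward also moves the estimates of adjacent actions, which is the kind of more effective reward use the abstract credits for faster learning.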
