Abstract
Reinforcement learning algorithms have often been applied to social robots. However, most reinforcement learning algorithms are not optimized for use in social robots, and consequently they may bore users. We propose a new reinforcement learning method specialized for social robots, FRAC-Q-learning, that can avoid user boredom. The proposed algorithm consists of a forgetting process in addition to randomizing and categorizing processes. This study evaluated the interest and boredom hardness scores of FRAC-Q-learning by comparison with traditional Q-learning. FRAC-Q-learning showed a significantly higher trend in interest score and was significantly harder for users to grow bored with than traditional Q-learning. Therefore, FRAC-Q-learning can contribute to the development of a social robot that will not bore users. The proposed algorithm has the potential to be applied to Web-based communication and educational systems. This paper presents the entire process, a detailed implementation, and a detailed evaluation method of FRAC-Q-learning for the first time.