Abstract
The reinforcement learning algorithms have often been applied to socialrobots. However, most reinforcement learning algorithms were not optimized forthe use of social robots, and consequently they may bore users. We proposed anew reinforcement learning method specialized for the social robot, theFRAC-Q-learning, that can avoid user boredom. The proposed algorithm consistsof a forgetting process in addition to randomizing and categorizing processes.This study evaluated interest and boredom hardness scores of theFRAC-Q-learning by a comparison with the traditional Q-learning. TheFRAC-Q-learning showed significantly higher trend of interest score, andindicated significantly harder to bore users compared to the traditionalQ-learning. Therefore, the FRAC-Q-learning can contribute to develop a socialrobot that will not bore users. The proposed algorithm can also findapplications in Web-based communication and educational systems. This paperpresents the entire process, detailed implementation and a detailed evaluationmethod of the of the FRAC-Q-learning for the first time.