Abstract
The human ability to coordinate and cooperate has been vital to thedevelopment of societies for thousands of years. While it is not fully clearhow this behavior arises, social norms are thought to be a key factor in thisdevelopment. In contrast to laws set by authorities, norms tend to evolve in abottom-up manner from interactions between members of a society. While muchbehavior can be explained through the use of social norms, it is difficult tomeasure the extent to which they shape society as well as how they are affectedby other societal dynamics. In this paper, we discuss the design and evaluation of a reinforcementlearning model for understanding how the opportunity to choose who you interactwith in a society affects the overall societal outcome and the strength ofsocial norms. We first study the emergence of norms and then the emergence ofcooperation in presence of norms. In our model, agents interact with otheragents in a society in the form of repeated matrix-games: coordination gamesand cooperation games. In particular, in our model, at each each stage, agentsare either able to choose a partner to interact with or are forced to interactat random and learn using policy gradients.