This paper deals with the issue of multi-agent learning of a populace of players, engaged in the repeated normalform game. Assuming boundedly-rational agents, we suggest a design of social Understanding based upon demo and error, called "social reinforcement Discovering". This extension of nicely-known Q-Finding out algorithm, permits players inside a https://abrahamz616lfx3.blog2freedom.com/profile