Endogenous Mixture Game Equilibria¶
This walks through the process of computing the equilibria for an arbitrary mixture between two games. For this purpose, we we only care about the mixture ratio between the two games, and the payoff to a single player for deviating to strategy when every other agent mixes according to mixture , which we’ll denote and respectively. This derivation is for the symmetric case where , but the extension is role-symmetry is trivial. The shorthand indicates the derivative of with respect to (), and following similar patterns, capital letters are matricies, bold lowercase letters are vectors, and regular lowercase letters are scalars.
For the game where each payoff is fraction of the payoff from game and fraction from game , the deviation payoff is simply , which we can differentiate easily:
If is an equilibrium mixture of game , then for every pair of strategies with support () that , which also implies . independent pairs of strategies can chosen to generate independent equalities of the form:
A simple choice of pairs is . The final equation keeps the equilibrium mixture a mixture, . If we take the strategies as indices from to and use the pairs suggested, this can be represented as the matrix equation:
This final equation represents the derivative of the components of an equilibrium mixture with support in a game. Given an equilibrium of a , equilibria of nearby games can be found using numeric ODE solving techniques until either a beneficial deviation exists to a strategy outside of support, or support for a current strategy drops to zero. In the later case, the support can just be dropped, and the technique can progress, in the former, this equilibrium effectively disappears, and a new equilibrium must be found for .
A limitation of this method is that it stops as soon as beneficial deviation exists outside of support. There are papers the project the equilibrium into a space where the equilibria are piecewise continuous, and as a result, may skirt the need to find new equilibria when a beneficial deviation exists outside of support, however, these methods aren’t readily applicable to our circumstances for two reasons.
They parameterize games by subtracting off the expected deviation payoff played against the uniform mixture, which we generally can’t sample because it would imply having complete game data instead of sparse game data.
The projection of the games does not merge arbitrary games and , but instead ones that differ only in the expected payoff to deviating from the uniform mixture, meaning that some aspect of the projection would need to be tweaked in order to work for arbitrary games.