Linear algebraic structure of zero-determinant strategies in repeated games

Masahiko Ueda; Toshiyuki Tanaka

doi:10.1371/journal.pone.0230973

Abstract

Zero-determinant (ZD) strategies, a recently found novel class of strategies in repeated games, has attracted much attention in evolutionary game theory. A ZD strategy unilaterally enforces a linear relation between average payoffs of players. Although existence and evolutional stability of ZD strategies have been studied in simple games, their mathematical properties have not been well-known yet. For example, what happens when more than one players employ ZD strategies have not been clarified. In this paper, we provide a general framework for investigating situations where more than one players employ ZD strategies in terms of linear algebra. First, we theoretically prove that a set of linear relations of average payoffs enforced by ZD strategies always has solutions, which implies that incompatible linear relations are impossible. Second, we prove that linear payoff relations are independent of each other under some conditions. These results hold for general games with public monitoring including perfect-monitoring games. Furthermore, we provide a simple example of a two-player game in which one player can simultaneously enforce two linear relations, that is, simultaneously control her and her opponent’s average payoffs. All of these results elucidate general mathematical properties of ZD strategies.

Citation: Ueda M, Tanaka T (2020) Linear algebraic structure of zero-determinant strategies in repeated games. PLoS ONE 15(4): e0230973. https://doi.org/10.1371/journal.pone.0230973

Editor: Long Wang, Peking University, CHINA

Received: November 12, 2019; Accepted: March 12, 2020; Published: April 2, 2020

Copyright: © 2020 Ueda, Tanaka. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Data Availability: All relevant data are within the manuscript and its Supporting Information files.

Funding: This study was supported by JSPS KAKENHI Grant Numbers JP18H06476 and JP19K21542. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Competing interests: The authors have declared that no competing interests exist.

Introduction

Game theory is a powerful framework explaining rational behaviors of human beings [1] and evolutionary behaviors of biological systems [2, 3]. In a simple example of prisoner’s dilemma game, mutual defection is realized as a result of rational thought, even if mutual cooperation is more favorable. On the other hand, when the game is repeated infinite times, cooperation can be realized if players are far-sighted, which is confirmed as folk theorem. Axelrod’s famous tournaments on infinitely repeated prisoner’s dilemma game [4, 5] also showed that cooperative but retaliating strategy, called the tit-for-tat strategy, is successful in the setting of infinitely repeated game.

Recently, in repeated games with perfect monitoring, a novel class of strategies, called zero-determinant (ZD) strategy, was discovered [6]. Surprisingly, ZD strategy unilaterally enforces a linear relation between average payoffs of players. A strategy which unilaterally sets her opponent’s average payoff (equalizer strategy) is one example. Another example is extortionate strategy in which the player can earn more average payoff than her opponent. ZD strategies contain the well-known tit-for-tat strategy as a special example. After the pioneering work of Press and Dyson, stability of ZD strategies has been studied in the context of evolutionary game theory [7–12], and it was found that some kind of ZD strategies, called generous ZD strategies, can stably exist. Performance of ZD strategies has also been studied in human experiments [13, 14]. Although ZD strategy was originally formulated in two-player two-action (iterated prisoner’s dilemma) games, ZD strategy was extended to multi-player two-action (iterated social dilemma) games [15, 16], two-player multi-action games [17, 18], and multi-player multi-action games [19]. In addition, ZD strategy was extended to two-player two-action noisy games [20, 21], which is one example of the repeated games with imperfect monitoring. Furthermore, besides these fundamental theoretical studies, ZD strategies are also applied to resource sharing in wireless networks [22, 23]. See Ref. [24] for a review of ZD strategies in the context of direct reciprocity.

The contributions of this paper are four-fold. First, we extend ZD strategy for general multi-player multi-action repeated games with public monitoring, where players know the structure of games (players, sets of actions of all players, and payoffs of all players) but cannot observe actions of other players. A typical example of such situation is auction. In a sealed-bid auction, a player cannot know actions (bids) of other players, but only knows the result of the game (whether she is the winner or not). Second, we prove, in terms of a linear-algebraic argument, that linear payoff relations enforced by players with ZD strategies are consistent, that is, always have solutions. Third, we introduce the notion of independence of ZD strategies, and prove, again in terms of a linear-algebraic argument, that linear payoff relations enforced by players with ZD strategies are independent under a general condition. Fourth, as an application of linear algebraic formulation, we provide a simple example of a two-player game in which one player can simultaneously enforce two linear relations. This means that she can simultaneously control her and her opponent’s average payoffs, which has never been reported in the context of ZD strategies. All of these results develop deeper understanding of mathematical properties of ZD strategies in general games.

We remark on discounting. In standard repeated games, discounting of future payoffs is considered by introducing a discounting factor δ ≤ 1 [1]. In the original work on ZD strategy by Press and Dyson, only the case without discounting (i.e., δ = 1) was investigated [6]. After their work, ZD strategy was extended to δ < 1 case [18, 25, 26]. In this paper, we consider only the non-discounting case δ = 1.

Setup

We consider an N-player multi-action repeated game, in which player n ∈ {1, ⋯, N} has M_n possible actions, where M_n is a positive integer. Let denote a state of the game, which is the combination of the actions taken by the N players. Let be the size of the state space Σ. We assume that player n decides the next action stochastically according to her own previous action and common information τ ∈ B with the conditional probability , where B is some set. We also define the conditional probability that common information τ arises when actions of players in the preceding round are σ′ by W(τ|σ′). (An example of τ is the winner in each round; see S1 Text) Then the sequence of states of the repeated game forms a Markov chain (1) with the transition probability (2) where P(σ, t) denotes the state distribution at time t. We assume that all players know the function W(τ|σ′) but cannot directly observe σ′. When B = Σ and W(τ|σ′) = δ_τ,σ′, the above formulation reduces to that of perfect monitoring games. Otherwise, it represents games with public monitoring, where players cannot directly observe actions of other players. The model treated here can therefore be regarded as an extension of repeated games with perfect monitoring to those with imperfect monitoring, and the extension includes the former as a special case.

For each state σ, a payoff of player n is defined as s_n(σ). Let s_n ≡ (s_n(σ′))_σ′∈Σ be the M-dimensional vector representing the payoffs of player n, which we call the payoff vector of player n. It should be noted that in the following analysis we do not assume the payoffs to be symmetric, unless otherwise stated.

Results

Zero-determinant strategies

Because a discounting factor δ is one, the payoffs of players are the average payoffs with respect to the stationary distribution of the Markov chain. Let P^(s)(σ) denote the stationary distribution, which may depend on the initial condition when the Markov chain is not irreducible. It satisfies (3) Taking summation of both sides of Eq (3) with respect to σ_−n ≡ σ∖σ_n with an arbitrary n, we obtain (4) where we have defined (5) Regarding as representing the strategy “Repeat”, where player n repeats the previous action with probability one, one can readily see that Eq (4) is an extension of Akin’s lemma [15, 18, 27, 28], relating a player’s strategy with the stationary distribution, to the multi-player multi-action public-monitoring case. Letting (6) Eq (4) means that the average of with respect to the stationary distribution is zero for any n and σ_n. We remark that all players are assumed to know the functional form of W(τ|σ′), and that , and thus T_n(σ_n|σ′) as well, are solely under control of player n. Because of the normalization condition , the relation (7) holds.

Let , which we call the strategy vector of player n associated with action σ_n. (Another name for is the Press-Dyson vector [27].) A strategy of player n is represented as an M × M_n matrix composed of the strategy vectors for her actions σ_n ∈ {1, …, M_n}. For a matrix A, let span A be the subspace spanned by the column vectors of A. Let 0_m and 1_m denote the m-dimensional zero vector and the m-dimensional vector of all ones, respectively. From Eq (7), one has (8) for any player n, implying that the dimension of is at most (M_n − 1).

Let ρ ≡ (P^(s)(σ))_σ∈Σ be the vector representation of the stationary distribution P^(s)(σ). When player n chooses a strategy , for any vector , one has due to Eq (4). In other words, the expectation of v with respect to the stationary distribution ρ vanishes.

Let and . The following definition is an extension of the notion of the ZD strategy [6, 27] to multi-player multi-action public-monitoring games.

Definition 1. A zero-determinant (ZD) strategy is defined as a strategy for which dim V_n ≥ 1 holds.

To see that this is indeed an extended definition of the ZD strategy, note that any vector is represented as , where is the coefficient vector. Let be the vector with element e_n equal to the expected payoff e_n ≡ 〈s_n(σ)〉_s of player n in the steady state. When player n employs a ZD strategy, it amounts to enforcing linear relations on e with α satisfying .

Consistency

A question naturally arises: When more than one of the players employ ZD strategies, are they “consistent”, that is, do linear payoff relations enforced by the players always have solutions? For example, in a two-player game, when player 1 enforces by a ZD strategy and player 2 enforces by a ZD strategy, do the simultaneous equations of (e₁, e₂) have a solution? Let N′ be the set of players who employ ZD strategies. The set consists of all combinations of the expected payoffs that satisfy the enforced linear relations by the players in N′. If E is empty, then it implies that the set of ZD strategies is inconsistent in the sense that there is no valid solution of the linear relations enforced by the players.

Definition 2. ZD strategies are said to be consistent when E is not empty.

In the multi-player setting, one may regard N′ as a variant of a ZD strategy alliance [15], where the players in N′ agree to coordinate on the linear relations to be enforced on the expected payoffs. The above question then amounts to asking whether it is possible for a player to serve as a counteracting agent who participates in the ZD strategy alliance with a hidden intention to invalidate it by adopting a ZD strategy that is inconsistent with others.

The following proposition is the first main result of this paper.

Proposition 1. Any set of ZD strategies is consistent.

Proof. We first note that the following property holds for strategy vectors, whose proof is given in Methods.

Lemma 1. Let . Then .

For any set span(V_n)_n∈N′ of ZD strategies, let K be the dimension of span(V_n)_n∈N′, and let be a basis of span(V_n)_n∈N′. The expected payoff vector should be given by a non-zero solution of the linear equation in , where we define A, b, and as (9) One has (10) where .

The Rouché-Capelli theorem [29] tells us that is a necessary and sufficient condition for the linear equation in to have a solution, that is, for span(V_n)_n∈N′ to be consistent (because A is augmented matrix). An equivalent expression of this condition is that there is no vector such that and hold (which ensures that there is no elementary operations which make the rank of augmented matrix larger than that of the original matrix). Assume to the contrary that there exist such that and hold. One would then have (11) On the other hand, is a linear combination of , so that Lemma 1 states that it should be zero if it is proportional to 1_M, leading to contradiction.

Proposition 1 states that it is impossible for any player to serve as a counteracting agent to invalidate ZD strategy alliances. This statement is quite general in that it applies to any instance of repeated games covered by our formulation.

In Ref. [19], it was shown that every player can have at most one master player, who can play an equalizer strategy on the given player (that is, controlling the expected payoff of the given player), in multi-player multi-action games. Indeed, our general result on the absence of inconsistent ZD strategies (Proposition 1) immediately implies that more than one ZD players cannot simultaneously control the expected payoff of a player to different values. Therefore, our result generalizes their result on equalizer strategy to arbitrary ZD strategies.

Since the dimension of is at most (M_n − 1), depending on , it should be possible for player n with M_n ≥ 3 to adopt a ZD strategy for which dim V_n ≥ 2 holds. The dimension of V_n corresponds to the number of independent linear relations to be enforced on the expected payoffs of the players, so that it implies that one player may be able to enforce multiple independent linear relations. On the other hand, our result on the absence of inconsistent ZD strategies implies that for any set N′ of ZD players the dimension of span(V_n)_n∈N′ should be at most N, the number of players, since any set of ZD strategies should contain at most N independent linear relations if it is consistent. This in turn implies that if the dimension of span(V_n)_n∈N′ is equal to N for a subset N′ of players then players not in N′ cannot employ independent ZD strategy any more.

Independence

Another naturally-arising question would be regarding independence for a set of ZD strategies, which we define as follows:

Definition 3. A set of ZD strategies is independent if any set {v_n}_n∈N′ of non-zero vectors v_n in V_n is linearly independent. Otherwise, is said to be dependent.

If a set of ZD strategies is dependent, then there exists a ZD player whose ZD strategy adds no linear constraints other than those already imposed by other ZD players. One of the simplest example of a dependent set of ZD strategies is the case where two players enforce exactly the same linear relation to the expected payoffs. Our second main result is to show that any set of ZD strategies is independent under a general condition.

Proposition 2. Let N′ be a subset of players. Assume that does not have zero elements for any n ∈ N′ and any σ_n ∈ {1, …, M_n}. Then, any set of ZD strategies of players in N′ is independent.

See Methods for the proof.

It should be noted that when has zero elements then one might have dependent ZD strategies. A simple example can be found in a two-player two-action perfect-monitoring (iterated prisoner’s dilemma) game: Let the payoff vectors s₁ and s₂ for players 1 and 2 be and , with T ≠ S. If player 1 adopts the strategy (12) then it enforces the linear payoff relation e₁ = e₂. This strategy is a well-known tit-for-tat strategy [6]. By symmetry, player 2 can also adopt the same strategy , implying that these two strategies are indeed dependent.

Simultaneous multiple linear relations by one player

As mentioned above, when the number M_n of possible actions for player n is more than two, player n may be able to employ a ZD strategy with dim V_n ≥ 2 to simultaneously enforce more than one linear relations. (We note that this is impossible for public goods game [15, 16] because the number of action for each player is two.) Such a possibility has never been reported in the context of ZD strategies. Here, we provide a simple example of such a situation in a two-player three-action symmetric game.

We consider the 3 × 3 symmetric game (13) We remark that s₁, s₂, and 1₉ are linearly independent when r₁ ≠ r₂ and r₁ ≠ −r₂. We choose strategies of player 1 as (14) with 0 ≤ p ≤ 1, 0 ≤ q ≤ 1, 0 ≤ p′ ≤ 1, 0 ≤ q′ ≤ 1, q ≤ p, and p′ ≤ q′. Then we obtain (15) (16) Therefore, player 1 can simultaneously control average payoffs of both players, e₁ and e₂, as e₁ = e₂ = 0. Note that σ with s₁(σ) = 0 is an absorbing state regardless of the strategy of player 2 in this case.

In general, when one player simultaneously enforces two linear relations in two-player multi-action symmetric games, only e₁ = e₂ = C is allowed with some C. This is explained as follows: Assume that player 1 can simultaneously enforce e₁ = C₁ and e₂ = C₂ with C₁ ≠ C₂ by one ZD strategy. Because the game is symmetric, player 2 can also simultaneously enforce e₁ = C₂ and e₂ = C₁ independently by one ZD strategy. This contradicts the consistency of ZD strategies (Proposition 1). Therefore, the only possibility is e₁ = e₂ = C.

The above argument can be extended straightforwardly to the multi-player case. For that purpose, we introduce some notions of symmetric multi-player games. The following definition of a symmetric multi-player game is due to von Neumann and Morgenstern [30, Section 28].

Definition 4. A game is symmetric with respect to a permutation π on {1, …, N} if M_n = M_π(n) holds for any n ∈ {1, …, N} and if π preserves the payoff structure of the game, that is, (17) holds for any σ ∈ Σ and for any n ∈ {1, …, N}, where σ_π ≡ (σ_π(1), …, σ_π(N)).

The following definition is due to Ref. [31].

Definition 5. A game is weakly symmetric if for any pair of players n and there exists some permutation π on {1, …, N} satisfying such that the game is symmetric with respect to π.

Consider an N-player weakly symmetric game. Assume that one player simultaneously enforces N independent linear relations on the average payoffs {e_n}_{n ∈ {1, …, N}} of N players via adopting an N-dimensional ZD strategy. (Note that for this to be possible the number M_n of actions should satisfy M_n ≥ N + 1). Then, the average payoffs {e_n}_{n∈{1,…,N}} should be simultaneously controlled, but they should satisfy e₁ = e₂ = ⋯ = e_n due to the consistency of ZD strategies.

The difficulty of construction of a ZD strategy of one player with dimension N in weakly symmetric N-player games can be seen in the following two propositions, whose proofs are given in Methods.

Proposition 3. In a weakly symmetric N-player game, if the strategy vectors of one player contain no zero element, then a ZD strategy of the player with dimension N is impossible.

Proposition 4. In a weakly symmetric N-player game, if payoffs s_n(σ) of player n are different from each other for all σ, then a ZD strategy with dimension N is impossible.

Discussion

In this paper, we have derived ZD strategies for general multi-player multi-action public-monitoring games, in which players cannot observe actions of other players. By formulating ZD strategy in terms of linear algebra, we have proved that linear payoff relations enforced by ZD players are consistent. Furthermore, we have proved that linear payoff relations enforced by players with ZD strategies are independent under a general condition. We emphasize that these results hold not only for imperfect-monitoring games but also for perfect-monitoring games. We have also provided a simple example in which one player can simultaneously enforce more than one linear constraints on the expected payoffs. These results elucidate constraints on ZD strategies in terms of linear algebra.

Although we have discussed mathematical properties of ZD strategies if exist, we do not know the criterion for whether ZD strategies exist or not when a game is given. For example, we can easily show that ZD strategy does not exist for the rock-paper-scissors game, which is the simplest two-player three-action symmetric zero-sum game. (See S1 Text for the proof.) Whereas, we can also show that there is a two-player three-action symmetric zero-sum game for which ZD strategy exists, which is also provided in S1 Text. Generally, the dimension of is smaller than N + 1 for zero-sum games, and construction of ZD strategies for zero-sum games is expected to be more difficult compared to non-zero-sum games. Consistency together with constraints on payoffs such as symmetry and linear dependence may be useful to specify the space of ZD strategies which can exist. Specifying a general criterion for the existence of ZD strategies is an important future problem.

In addition, it should be noted that ZD strategies are not always “rational” strategies, which have been a main subject of game theory. Therefore, investigation of ZD strategies in terms of bounded rationality [32] may be needed. Specifying the situation where ZD strategies are adopted is another important problem.

Another remark is related to memory of strategies. In this work, we considered only memory-one strategies. In Ref. [6], it has been proved that a player with longer memory does not have advantage over a player with short memory in terms of average payoff in two-player games. In Ref. [16, 19], it has been shown that this statement also holds for multi-player games. Therefore, considering only memory-one strategies should be sufficient even in our public-monitoring situation. Longer memory strategies attract much attentions in repeated games with implementation errors [33, 34]. Extension of ZD strategies to longer memory case may lead to different evolutionary behavior compared to memory-one strategies.

We remark on the effect of imperfect monitoring. In perfect monitoring case, the strategy vectors are arbitrary as long as they satisfy the conditions for probability distributions. In contrast, in imperfect monitoring case, forms of the strategy vectors are constrained by Eq (5). Therefore, the space of ZD strategies for imperfect-monitoring games is generally smaller than that for perfect-monitoring games. In S1 Text, we provide examples of ZD strategies in simple imperfect-monitoring games.

Methods

Proof of Lemma 1

Assume to the contrary that with γ ≠ 0. Taking the inner product of v with the stationary distribution ρ, one has since is represented as a linear combination of the strategy vectors and since the inner product of a strategy vector and the stationary distribution is zero. On the other hand, holds because of the normalization of the stationary distribution. Therefore we obtain γ = 0, leading to contradiction.

Proof of Proposition 2

We first show the following lemma.

Lemma 2. Let N′ be a subset of players. Assume that does not have zero elements for any n ∈ N′ and any σ_n ∈ {1, …, M_n}. For n ∈ N′, let v_n be an arbitrary non-zero vector in . Then {v_n}_n∈N′ are linearly independent.

Proof. We assume to the contrary that {v_n}_n∈N′ are linearly dependent. Then there is a set of coefficients {a_n}_n∈N′ with which ∑_n∈N′a_nv_n = 0_M holds. Without loss of generality we assume a_n ≠ 0 for n ∈ N′.

Since , it is expressed as with a non-zero vector . Let , where ties may be broken arbitrarily, and . With Eq (8), one obtains (18) and thus (19)

We show that the inequality (20) holds for any n, any σ_n ∈ {1, …, M_n}, and any σ′ ∈ Σ satisfying . We first note that for any strategy vector with action σ_n ∈ {1, ⋯, M_n}, one has, from Eq (6), (21) Fix any σ′ ∈ Σ satisfying for a moment. Then, for one has by definition, making the left-hand side of Eq (20) equal to zero. For , on the other hand, one has by definition. Also, since , from Eq (21) one has . These imply that the inequality (20) holds for . Putting the above arguments together, we have shown that the inequality (20) holds for any n, any σ_n ∈ {1, …, M_n}, and any σ′ ∈ Σ satisfying .

Fix any σ′ ∈ Σ satisfying for all n ∈ N′. The above argument has shown that the inequality (20) holds for any n and any σ_n ∈ {1, …, M_n}. On the other hand, at the beginning of the proof we have assumed that (22) holds, implying that the summand is equal to zero for any n ∈ N′ and any σ_n ∈ {1, …, M_n}. By assumption, a_n ≠ 0 and , so that one has , and consequently, , leading to contradiction.

The proof of Proposition 2 is straightforward by taking v_n as belonging to in Lemma 2.

Proof of Proposition 3

We first show the following lemma.

Lemma 3. Consider an N-player game which is symmetric with respect to a permutation π on {1, …, N}. Assume that the column vectors of are linearly independent. For any pair of players n and satisfying , if the strategy vectors of these players contain no zero element, then it is impossible for these players to adopt ZD strategies with which player n enforces linear relation with α ≠ 0_N+1, and where player enforces , where .

Proof. We assume to the contrary that there exists α ≠ 0_N+1 satisfying the properties stated in Lemma 3. By assumption, and . There then exist c_n and satisfying and . One has (23) where the second equality is due to the assumed symmetry of the game with respect to π. Letting , , and , one has (24) implying that holds. Let .

Let and , where ties may be broken arbitrarily, and and . One then has (25) Recalling that we have assumed , let σ′ ∈ Σ be an arbitrary state satisfying and . Then, in view of Eq (21), one has (26) implying that v(σ′) = 0 holds. Since for all σ_n ∈ {1, …, M_n}, they are all equal to zero. Since is assumed non-zero, one has for all σ_n ∈ {1, …, M_n} and consequently . One similarly has . Therefore, from Eq (8) one has . Due to the assumption of linear independence of the columns of , it in turn implies that α = 0_N+1 holds, leading to contradiction.

It should be noted that Lemma 3 holds even if one takes , in which case the Lemma implies that, if the game is symmetric with respect to π, player n with π(n) ≠ n cannot enforce linear relations simultaneously. It should also be noted that Lemma 3 furthermore implies that it is impossible for that player to enforce a linear relation satisfying α_π = α ≠ 0_N+1. In other words, in a symmetric game no player to whom the game is symmetric can enforce a linear relation with the same symmetry as the game itself.

Proposition 3 is a direct consequence of Lemma 3 in weakly symmetric multi-player games.

Proof of Proposition 4

Without loss of generality, we assume that player k takes an N-dimensional ZD strategy determining the average payoffs e_n for n = 1, ⋯, N. Due to the above discussion, only e₁ = ⋯ = e_N = C is allowed. Letting for n ∈ {1, …, N}, one can take as a basis of the N-dimensional ZD strategy. Let c⁽ⁿ⁾ be defined as (27) By the assumption of weak symmetry, for any player n ≠ k, there exists a permutation π satisfying π(n) = k such that the game is symmetric with respect to π. Noting that , from Eq (23) one has (28)

For n ∈ {1, …, N}, define and , where ties may be broken arbitrarily provided that holds, and and . From Eq (7), one has (29) Then, from Eqs (28) and (21), we obtain for an arbitrary σ* satisfying and (30) (31) implying s_n(σ*) = C. On the other hand, we also obtain for an arbitrary σ** satisfying and (32) (33) implying s_n(σ**) = C. Then, because we have assumed that all elements of the payoff vector s_n are different from each other, we have arrived at a contradiction.

Supporting information

S1 Text. Details of discussion.

https://doi.org/10.1371/journal.pone.0230973.s001

(PDF)

Acknowledgments

We thank Ryosuke Kobayashi for valuable discussions.

References

1. Fudenberg D, Tirole J. Game Theory. Massachusetts: MIT Press; 1991.
2. Smith JM, Price GR. The logic of animal conflict. Nature. 1973;246(5427):15.
- View Article
- Google Scholar
3. Nowak MA. Five rules for the evolution of cooperation. Science. 2006;314(5805):1560–1563. pmid:17158317
- View Article
- PubMed/NCBI
- Google Scholar
4. Axelrod R, Hamilton WD. The evolution of cooperation. Science. 1981;211(4489):1390–1396.
- View Article
- Google Scholar
5. Axelrod R. The Evolution of Cooperation. New York: Basic Books; 1984.
6. Press WH, Dyson FJ. Iterated Prisoner’s Dilemma contains strategies that dominate any evolutionary opponent. Proceedings of the National Academy of Sciences. 2012;109(26):10409–10413.
- View Article
- Google Scholar
7. Hilbe C, Nowak MA, Sigmund K. Evolution of extortion in Iterated Prisoner’s Dilemma games. Proceedings of the National Academy of Sciences. 2013;110(17):6913–6918.
- View Article
- Google Scholar
8. Adami C, Hintze A. Evolutionary instability of zero-determinant strategies demonstrates that winning is not everything. Nature Communications. 2013;4. pmid:23903782
- View Article
- PubMed/NCBI
- Google Scholar
9. Stewart AJ, Plotkin JB. From extortion to generosity, evolution in the Iterated Prisoner’s Dilemma. Proceedings of the National Academy of Sciences. 2013;110(38):15348–15353.
- View Article
- Google Scholar
10. Hilbe C, Nowak MA, Traulsen A. Adaptive Dynamics of Extortion and Compliance. PLOS ONE. 2013;8(11):1–9.
- View Article
- Google Scholar
11. Stewart AJ, Plotkin JB. Extortion and cooperation in the Prisoner’s Dilemma. Proceedings of the National Academy of Sciences. 2012;109(26):10134–10135.
- View Article
- Google Scholar
12. Szolnoki A, Perc M. Evolution of extortion in structured populations. Physical Review E. 2014;89(2):022804.
- View Article
- Google Scholar
13. Hilbe C, Röhl T, Milinski M. Extortion subdues human players but is finally punished in the prisoner’s dilemma. Nature communications. 2014;5:3976. pmid:24874294
- View Article
- PubMed/NCBI
- Google Scholar
14. Wang Z, Zhou Y, Lien JW, Zheng J, Xu B. Extortion can outperform generosity in the iterated prisoner’s dilemma. Nature communications. 2016;7:11125. pmid:27067513
- View Article
- PubMed/NCBI
- Google Scholar
15. Hilbe C, Wu B, Traulsen A, Nowak MA. Cooperation and control in multiplayer social dilemmas. Proceedings of the National Academy of Sciences. 2014;111(46):16425–16430.
- View Article
- Google Scholar
16. Pan L, Hao D, Rong Z, Zhou T. Zero-determinant strategies in iterated public goods game. Scientific Reports. 2015;5.
- View Article
- Google Scholar
17. Guo JL. Zero-determinant strategies in iterated multi-strategy games. ArXiv e-prints. 2014;.
18. McAvoy A, Hauert C. Autocratic strategies for iterated games with arbitrary action spaces. Proceedings of the National Academy of Sciences. 2016;113(13):3573–3578.
- View Article
- Google Scholar
19. He X, Dai H, Ning P, Dutta R. Zero-determinant strategies for multi-player multi-action iterated games. IEEE Signal Processing Letters. 2016;23(3):311–315.
- View Article
- Google Scholar
20. Hao D, Rong Z, Zhou T. Extortion under uncertainty: Zero-determinant strategies in noisy games. Phys Rev E. 2015;91:052803.
- View Article
- Google Scholar
21. Mamiya A, Ichinose G. Strategies that enforce linear payoff relationships under observation errors in Repeated Prisoner’s Dilemma game. Journal of Theoretical Biology. 2019;477:63–76. pmid:31201882
- View Article
- PubMed/NCBI
- Google Scholar
22. Daoud AA, Kesidis G, Liebeherr J. Zero-determinant strategies: A game-theoretic approach for sharing licensed spectrum bands. IEEE Journal on Selected Areas in Communications. 2014;32(11):2297–2308.
- View Article
- Google Scholar
23. Zhang H, Niyato D, Song L, Jiang T, Han Z. Zero-determinant strategy for resource sharing in wireless cooperations. IEEE Transactions on Wireless Communications. 2016;15(3):2179–2192.
- View Article
- Google Scholar
24. Hilbe C, Chatterjee K, Nowak MA. Partners and rivals in direct reciprocity. Nature human behaviour. 2018;2(7):469. pmid:31097794
- View Article
- PubMed/NCBI
- Google Scholar
25. Hilbe C, Traulsen A, Sigmund K. Partners or rivals? Strategies for the iterated prisoner’s dilemma. Games and Economic Behavior. 2015;92:41–52. pmid:26339123
- View Article
- PubMed/NCBI
- Google Scholar
26. Ichinose G, Masuda N. Zero-determinant strategies in finitely repeated games. Journal of Theoretical Biology. 2018;438:61–77. pmid:29154776
- View Article
- PubMed/NCBI
- Google Scholar
27. Akin E. The iterated prisoner’s dilemma: good strategies and their dynamics. Ergodic Theory, Advances in Dynamical Systems. 2016; p. 77–107.
- View Article
- Google Scholar
28. Akin E. What you gotta know to play good in the iterated prisoner’s dilemma. Games. 2015;6(3):175–190.
- View Article
- Google Scholar
29. Shafarevich IR, Remizov AO. Linear Algebra and Geometry. New York: Springer; 2012.
30. von Neumann J, Morgensternx O. Theory of Games and Economic Behavior. 3rd ed. Princeton University Press; 1953.
31. Plan A. Symmetric n-player games; 2017.
32. Rubinstein A. Modeling bounded rationality. Massachusetts: MIT Press; 1998.
33. Hilbe C, Martinez-Vaquero LA, Chatterjee K, Nowak MA. Memory-n strategies of direct reciprocity. Proceedings of the National Academy of Sciences. 2017;114(18):4715–4720.
- View Article
- Google Scholar
34. Murase Y, Baek SK. Seven rules to avoid the tragedy of the commons. Journal of theoretical biology. 2018;449:94–102. pmid:29678691
- View Article
- PubMed/NCBI
- Google Scholar

[ref1] 1. Fudenberg D, Tirole J. Game Theory. Massachusetts: MIT Press; 1991.

[ref2] 2. Smith JM, Price GR. The logic of animal conflict. Nature. 1973;246(5427):15.
View Article
Google Scholar

[3] View Article

[4] Google Scholar

[ref3] 3. Nowak MA. Five rules for the evolution of cooperation. Science. 2006;314(5805):1560–1563. pmid:17158317
View Article
PubMed/NCBI
Google Scholar

[6] View Article

[7] PubMed/NCBI

[8] Google Scholar

[ref4] 4. Axelrod R, Hamilton WD. The evolution of cooperation. Science. 1981;211(4489):1390–1396.
View Article
Google Scholar

[10] View Article

[11] Google Scholar

[ref5] 5. Axelrod R. The Evolution of Cooperation. New York: Basic Books; 1984.

[ref6] 6. Press WH, Dyson FJ. Iterated Prisoner’s Dilemma contains strategies that dominate any evolutionary opponent. Proceedings of the National Academy of Sciences. 2012;109(26):10409–10413.
View Article
Google Scholar

[14] View Article

[15] Google Scholar

[ref7] 7. Hilbe C, Nowak MA, Sigmund K. Evolution of extortion in Iterated Prisoner’s Dilemma games. Proceedings of the National Academy of Sciences. 2013;110(17):6913–6918.
View Article
Google Scholar

[17] View Article

[18] Google Scholar

[ref8] 8. Adami C, Hintze A. Evolutionary instability of zero-determinant strategies demonstrates that winning is not everything. Nature Communications. 2013;4. pmid:23903782
View Article
PubMed/NCBI
Google Scholar

[20] View Article

[21] PubMed/NCBI

[22] Google Scholar

[ref9] 9. Stewart AJ, Plotkin JB. From extortion to generosity, evolution in the Iterated Prisoner’s Dilemma. Proceedings of the National Academy of Sciences. 2013;110(38):15348–15353.
View Article
Google Scholar

[24] View Article

[25] Google Scholar

[ref10] 10. Hilbe C, Nowak MA, Traulsen A. Adaptive Dynamics of Extortion and Compliance. PLOS ONE. 2013;8(11):1–9.
View Article
Google Scholar

[27] View Article

[28] Google Scholar

[ref11] 11. Stewart AJ, Plotkin JB. Extortion and cooperation in the Prisoner’s Dilemma. Proceedings of the National Academy of Sciences. 2012;109(26):10134–10135.
View Article
Google Scholar

[30] View Article

[31] Google Scholar

[ref12] 12. Szolnoki A, Perc M. Evolution of extortion in structured populations. Physical Review E. 2014;89(2):022804.
View Article
Google Scholar

[33] View Article

[34] Google Scholar

[ref13] 13. Hilbe C, Röhl T, Milinski M. Extortion subdues human players but is finally punished in the prisoner’s dilemma. Nature communications. 2014;5:3976. pmid:24874294
View Article
PubMed/NCBI
Google Scholar

[36] View Article

[37] PubMed/NCBI

[38] Google Scholar

[ref14] 14. Wang Z, Zhou Y, Lien JW, Zheng J, Xu B. Extortion can outperform generosity in the iterated prisoner’s dilemma. Nature communications. 2016;7:11125. pmid:27067513
View Article
PubMed/NCBI
Google Scholar

[40] View Article

[41] PubMed/NCBI

[42] Google Scholar

[ref15] 15. Hilbe C, Wu B, Traulsen A, Nowak MA. Cooperation and control in multiplayer social dilemmas. Proceedings of the National Academy of Sciences. 2014;111(46):16425–16430.
View Article
Google Scholar

[44] View Article

[45] Google Scholar

[ref16] 16. Pan L, Hao D, Rong Z, Zhou T. Zero-determinant strategies in iterated public goods game. Scientific Reports. 2015;5.
View Article
Google Scholar

[47] View Article

[48] Google Scholar

[ref17] 17. Guo JL. Zero-determinant strategies in iterated multi-strategy games. ArXiv e-prints. 2014;.

[ref18] 18. McAvoy A, Hauert C. Autocratic strategies for iterated games with arbitrary action spaces. Proceedings of the National Academy of Sciences. 2016;113(13):3573–3578.
View Article
Google Scholar

[51] View Article

[52] Google Scholar

[ref19] 19. He X, Dai H, Ning P, Dutta R. Zero-determinant strategies for multi-player multi-action iterated games. IEEE Signal Processing Letters. 2016;23(3):311–315.
View Article
Google Scholar

[54] View Article

[55] Google Scholar

[ref20] 20. Hao D, Rong Z, Zhou T. Extortion under uncertainty: Zero-determinant strategies in noisy games. Phys Rev E. 2015;91:052803.
View Article
Google Scholar

[57] View Article

[58] Google Scholar

[ref21] 21. Mamiya A, Ichinose G. Strategies that enforce linear payoff relationships under observation errors in Repeated Prisoner’s Dilemma game. Journal of Theoretical Biology. 2019;477:63–76. pmid:31201882
View Article
PubMed/NCBI
Google Scholar

[60] View Article

[61] PubMed/NCBI

[62] Google Scholar

[ref22] 22. Daoud AA, Kesidis G, Liebeherr J. Zero-determinant strategies: A game-theoretic approach for sharing licensed spectrum bands. IEEE Journal on Selected Areas in Communications. 2014;32(11):2297–2308.
View Article
Google Scholar

[64] View Article

[65] Google Scholar

[ref23] 23. Zhang H, Niyato D, Song L, Jiang T, Han Z. Zero-determinant strategy for resource sharing in wireless cooperations. IEEE Transactions on Wireless Communications. 2016;15(3):2179–2192.
View Article
Google Scholar

[67] View Article

[68] Google Scholar

[ref24] 24. Hilbe C, Chatterjee K, Nowak MA. Partners and rivals in direct reciprocity. Nature human behaviour. 2018;2(7):469. pmid:31097794
View Article
PubMed/NCBI
Google Scholar

[70] View Article

[71] PubMed/NCBI

[72] Google Scholar

[ref25] 25. Hilbe C, Traulsen A, Sigmund K. Partners or rivals? Strategies for the iterated prisoner’s dilemma. Games and Economic Behavior. 2015;92:41–52. pmid:26339123
View Article
PubMed/NCBI
Google Scholar

[74] View Article

[75] PubMed/NCBI

[76] Google Scholar

[ref26] 26. Ichinose G, Masuda N. Zero-determinant strategies in finitely repeated games. Journal of Theoretical Biology. 2018;438:61–77. pmid:29154776
View Article
PubMed/NCBI
Google Scholar

[78] View Article

[79] PubMed/NCBI

[80] Google Scholar

[ref27] 27. Akin E. The iterated prisoner’s dilemma: good strategies and their dynamics. Ergodic Theory, Advances in Dynamical Systems. 2016; p. 77–107.
View Article
Google Scholar

[82] View Article

[83] Google Scholar

[ref28] 28. Akin E. What you gotta know to play good in the iterated prisoner’s dilemma. Games. 2015;6(3):175–190.
View Article
Google Scholar

[85] View Article

[86] Google Scholar

[ref29] 29. Shafarevich IR, Remizov AO. Linear Algebra and Geometry. New York: Springer; 2012.

[ref30] 30. von Neumann J, Morgensternx O. Theory of Games and Economic Behavior. 3rd ed. Princeton University Press; 1953.

[ref31] 31. Plan A. Symmetric n-player games; 2017.

[ref32] 32. Rubinstein A. Modeling bounded rationality. Massachusetts: MIT Press; 1998.

[ref33] 33. Hilbe C, Martinez-Vaquero LA, Chatterjee K, Nowak MA. Memory-n strategies of direct reciprocity. Proceedings of the National Academy of Sciences. 2017;114(18):4715–4720.
View Article
Google Scholar

[92] View Article

[93] Google Scholar

[ref34] 34. Murase Y, Baek SK. Seven rules to avoid the tragedy of the commons. Journal of theoretical biology. 2018;449:94–102. pmid:29678691
View Article
PubMed/NCBI
Google Scholar

[95] View Article

[96] PubMed/NCBI

[97] Google Scholar