Skip to main content

First-order algorithm with \({\mathcal{O}({\rm ln}(1{/}\epsilon))}\) convergence for \({\epsilon}\)-equilibrium in two-person zero-sum games

  • Full Length Paper
  • Series A
  • Published:
Mathematical Programming Submit manuscript

Abstract

We propose an iterated version of Nesterov’s first-order smoothing method for the two-person zero-sum game equilibrium problem

$$\min_{x \in Q_1}\max_{y \in Q_2} {x}^{\rm T}{Ay} = \max_{y \in Q_2} \min_{x \in Q_1} {x}^{\rm T}{Ay}.$$

This formulation applies to matrix games as well as sequential games. Our new algorithmic scheme computes an \({\epsilon}\)-equilibrium to this min-max problem in \({\mathcal {O}\left(\frac{\|A\|}{\delta(A)} \, {\rm ln}(1{/}\epsilon)\right)}\) first-order iterations, where δ(A) is a certain condition measure of the matrix A. This improves upon the previous first-order methods which required \({\mathcal {O}(1{/}\epsilon)}\) iterations, and it matches the iteration complexity bound of interior-point methods in terms of the algorithm’s dependence on \({\epsilon}\). Unlike interior-point methods that are inapplicable to large games due to their memory requirements, our algorithm retains the small memory requirements of prior first-order methods. Our scheme supplements Nesterov’s method with an outer loop that lowers the target \({\epsilon}\) between iterations (this target affects the amount of smoothing in the inner loop). Computational experiments both in matrix games and sequential games show that a significant speed improvement is obtained in practice as well, and the relative speed improvement increases with the desired accuracy (as suggested by the complexity bounds).

This is a preview of subscription content, log in via an institution to check access.

Access this article

Subscribe and save

Springer+ Basic
$34.99 /Month
  • Get 10 units per month
  • Download Article/Chapter or eBook
  • 1 Unit = 1 Article or 1 Chapter
  • Cancel anytime
Subscribe now

Buy Now

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  1. Bienstock D.: Potential Function Methods for Approximately Solving Linear Programming Problems. Kluwer International Series, Dordrecht (2002)

    MATH  Google Scholar 

  2. Dantzig G.: Linear Programming and Extensions. Princeton University Press, Princeton (1963)

    MATH  Google Scholar 

  3. Gilpin, A., Sandholm, T., Sørensen, T.B.: Potential-aware automated abstraction of sequential games, and holistic equilibrium analysis of Texas Hold’em poker. In: Proceedings of the National Conference on Artificial Intelligence (AAAI), pp. 50–57. AAAI Press, Vancouver (2007)

  4. Goffin J.-L.: On the convergence rate of subgradient optimization methods. Math. Program. 13, 329–347 (1977)

    Article  MathSciNet  MATH  Google Scholar 

  5. Hirriart-Urruty J., Lemaréchal C.: Fundamentals of Convex Analysis. Springer, Berlin (2001)

    Book  Google Scholar 

  6. Hoda S., Gilpin A., Peña J., Sandholm T.: Smoothing techniques for computing Nash equilibria of sequential games. Math. Oper. Res. 35(2), 494–512 (2010)

    Article  MathSciNet  MATH  Google Scholar 

  7. Koller D., Megiddo N.: The complexity of two-person zero-sum games in extensive form. Games Econ. Behav. 4(4), 528–552 (1992)

    Article  MathSciNet  MATH  Google Scholar 

  8. Lan, G., Lu, Z., Monteiro, R.D.C.: Primal-dual first-order methods with \({{O}(1{/}\epsilon)}\) iteration-complexity for cone programming. (to appear in Math Program) (2010)

  9. McMahan, H., Gordon, G.J.: A fast bundle-based anytime algorithm for poker and other convex games. In: Proceedings of the 11th International Conference on Artificial Intelligence and Statistics (AISTATS), San Juan, Puerto Rico (2007)

  10. Mordukhovich, B., Peña, J., Roshchina, V.: Computation of a condition measure of a smoothing algorithm for matrix games. (to appear in SIAM J. Optim.) (2010)

  11. Nesterov Y.: A method for unconstrained convex minimization problem with rate of convergence O(1/k 2). Doklady AN SSSR 269, 543–547 (1983) (Translated to English as Soviet Math. Docl.)

    MathSciNet  Google Scholar 

  12. Nesterov Y.: Excessive gap technique in nonsmooth convex minimization. SIAM J. Optim. 16(1), 235–249 (2005)

    Article  MathSciNet  MATH  Google Scholar 

  13. Nesterov Y.: Smooth minimization of non-smooth functions. Math. Program. 103, 127–152 (2005)

    Article  MathSciNet  MATH  Google Scholar 

  14. Osborne M., Rubinstein A.: A Course in Game Theory. MIT Press, Cambridge (1994)

    MATH  Google Scholar 

  15. Romanovskii I.: Reduction of a game with complete memory to a matrix game. Sov. Math. 3, 678–681 (1962)

    Google Scholar 

  16. Shi, J., Littman, M.: Abstraction methods for game theoretic poker. In: CG ’00: Revised Papers from the Second International Conference on Computers and Games, London, UK, pp. 333–345. Springer, Berlin (2002)

  17. Smola, A.J., Vishwanathan, S.V.N., Le, Q.: Bundle methods for machine learning. In: Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS), Vancouver, Canada (2007)

  18. von Stengel B.: Efficient computation of behavior strategies. Games Econ. Behav. 14(2), 220–246 (1996)

    Article  MATH  Google Scholar 

  19. Wright S.J.: Primal-Dual Interior-Point Methods. SIAM, Philadelphia (1997)

    Book  MATH  Google Scholar 

  20. Ye Y., Todd M., Mizuno S.: An \({o(\sqrt{n}{L})}\)-iteration homogeneous and self-dual linear programming algorithm. Math. Oper. Res. 19, 53–67 (1994)

    Article  MathSciNet  MATH  Google Scholar 

  21. Zinkevich, M., Bowling, M., Burch, N.: A new algorithm for generating equilibria in massive zero-sum games. In: Proceedings of the National Conference on Artificial Intelligence (AAAI), Vancouver, Canada (2007)

  22. Zinkevich, M., Bowling, M., Johanson, M., Piccione, C.: Regret minimization in games with incomplete information. In: Proceedings of the Annual Conference on Neural Information Processing Systems (NIPS), Vancouver, Canada (2007)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Javier Peña.

Additional information

A short early version of this paper appeared at the National Conference on Artificial Intelligence (AAAI), 2008.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Gilpin, A., Peña, J. & Sandholm, T. First-order algorithm with \({\mathcal{O}({\rm ln}(1{/}\epsilon))}\) convergence for \({\epsilon}\)-equilibrium in two-person zero-sum games. Math. Program. 133, 279–298 (2012). https://doi.org/10.1007/s10107-010-0430-2

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s10107-010-0430-2

Mathematics Subject Classification (2000)

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy