Inability of a graph neural network heuristic to outperform greedy algorithms in solving combinatorial optimization problems like Max-Cut

Stefan Boettcher
Department of Physics, Emory University, Atlanta, GA 30322, USA

Matters Arising from: Martin J. A. Schuetz et al., Nature Machine Intelligence 4, 367 (2022), https://doi.org/10.1038/s42256-022-00468-6.

arXiv:2210.00623v1 [cond-mat.dis-nn] 2 Oct 2022

In Ref. [1], Schuetz et al. provide a scheme to employ graph neural networks (GNN) as a heuristic to solve a variety of classical, NP-hard combinatorial optimization problems. It describes how the network is trained on sample instances and how the resulting GNN heuristic is evaluated by applying widely used techniques to determine its ability to succeed. Clearly, the idea of harnessing the powerful abilities of such networks to "learn" the intricacies of complex, multimodal energy landscapes in such a hands-off approach seems enticing. And based on the observed performance, the heuristic promises to be highly scalable, with a computational cost linear in the input size n, although there is likely a significant overhead in the pre-factor due to the GNN itself. However, closer inspection shows that the reported results for this GNN are only minutely better than those for gradient descent and are outperformed by a greedy algorithm, for example, for Max-Cut. The discussion also highlights what I believe are some common misconceptions in the evaluation of heuristics.
[Figure 1: Results discussed in the text for various heuristics and bounds for the Max-Cut problem on a 3-regular random graph ensemble, (a) plotted as the raw cut size as a function of the number of nodes n (GNN, GD, EO, and the upper bound cut_ub), and (b) as an extrapolation plot of ⟨e_3⟩/3^{1/2} over 1/n according to Eq. (1), comparing gradient descent, GNN, GraphSAGE, greedy search, the EO heuristic, the 1-RSB value, and −P∗ (Parisi energy). Note that in (b), a fit (red-dashed line) to the EO data (circles) suggests a non-linear asymptotic correction ∼ 1/n^{2/3} [4].]

Among the variety of QUBO problems Ref. [1] considers in the numerical evaluation of their GNN, I want to focus the discussion here on Max-Cut. As explained in the context of their Eq. (7), it is derived from an Ising spin-glass Hamiltonian on a d-regular random graph [2] for d = 3. (In the physics literature, for historical reasons such a graph is often referred to as a Bethe lattice [3, 4].) Minimizing the energy of the Hamiltonian, H, maximizes the cut size, cut = −H. The cut results for the GNN (for both d = 3 and 5) are presented in Fig. 4 of Ref. [1], where they find cut ∼ γ_3 n with γ_3 ≈ 1.28 via an asymptotic fit to the GNN data, obtained from averaging over randomly generated instances of the problem for a progression of different problem sizes n. In Fig. 1(a) here, I have recreated their Fig. 4, based on the value of γ_3 reported for the GNN (blue line). As in Ref. [1], I have also included what they describe as a rigorous upper bound, cut_ub (black-dashed line), which derives from an exact result obtained for d = ∞ [5]. While the GNN results appear impressively close to that upper bound, including two other sets of data puts these results in a different perspective. The first set I obtained at significant computational cost (∼ n^3) with another heuristic ("extremal optimization", EO) long ago in Ref. [4] (black circles). The second set is achieved by a simple gradient descent (GD, maroon squares). GD sequentially looks at randomly selected (Boolean) variables x_i among those whose flip (x_i ↦ ¬x_i) will improve the cost function. (Such "unstable" variables are easy to track.) After only ∼ 0.4n such flips, typically no further improvements were possible and GD converged; very scalable and fast (done overnight on a laptop, averaging over 10^3 to 10^5 instances at each n, up to n = 10^5). Presented in the form of Fig. 1(a), the results all look rather good, although it is already noticeable that the results for GD are barely distinguishable from those of the elaborate GNN heuristic.
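Since GD serves as the central baseline here, a minimal sketch may help make it concrete. This is not the implementation used for Fig. 1, only an illustration of the scheme described above; it assumes the networkx package for graph generation, and the function name and parameters are my own:

```python
import random
import networkx as nx

def gradient_descent_maxcut(n, d=3, seed=0):
    """Sketch of the GD baseline: flip randomly chosen unstable
    spins on a random d-regular graph until none are left."""
    rng = random.Random(seed)
    G = nx.random_regular_graph(d, n, seed=seed)
    spin = {v: rng.choice((-1, 1)) for v in G}   # random initial partition

    # Flipping spin i changes the cut by spin[i] * h_i, with local field
    # h_i = sum of neighboring spins; i is "unstable" if that gain is > 0.
    def gain(v):
        return spin[v] * sum(spin[u] for u in G[v])

    unstable = {v for v in G if gain(v) > 0}
    flips = 0
    while unstable:
        v = rng.choice(tuple(unstable))          # random unstable variable
        spin[v] = -spin[v]
        flips += 1
        for w in (v, *G[v]):                     # only local stabilities change
            if gain(w) > 0:
                unstable.add(w)
            else:
                unstable.discard(w)

    cut = sum(1 for u, v in G.edges if spin[u] != spin[v])
    return cut, flips
```

Averaged over many instances, cut/n from such a descent should already land close to the GNN's fitted γ_3, converging after a number of flips of order n, consistent with the ∼ 0.4n quoted above.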
To discern further details, it is essential to present the data in a form that, at least, eliminates some of its trivial aspects. For example, as Schuetz et al. reference themselves, the ratio cut/n ∼ γ converges to a stable limit with γ ∼ d/4 + P∗ √(d/4) + o(√d) + o(n^0) for n, d → ∞ [6], where P∗ = 0.7632... [5]. In fact, for better comparison with Refs. [3, 4], we focus on the average ground-state energy density of the Hamiltonian in their Eq. (7) at n = ∞, which is related to γ via ⟨e_d⟩/√d = √(d/4) − γ √(4/d). (The awkward denominator is owed to the fact that P∗ = lim_{d→∞} ⟨e_d⟩/√d. Also, the energy provides a fair reference point to assess relative error, because a purely random assignment of variables results in an energy of zero, the ultimate null model. Such a reference point is lacking for the errors quoted in Tab. 1 of Ref. [1], for example.)
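As a concrete check of this relation (a back-of-the-envelope script, not taken from Ref. [1]; the 1-RSB energy ⟨e_3⟩ ≈ −1.2717 is the replica value of Refs. [3, 10], and the helper name is mine), one can convert the GNN's fitted γ_3 ≈ 1.28 into this energy scale:

```python
import math

# <e_d>/sqrt(d) = sqrt(d/4) - gamma * sqrt(4/d), rearranged: e = d/2 - 2*gamma
def energy_density(gamma, d=3):
    return (d / 2 - 2 * gamma) / math.sqrt(d)

e_gnn = energy_density(1.28)            # GNN fit from Ref. [1]: ~ -0.612
e_1rsb = -1.2717 / math.sqrt(3)         # 1-RSB replica value [3, 10]: ~ -0.734

print(f"GNN:   <e_3>/sqrt(3) ~ {e_gnn:.4f}")
print(f"1-RSB: <e_3>/sqrt(3) ~ {e_1rsb:.4f}")
print(f"relative error: {100 * (1 - e_gnn / e_1rsb):.1f}%")   # about 17%
```

This simple conversion already anticipates the systematic gap of more than 15% discussed below.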
More revealing than merely dividing by n is the transformation of the data into an extrapolation plot [4, 7]: Since we care about the scalability of the algorithm in the asymptotic limit of large problem sizes n → ∞, which in the form of Fig. 1(a) is out of view, it is expedient to visualize the data plotted over an inverse of the problem size (i.e., 1/n or some power thereof [4, 8, 9]). Independent of the largest sizes n achieved in the data, this conveniently condenses the asymptotic behavior arbitrarily close to the y-intercept, where 1/n → 0, albeit at the cost of sacrificing some data for smaller n. To this end, I propose to plot the data in the finite-size corrections form

    ⟨e_3⟩_n ∼ ⟨e_3⟩_{n=∞} + const/n + ...,   (n → ∞).   (1)
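For illustration, the extrapolation amounts to nothing more than a linear fit in the variable 1/n; the data below are hypothetical placeholders (in practice the averages come from simulations like those described above), and numpy is assumed:

```python
import numpy as np

# Hypothetical finite-size averages <e_3>_n (illustrative values only)
n = np.array([1_000, 2_000, 5_000, 10_000, 20_000, 50_000])
e3 = np.array([-0.7295, -0.7318, -0.7333, -0.7338, -0.7340, -0.7342])

# Eq. (1): <e_3>_n ~ <e_3>_{n=inf} + const/n is a line in x = 1/n,
# so the fitted y-intercept estimates the n -> infinity limit.
slope, intercept = np.polyfit(1.0 / n, e3, 1)
print(f"extrapolated <e_3>(n=inf) ~ {intercept:.4f}")
```

Plotted this way, each heuristic's data converges to its own intercept at 1/n = 0, which is exactly the comparison made in Fig. 1(b).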
In Fig. 1(b) we have plotted the same data from Fig. 1(a) according to Eq. (1) for d = 3 (modulo a trivial factor of 1/√3 for better comparison with P∗). Stark differences between the sets of data appear, since each set converges asymptotically to a stable but distinct limit at 1/n = 0. First, we note the addition of a well-known result from replica theory, a one-step replica symmetry-breaking (1-RSB) calculation [3, 10] that is expected to yield the actual value of ⟨e_3⟩_{n=∞} (and thus γ_3) to a precision of 10^{-4} (green line), a superior reference value to −P∗ (black-dashed line), which is valid only at d = ∞, although it seems sensible in the form of Fig. 1(a). The 1-RSB value is further emphasized by the fact that the EO data (black circles) from Ref. [4] smoothly extrapolate to the same limit within statistical errors. Finally, in the form of Fig. 1(b) it becomes apparent that the claimed GNN results (blue line) are systematically far (> 15% at any n) from optimal (1-RSB, green line) and hardly provide any improvement over pure gradient descent (GD, maroon squares). It appears that the GNN learns what is indeed most typical about the energy landscape: the vast prevalence of high-energy, poor-quality metastable solutions that gradient descent gets trapped in, missing the faint signature of exceedingly rare low-energy minima. In fact, extending GD by a subsequent 5n spin flips, say, each flip adjusting one among the least-stable spins (even if not always unstable), allows this greedy local search to explore several local minima, still at linear cost. The results of that simple algorithm, also shown in Fig. 1(b) (diamonds), already reduce the error to ≈ 6% across all sizes n, a considerable improvement on the GNN results in Ref. [1] and still better than an improved version, GraphSAGE, that the authors mention in their response (orange line).
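A sketch of that greedy extension follows (again an illustration, not the code behind Fig. 1: networkx is assumed, the 5n budget and the least-stable-spin rule are from the text, while the random-sample shortcut used to locate a near-least-stable spin is my own simplification):

```python
import random
import networkx as nx

def greedy_search_maxcut(n, d=3, budget=5, seed=0):
    """GD followed by ~budget*n flips of a (near-)least-stable spin,
    keeping the best cut seen while hopping between local minima."""
    rng = random.Random(seed)
    G = nx.random_regular_graph(d, n, seed=seed)
    spin = {v: rng.choice((-1, 1)) for v in G}

    def gain(v):                 # change of the cut if v were flipped
        return spin[v] * sum(spin[u] for u in G[v])

    # Phase 1: plain gradient descent, as in the previous sketch.
    unstable = {v for v in G if gain(v) > 0}
    while unstable:
        v = unstable.pop()
        if gain(v) <= 0:         # entry may have become stale
            continue
        spin[v] = -spin[v]
        unstable.update(w for w in G[v] if gain(w) > 0)

    cut = sum(1 for u, w in G.edges if spin[u] != spin[w])
    best, nodes = cut, list(G)

    # Phase 2: budget*n more flips of the least-stable spin found in a
    # small random sample (even if its gain is negative), still O(n) cost.
    for _ in range(budget * n):
        v = max(rng.sample(nodes, min(32, n)), key=gain)
        cut += gain(v)           # gain(v) is exactly the cut change
        spin[v] = -spin[v]
        best = max(best, cut)
    return best
```

The point of the sketch is only that such a search remains linear in n while visiting several local minima, which, per the results above, is already enough to cut the GNN's relative error by more than half.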
In conclusion, the study in Ref. [1] exemplifies a number of common shortcomings found in the analysis of optimization heuristics (see also Ref. [7]): (1) reliance on rigorous but rather poor and often meaningless bounds, as provided by the Goemans-Williamson algorithm in this case, instead of using the much more relevant (albeit as-of-yet unproven) results from statistical physics; (2) use of an obscure presentation of the data; (3) lack of state-of-the-art comparisons across different areas of science; and (4) lack of benchmarking against trivial baseline models such as the gradient descent or greedy search presented here. On such closer inspection, the proposed GNN heuristic does not provide much algorithmic advantage over that baseline. It is likely that these conclusions are not isolated to this specific example but would also hold for Max-Cut at d = 5 and for the other QUBO applications discussed in Ref. [1], as the concurrent comment by Angelini and Ricci-Tersenghi (arXiv:2206.13211) indicates.

[1] M. J. A. Schuetz, J. K. Brubaker, and H. G. Katzgraber, Nature Machine Intelligence 4, 367 (2022).
[2] Technically, their Hamiltonian in Eq. (7) pertains to an antiferromagnet instead of a spin glass, but on such random graphs both are equivalent [11].
[3] M. Mezard and G. Parisi, J. Stat. Phys. 111, 1 (2003).
[4] S. Boettcher, The European Physical Journal B - Condensed Matter 31, 29 (2003).
[5] G. Parisi, J. Phys. A 13, L115 (1980).
[6] A. Dembo, A. Montanari, and S. Sen, The Annals of Probability 45, 1190 (2017).
[7] S. Boettcher, Physical Review Research 1, 033142 (2019).
[8] S. Boettcher, Journal of Statistical Mechanics: Theory and Experiment 2010, P07002 (2010).
[9] S. Boettcher, Physical Review Letters 124, 177202 (2020).
[10] M. Mezard and G. Parisi, Europhys. Lett. 3, 1067 (1987).
[11] L. Zdeborová and S. Boettcher, Journal of Statistical Mechanics: Theory and Experiment 2010, P02020 (2010).
