0% found this document useful (0 votes)

265 views15 pages

David Diagonalization

This document discusses the Davidson diagonalization method and its application to electronic structure calculations. The Davidson method is a subspace iterative diagonalization approach that appends modified (preconditioned) residue vectors to the subspace during iterations, unlike Krylov space methods. The original Davidson method converges fast for diagonally dominant matrices but slower otherwise. Improved versions were developed that generate new expansion vectors in different ways to address this, such as the improved invariant subspace method (IIGD) and generalized Jacobi-Davidson (GJD) method. The performance of these methods is evaluated on an electronic structure Hamiltonian diagonalization problem and compared to the preconditioned conjugate gradient method.

Uploaded by

fangzhou

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

265 views15 pages

David Diagonalization

Uploaded by

fangzhou

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 15

DAVIDSON DIAGONALIZATION METHOD AND ITS

APPLICATION TO ELECTRONIC STRUCTURE CALCULATION∗

BOLIN LIAO†

Abstract.

Davidson algorithm is a subspace iterative diagonalization method, which ap-

pends to the subspace with modified (preconditioned) residue vectors during the it-
eration process, in contrast to Krylov space methods. In this report, different views
of understanding Davidson-type methods are reviewed, and a variety of ways of gen-
erating the new expansion vector are implemented in Matlab and their performances
compared. It is observed that the original Davidson method converges fast for diago-
nally dominant matrices, but relatively slow otherwise, which could be fixed to some
extent by using modified methods. A “quasi-realistic” problem, the diagonalization
of the Hamiltonian of a planewave-pseudopotential based electronic structure calcu-
lation, is solved using Davidson methods, and the result is compared with the other
mainstream algorithm, preconditioned non-linear Conjugate Gradient (CG) method.
Key words. Iterative Diagonalization, Subspace Methods, Davidson Method

1. Introduction. Large-scale symmetric eigenvalue problems arise from a vari-

ety of contexts of scientific and engineering computing, and examples include configu-
ration interaction calculations in quantum chemistry, electronic structure calculation
of solids, and solving eigenmodes of electromagnetic wave propagation, etc. In some
application, only the lowest one or few eigenpairs are of interest. If the dimension
of the problem is not “too large”, direct or dense methods are available[1], the cost
of which grows like N 3 and the direct access memory requirements grows liks N 2 .
When N becomes much larger, however, the direct methods are not affordable any
more, and iterative methods are generally used in search of the lowest (or highest,
or some specific ones) eigenpairs. One of the most important categories of iterative
eigen algorithms consists of subspace methods[2], whose general principle is to project
the full matrix into smaller subspace, solve a smaller eigen problem in the subspace,
and use the eigenvalues and eigenvectors in the subspace problem (they are called
Ritz values and Ritz vectors) to approximate the true eigenpairs. Krylov space al-
gorithms (such as Lanczos algorithm) are familiar examples of subspace methods,
which construct the subspace given matrix H and initial guess vector b as the span
of vectors {b, Hb, H2 b, ..., Hn−1 b)}. Although the Krylov space methods elegantly
demonstrate how to construct and expand the subspace, they usually fail to favor
certain eigenpair that is wanted, and lead to slow convergence.
Davidson[3] came up with an idea of expanding the subspace in such a way that
certain eigenpair(s) would be favored. Bearing in mind the fact that if certain true
eivenvector lies in the subspace of current iteration, the eigen problem in the subspace
would give the exact corresponding eigenpair. Thus to achieve fast convergence, a
better way to expand the subspace is to choose the new expansion vector to be the
component of the error vector which is orthogonal to the subspace[2]. If this orthogo-
nal component could be solved exactly and added to the subspace, then convergence

∗ Thisis the report of the course project for Fall 2011 18.335J by Prof. Steven Johnson
† Department of Mechanical Engineering, Massachusetts Institute of Technology, 77 Massachusetts
Avenue, Cambridge, MA, 02139 (bolin@mit.edu).
1
2 BOLIN LIAO

is guaranteed to be achieved in the next iteration (in exact arithmetic).

Denote the true eigenvector as v, the current approximate eigenvector as x, the
orthogonal component of the error vector as δ, and the eigen equation becomes
(1.1) H(x + δ) = λ(x + δ),
In above λ is the true eigenvalue, and can be writen as λ = ρ + ǫ, with ρ = xT Hx
being the Rayleigh quotient and ǫ being the error. And equation (1.1) can be rewritten
as
(1.2) (H − ρ − ǫ)δ = −(H − ρ − ǫ)x
= −r + ǫx
Here r = (H − ρI)x is the residue vector at current iteration step. Solving δ and
appending it to the subspace is the essence of Davidson methods, but since solving
δ from equation (1.2) is nontrivial (a linear system with the same dimension as the
original problem), different schemes of obtaining δ in an approximate but efficient way
are devised, forming various flavors of Davidson methods, which will be reviewed in
the following subsections. Here, the main steps in a typical Davidson algorithm are
sketched below:
Initialization To solve the lowest k eigenvectors, a guess eigenspace with dimension
l (l ≥ k) and orthonormal basis {b1 , b2 , ..., bl } is constructed
Subspace Problem Form the matrix-vector products {Hb1 , Hb2 , ..., Hbl }, and cal-
culate the matrix elements in the subspace H e ij = bT Hbj , and solve for the
i
eigenvalues {ρi } and eigenvectors {xi } in the subspace as the approximate
eigenpairs.
Compute Residue Calculate the residue vector ri = (H − ρi I)xi , and check if
the convergence is achieved as kri k ≤ tolerance. If the convergence is not
achieved, continue to the next step.
Compute Correction Vector Calculate the correction vector based on the residue
vector and approximate eigenpairs. This part is where the main variations
of the algorithm exist, and has large impact on the performance. Different
schemes will be discussed in detail in later sections.
Expand the Subspace Orthogonalize the correction vector to the previous sub-
space using (modified) Gram-Schmidt scheme, and append the orthonormal-
ized correction vector to the previous subspace and repeat the iteration from
step 2 (only matrix elements involved with the new vector need to be com-
puted) until the convergence is achieved.
1.1. Diagonal-Preconditioned-Residue (DPR) method. In his original pa-
per, Davidson[3] argued that for diagonally dominant matrices, one can replace H in
(1.2) by its diagonals (denoted as D here), and assume ǫ is negligible, thus can be
removed from (1.2), and one ends up with the equation
(1.3) δ = −(D − ρI)−1 r
Although the equation (1.3) now becomes trivial to solve, the approximations
made seem too crude at the first sight. It turns out that there are different ways
of understanding this correction vector δ, which give more motivations and insights.
Consider the Rayleigh quotient of the trial eigenvector x
xT Hx
(1.4) ρ(x) =
xT x
DAVIDSON DIAGONALIZATION METHODS 3

Now we would like to minimize the Rayleigh quotient by varying x, but instead of
varying x along certain direction (as we usually do in steepest descent and conjugate
gradient method), we vary one component of x while holding all the other components
fixed. Specifically, if one varies the ith component xi by an amount δi , the optimum
choice of δi from

∂ρ
(1.5) =0
∂xi xi +δi

is just given by

(1.6) δi = −(Hii − ρ)−1 ri

This expression looks similar to (1.3), except that ρ and ri here are evaluated at
x + δi ei . In this sense the correction vector given by equation (1.3) could also be
interpreted as an approximation of the optimum variation vector that minimizes the
Rayleigh quotient locally. A third interpretation of equation (1.3) has to do with
Rayleigh Quotient Inverse Iteration (RQII)[4]. Consider a RQII step

(1.7) xnew = (H − ρ)−1 x

It was shown in class that RQII exhibits cubic convergence when the approximate
eigenvector approaches the true one. Now if one imposes that the correction in each
step be orthogonal to the previous trial vector, i.e. xnew = (x + δ)/ǫ, where xT δ = 0
and ǫ plays a role as the normalization factor, then equation (1.7) could be rewritten
as (according to Davidson[3], a modified Newton-Raphson equation)

(1.8) x + δ = (H − ρI)−1 x/ǫ

After operating xT (H − ρI)−1 on both sides on the left, ǫ can be solved as

1
(1.9) ǫ= ≈λ−ρ
xT (H − ρI)−1 x

This observation is consistent with the previous discussions. Given ǫ the equation
(1.8) can be rewritten as

(1.10) (ρI − H)(x + δ) ≈ ǫx

X
(1.11) (ρ − Hii )δi ≈ ri + Hij δj + ǫxi
j6=i

From this point of view, equation (1.3) is also an approximate form of an orthog-
onal correction vector in one step of RQII. Davidson[5] and Pulay[6] also pointed out
that equation (1.3) is a form of diagonal-preconditioned (Jacobi-type) gradient of the
Rayleigh quotient, after which this original Davidson method is also referred to as
Diagonal-Preconditioned-Residue (DPR) method. From the discussion given above,
especially the approximation made along the way, one can tentatively predict that
this original flavor only works well in diagonally dominant matrices, which will be
verified in experiments shown in the next two sections.
4 BOLIN LIAO

1.2. Improved versions: IIGD, GJD and RQII. Some slightly modified
versions of Davidson methods were proposed in late 80s and 90s, and the basic idea
was to add correction terms that were dropped in the original version back and try
to maintain an efficient way of evaluating the correction vector. A brief review was
given in the Appendix of reference[11]. Olsen et al.[4] proposed that adding the ǫx
term back and the correction vector can be given by

(1.12) δ = (D − ρI)−1 (−r + ǫx)

By enforcing the orthogonality relation xT δ = 0, ǫ can be solved as

xT (D − ρI)−1 r
(1.13) ǫ=
xT (D − ρI)−1 x
Because of the resemblance of the correction vector to that from RQII, this method
was named as Invers-Iteration generalized Davidson (IIGD) method.
Sleijpen et al.[7][8][9] suggested a further improvement over IIGD, the Generalized
Jacobi Davidson (GJD), which is also explained in detail in the online book Templates
for the Solution of Algebraic Eigenvalue Problems[2]. By operating on both sides of
the RQII equation (1.10) the projector (1 − xxT ), ǫ can be removed explicitly from
the equation, and after reorganizing the RQII equation can be written as a projected
form

(1.14) (I − xxT )(H − ρI)(I − xxT )δ = −r

Here H e = (I − xxT )(H − ρI)(I − xxT ) is the projected matrix onto the subspace
which is orthogonal to x. In the original paper it is suggested to solve the equation
(1.14) approximately, for example, by some steps of MINRES. But in practice, any
efficient iterative linear solver (conjugate gradient, for example) can be utilized to solve
the GJD equation at each iteration step. Along the same path, it is also possible to
solve the RQII equation directly at each step by efficient linear solvers. By including
more correction terms back into the original recipe, these improved versions aim
at improving the performance of Davidson method when applied to non-diagonally-
dominant matrices. The results and comparison will be given in subsequent sections.
There are other modifications to Davidson methods, which are mainly concerned
about optimizing the correction vector and speeding up the convergence. Since space
is limited, those minor modifications will not be described here, and a thorough review
by Leininger et al.[10] is available.
1.3. Subspace Projected Approximate Matrix (SPAM) modification.
Taking into account the fact that when the matrix dimension becomes extremely large,
the most time-consuming parts of the iterative algorithms are the matrix-vector prod-
ucts, an extension of Davidson method called Subspace Projected Approximate Matrix
(SPAM)[11] was designed, aiming at reducing number of “exact” matrix-vector prod-
ucts as much as possible, in a flexible and adaptive way. Assume at certain iteration
step, the subspace vectors are given by columns of matrix B, and the matrix-vector
products are computed as columns of matrix W.Thus the subspace representation of
H is given by H̄ = BT HB = BT W. Define the orthogonal projector P = BT B and
the complementary projector Q = I − P, then the original matrix H can be written
equivalently as

H = (P + Q)H(P + Q)
DAVIDSON DIAGONALIZATION METHODS 5

(1.15) = PHP + PHQ + QHP + QHQ

= (BH̄BT + BWT Q + QWXT ) + QHQ

When computing the matrix-vector product Hy, the first three terms in equation
(1.15) are easy to handle since B and W are available and have low dimensions. The
basic idea of SPAM algorithm is to approximate H in the fourth term by another
matrix H1 whose matrix-vector products H1 y require less effort to compute. Choice
of H1 is flexible and problem-dependent and can be a less dense matrix than H or
some formal or algebraic approximation to H. Given H1 , the original matrix H can
now be approximated by a “SPAM” matrix HSP AM

(1.16) HSP AM = (BH̄BT + BWT Q + QWXT ) + QH1 Q

One good property of HSP AM is that for any vector y ∈ span(B), equation (1.16)
indicates that HSP AM y = Hy, which means if the column space of B converges to
the eigenspace of HSP AM , this eigenspace is exactly the eigenspace of H. Thus one
can solve the eigenproblem of HSP AM instead and use the eigenvectors of HSP AM
to append to the previous subspace (span(B)). To update W, one “exact” matrix-
vector product involving H is required. To solve the eigenproblem of HSP AM (with
the same dimension as the original problem), an iterative Davidson method is utilized,
which would be cheap, thanks to another good property of HSP AM : for any vector
x⊥ orthogonal to the column space of B , the matrix-vector product takes the simple
form

(1.17) HSP AM x⊥ = w1 + B(WT x⊥ − BT w1 )

where w1 = H1 x⊥ is the inexpensive matrix-vector product. Here the David-

son method serves as the inner loop of an outer iterative algorithm. By virtue of
this nested structure, matrix-vector products in the inner loop Davidson are cheap,
whereas the outer loop “exact” matrix-vector might be expensive, but the necessary
number of them are minimized. In large-scale applications, this nested structure even
provides the possibility to subdivide the inner loop problem to single steps, each of
which could be treated by a Davidson iteration, forming a multilevel algorithm, with
more degrees of freedom to tune to adapt to specific problems.
1.4. Relation to Lanczos and Gradient-based methods. There is a close
relation between Davidson method and Lanczos method[12][11]. If the diagonal-
precondition part is ignored in evaluating the correction vector, i.e.

(1.18) δ = −r

It is easily seen that the residue vector lies in an expanding Krylov space, so is the
correction vector. By generating correction vectors this way Davidson method is
reduced to an explicit-orthogonalization Lanczos method. Although Lanczos method
seems more elegant (only two latest trial vectors need to be stored, and the subspace-
projected matrix is tridiagonal), it suffers from slow convergence due to the fact that
it does not selectively converge to the desired eigenpair of interest.
On the other hand, since the correction vector for the Davidson method can
be interpreted as just the gradient of the Rayleigh quotient preconditioned in some
6 BOLIN LIAO

way, there are also connections between the Davidson method and gradient-based
methods[1][13], such as steepest descent (SD) and conjugate gradient (CG) method.
They all compute the correction vector from the residue somehow (with certain kinds
of preconditioning) but use it differently: SD and CG use the correction vector as
the search direction for the next step whereas Davidson methods use it to expand the
subspace. Detailed comparison with respect to convergence performance in a realistic
problem will be given in a later section, and also the possibility to combine these
methods will be discussed.
1.5. Block Davidson Method and Subspace Collapse. Another remarkable
feature of Davidson method is that it can be easily extended to computing a few lowest
eigenpairs simultaneously. This type of Davidson method is called Block Davidson or
Davidson-Liu[14] algorithm. The basic idea is that instead of adding one new vector
at each iteration, a few new vectors, corresponding to the residue vectors of different
eigenpairs, will be added at each iteration, tuning the subspace eigenvetors to converge
at the same time.
Another extension of Davidson method is the subspace collapse technique[6], sim-
ilar to the restart scheme used in Lanczos method, which could reduce the memory
requirement. The basic idea is to choose the optimal approximate eigenvectors already
obtained and restart with an initial subspace expanded by the optimal approximate
eigenvectors.
2. Implementation and Performance Test.
2.1. Diagonally Dominant Matrices. First the original DPR Davidson method
[3][15] is implemented and applied to diagonally dominant matrices. For simplicity
and convenient comparison, only the lowest eigenpair is to be solved. Block Davidson
method, which solves a few lowest eigenpairs simultaneously will be demonstrated
separately, but not used to compare different flavors.
The convergence performance of DPR is demonstrated in figure 2.1.

Convergence Curve of DPR (N=1000) Convergence Curve of Davidson (N=5000)

0 0
10 10

−2 −2
10 10
Normalized norm of the residue

Normalized norm of residue

−4 −4
10 10

−6 −6
10 10

−8 −8
10 10

−10 −10
10 10

−12 −12
10 10
0 5 10 15 20 25 0 5 10 15 20 25
Number of Iterations Number of Iterations

(a) N = 1000 (b) N = 5000

Fig. 2.1: The convergence curve of DPR method, applied to randomly generated
diagonally dominant matrices

Below 30 iterations achieve the convergence of 10−10 , the convergence curves are
DAVIDSON DIAGONALIZATION METHODS 7

smooth and the number of iterations needed seems not to increase with the dimension
of the problem, which is quite amazing. Although these convergence curves demon-
strate the typical “successful” behavior of Davidson method, further study shows that
the performance is much more complicated and depends on a lot of factors, and can
be very sensitive. A good analysis of the convergence behavior of Davidson from the
perspective of the spectrum of a preconditioned Krylov problem was given by Morgan
and Scott[16]. Consider the operator N(ρ) = (D − ρI)−1 (H − ρI), then every cor-
rection vector generated during Davidson iteration is given by N operating on some
vector. Now if ρ were a constant, then the subspace generated using Davidson itera-
tion is just a Krylov space generated by powers of N. Then the methods of analyzing
the Krylov space methods may be considered here. Faster convergence of Arnoldi or
Lanczos method can be achieved (as we learned from class) if the gap ratio (relative
separation) of the spectrum of the matrix is large. Of course ρ is not a constant
here, but (ideally) converges to certain eigenvalue of H, so the spectrum of N when
ρ is near certain eigenvalue of H is crucial to the convergence rate of Davidson. An
extreme example is that when H is diagonal, then all eigenvalues of N are the same
(1), thus Davidson method is expected to perform badly (actually fails in exact arith-
metic, since δ = −x in this case and lies in the previous subspace). Known from
analysis of preconditioners of gradient-based methods, (D − ρI)−1 tends to compress
the spectrum of (H − ρI), which is a preferred property for gradient-based methods,
while in eigenvalue problems, an increased gap ratio is the desired property. From this
point of view, original Davidson method is expected to perform well (or better than
Lanczos method) only if after preconditioning (multiplied by (D − ρI)−1 ) the ratio
gap is increased and also the corresponding eigenvalue of N is not clustered with other
eigenvalues. So even in the seemingly simplest case of diagonally-dominant matrices,
the convergence behavior of Davidson method could be rather complicated. Figure
2.2a illustrates one typical situation where Davidson does not do so well. The algo-
rithm seems to converge to other eigenvalues at first (corresponding to deeps in the
norm of the residue) and then figures out that smaller eigenvalue exists and adjusts
to it at the end. This behavior may be explained by the analysis given above that
the corresponding eigenvalue of operator N resides in the interior instead of being
well separated from other eigenvalues. IIGD is devised to improve DPR in diago-
nally dominant matrices, and the performance of IIGD applied to the same matrix
is displayed in figure 2.2b. Although total number of iterations is reduced, stronger
oscillations are observed in IIGD, which may be dangerous because if the convergence
threshold is not set small enough, the algorithm may end up converging to a higher
eigenvalue.
In addition to explaining when Davidson does not perform very well, the anal-
ysis given above also sheds light upon further improving Davidson method by other
preconditioning method in specific problems, which will be demonstrated in a later
section.

2.2. Non-Diagonally-Dominant Matrices. Figure 2.3a demonstrates how

original Davidson method totally fails when dealing with non-diagonally-dominant
matrices (the test matrix is generated as H = AT A, where A is a random matrix).
Only when the dimension of the subspace is almost the same as the dimension of the
original problem does the algorithm converge. This is expected since in this scenario
the diagonal-preconditioning does not make improvement any more. Also shown in
figure 2.3b is how IIGD also fails when applied to the same matrix.
On the other hand, GJD and RQII are devised to work in both cases (since no
8 BOLIN LIAO

DPR convergence curve (N=1000) IIGD convergence curve (N=1000)

0
10 10
0

−2 −2
10 10

Normalized norm of the residue

−4
Normalized norm of residue

−4
10 10

−6 −6
10 10

−8 −8
10 10

−10 −10
10 10

−12 −12
10 10
0 20 40 60 80 100 120 0 10 20 30 40 50 60
Number of iterations Number of iterations

(a) DPR (b) IIGD

Fig. 2.2: An example of slow-converging DPR and performance of IIGD applied to

the same matrix

DPR convergence curve for non diagonally dominant matrix (N=100) IIGD convergence for non diagonally dominant matrix (N=100)
4 4
10 10

2 2
10 10

0 0
10 10

−2
Normalized norm of residue

−2
10
Normalized norm of residue

−4 −4
10 10

−6 −6
10 10

−8 −8
10 10

−10 −10
10 10

−12 −12
10 10

−14 −14
10 10
0 10 20 30 40 50 60 70 80 90 100 0 10 20 30 40 50 60 70 80 90 100
Number os iterations Number of iterations

(a) DPR (b) IIGD

Fig. 2.3: DPR and IIGD fail in dealing with non-diagonally-dominant matrices

diagonal approximation made when solving the correction vector). Figure 2.4 demon-
strates how they work when applied to the same matrix. They converge in around 40
steps, but the price to pay is (much) larger at each iteration. Another problem is that
when ρ converges to certain eigenvalue, the linear system to be solved at each step is
near singular, which may cause problems and slow convergence. Figure 2.5 shows that
when applied to a larger non diagonally dominanat matrix, they converge slowly and
other methods (such as gradient-based methods) may be more advantageous provided
the increased cost at each step.
DAVIDSON DIAGONALIZATION METHODS 9

GJD convergence curve for non diagonally dominant matrix (N=100) RQII convergence curve for non diagonally matrix (N=100)
4 4
10 10

2
2 10
10

0
10
0
10
−2

Normalized norm of residue

10
Normalized norm of residue

−2
10
−4
10
−4
10
−6
10
−6
10
−8
10

−8
10 −10
10

−10
10 −12
10

−12 −14
10 10
0 5 10 15 20 25 30 35 40 0 5 10 15 20 25 30 35 40 45
Number of iterations Number of iterations

(a) GJD (b) RQII

Fig. 2.4: GJD and RQII perform better for non diagonally dominant matrices

GJD convergence curve for non diagonally dominant matrix (N=1000) RQII convergence curve for non diagonally dominant matrix (N=1000)
6 6
10 10

4 4
10 10

2 2
10 10

0 0
Normalized norm of residue

10
Normalized norm of residue

−2 −2
10 10

−4 −4
10 10

−6 −6
10 10

−8 −8
10 10

−10 −10
10 10

−12 −12
10 10
0 50 100 150 200 250 0 50 100 150 200 250
Number of iterations Number of iterations

(a) GJD (b) RQII

Fig. 2.5: GJD and RQII perform not so well for larger non diagonally dominant
matrices

2.3. Block Davidson Method. To solve the lowest few eigenpairs, one can of
course solve the eigenpairs one by one sequentially using Davidson from the lowest
one, where at each step how to pick the correction vector is a little bit subtle[12]. A
more powerful method is to solve for the lowest few eigenpairs simultaneously, using
Block Davidson Algorithm[14]. Suppose now we are interested in the lowest k eigen-
pairs. Starting by a guess subspace with dimention l (l ≥ k), the only modification
to the original Davidson method is that at each step, all the correction vectors cor-
responding to the first k approximate eigenvectors are solved. Then orthonormalize
the first correction vector against the previous subspace, and append it to the sub-
10 BOLIN LIAO

space. Repeat this process for each of the other k − 1 correction vectors, neglecting
any new vector whose norm after normalization is less than some threshold. Then
solve the subspace problem again. This extension is simple and powerful, and keeps
all the properties of the original Davidson method, except that the dimension of the
subspace problem will grow faster during iterations.
Block Davidson Algorithm is implemented in Matlab, and the simultaneous con-
vergence curve is show in figure 2.6.

Simultaneous Convergen Curve for Blocked Davidson (DPR) (N=1000)

−2
10
Eival1
Eival2
Eival3
−4 Eival4
10

−6
Normalized norm of residue

−8
10

−10
10

−12
10

−14
10
0 5 10 15 20 25
Number of iterations

Fig. 2.6: The lowest 4 eigenvalues converge uniformly when applying Block Davidson
to a 1000 × 1000 diagonally dominant matrix.

3. Application to a “Quasi-Realistic” Problem. First-principles calcula-

tion of electronic structures of solids has been an indispensable tool in modern research
fields like material science and engineering and condensed matter physics. Among a
couple of state-of-the-art methods, Pseudopotential-planewave based method[17] has
long been thought of as the most natural one for the built-in periodic boundary con-
dition. In a pseudopotential-planewave formulation, the single-electron Hamiltonian
at a given point k in the first Brllouin zone takes the following form in the planewave
basis {ψi = Ω1 exp (i(k − Gi ))}, where {Gi } are reciprocal vectors.

(k − Gi )2
(3.1) Hii =
2
(3.2) Hij = V (Gj − Gi )
where the diagonal elements of H are kinetic energies of the planewaves in the
basis and V (G) is the Fourier component of the periodic pseudo potential with respect
DAVIDSON DIAGONALIZATION METHODS 11

to the reciprocal vector G. In a realistic problem (Density Functional Theory based

calculation, for instance), the eigenproblem of the Hamiltonian has to be solved self-
consistently, which means the Hamiltonian itself (components of the potential energy)
depends on the solutions of the problem (eigenwavefunctions and thus charge density
distribution). To focus on the diagonalization technique itself, only the initial Hamil-
tonian built without the knowledge of charge density distribution is investigated in
this report. In practical calculations, the situation can be much more complex since
the self-consistent problem is essentially a nonlinear eigenproblem, which is beyond
the scope of the course and far beyond the limited knowledge of the author. Most
widely implemented algorithms in popular electronic structure calculation packages
(such as Quantum ESPRESSO, SIESTA, etc.) are Davidson method and nonlinear
conjugate gradient method[17], and the old wisdom in this field is that ”Davidson is
faster, while CG is more stable”. To test and compare the performance of these algo-
rithms, a specific type of non-linear CG algorithm (Teter-Payne-Allen, TPA)[17][18] is
implemented. Besides common features shared among all non-linear CG flavors, TPA
algorithm features an efficient update scheme (instead of common Rayleigh-Ritz type)
and a smartly-designed preconditioner. After the orthonormalized conjugate gradient
rcg is computed at one iteration step, TPA parametrizes the new guess vector as

(3.3) xnew = xold cos θ + rcg sin θ

to maintain the unity of the guess eigenvector. Then the algorithm fits the en-
ergy functional (equivalent to Rayleigh quotient to be minimized) to the following
functional form of θ

(3.4) E(θ) = Eavg + A1 cos (2θ) + B1 sin (2θ)

which turns out to work magically well in this specific type of problems. Then
three function values (and/or the derivative values) of the energy functional at special
points are evaluated to calculate the three unknown coefficients in equation (3.4), and
the minimum point in terms of θ can be found analytically and thus used to update the
guess vector in equation (3.3). Readers are referred to the original paper[18] for the
detailed formulae. Compared to Block Davidson method, the non-linear CG can only
solve eigenpairs one by one sequentially. In this report comparison will only be made
in the case where the lowest eigenpair is of interest, while one has to keep in mind
that since the cost of each iteration step of CG is cheaper than Davidson (especially
when the subspace of Davidson grows relatively large), a more general discussion is
necessary to give a full judgement. Limited by time and space, only the convergence
behavior with respect to number of iteration steps will be discussed in this report.
So far the preconditioning issue has not been discussed. First we notice that
the single-electron Hamiltonian in this problem exhibits a rather special structure of
being “partially” diagonally dominant: diagonal elements corresponding to the kinetic
energy of planewaves with large wavenumber overwhelm the off-diagonal elements
whereas the ones corresponding to the low energy planewaves are actually comparable
or even smaller than the off-diagonal elements. Since it is not diagonally dominant,
original Davidson method (DPR) is not supposed to work super well especially when
the dimension of the problem is large, which can be justified by figure 3.1. In the test,
a Gaussian-type model pseudopotential is used and the calculation takes place at Γ
point (k = 0).
12 BOLIN LIAO

DPR convergence curve for test Hamiltonian (N=100)

−1
10

−2
10

−3
10

−4
10

Normalized norm of residue

−5
10

−6
10

−7
10

−8
10

−9
10

−10
10

−11
10
0 10 20 30 40 50 60
Number of iterations

(a) N=100
DPR convergence curve for test Hamiltonian (N=1000)
−1
10

−2
10

−3
10

−4
10
Normalized norm of residue

−5
10

−6
10

−7
10

−8
10

−9
10

−10
10

−11
10
0 100 200 300 400 500 600
Number of iterations

(b) N=1000

Fig. 3.1: DPR does not work well when directly applied to the planewave Hamiltonian
with N = 1000

Here the precondioner comes into play. TPA devised a specific preconditioner for
planewave Hamiltonian

27 + 18λ + 12λ2 + 8λ3

(3.5) KG,G′ = δG,G′
27 + 18λ + 12λ2 + 8λ3 + 16λ4
2
where λ = (k−G) T
/2
, and T = xT Dx is the kinetic energy of the current guess
vector and D is the diagonal matrix corresponding to H. The amazing properties
of this preconditioner include that it is diagonal , that when kGk is large, KG,G
approaches 1/(2(λ − 1)) with an asymptotic expansion correct to fourth order in 1/λ,
which is equivalent to Jacobi preconditioning for diagonal dominant elements, and
that when kGk is small, KG,G approaches unity, keep the spectrum at low energies
intact. To see how this preconditioner works, the result of a test run using CG
with/without TPA preconditioning with the same initial guess is given in figure 3.2.
DAVIDSON DIAGONALIZATION METHODS 13

Convergenc curve for TPA non−linear CG with/without Preconditioning

0
10
CG with preconditioning
CG without preconditioning

−2
10

−4
10
Normalized norm of residue

−6
10

−8
10

−10
10

−12
10
0 1 2 3
10 10 10 10

Fig. 3.2: Performance of non-linear CG gets improved a lot by preconditioning (N =

1001)

Given the previous discussions interpreting Davidson as a precondioned residue

method, I became curious about what may happen if we combine the TPA precondi-
tioner with the DPR method. Instead of using the original correction vector, I tried
a new way of generating correction vectors as following

(3.6) δ = Kr

And the result shows that this strategy seems to work magically well. Figure
3.3 shows that starting with the same initial guess vector, the number of iterations
needed for TPA preconditioned DPR is around 20, while the original DPR seems to
wander around for a long time before finally settling down. This example illustrates
the possibility of improving performance of Davidson method by preconditioning in
specific problems.
To conclude this section, a comparison between preconditioned Davidson and
nonlinear CG when applied to the same (larger) Hamiltonian with the same starting
vector is given in figure 3.4, which may not be really fair for CG.
4. Summary. This report mainly focuses on understanding the underlying mech-
anism that makes Davidson method work and reviewing different modifications and
extensions of original DPR method. Different flavors of Davidson algorithm are im-
plemented, so is the Blocked Davidson method. In a “not-so-realistic” toy problem,
the possibility of improving Davidson method via preconditioning is explored, and the
result compared with another state-of-the-art method, non-linear conjugate gradient
algorithm (in TPA flavor). Furthur analysis is required to judge the performance of
14 BOLIN LIAO

Convergence curves for DPR with/without TPA preconditioning

0
10
TPA preconditioned DPR
Original DPR

−2
10

−4
10
Normalized norm of residue

−6
10

−8
10

−10
10

−12
10
0 1 2 3
10 10 10 10
Number of iterations

Fig. 3.3: Performance of DPR is improved significantly by TPA preconditioning (N =

1001)

Convergence curves for preconditioned DPR and CG (N=3001)

0
10
TPA preconditioned DPR
TPA CG with preconditioning

−2
10

−4
10
Normalized norm of residue

−6
10

−8
10

−10
10

−12
10
0 1 2
10 10 10
Number of iterations

Fig. 3.4: Comparison between the convergence curves of preconditioned Davidson and
non-linear CG
DAVIDSON DIAGONALIZATION METHODS 15

these two methods, and the result might as well depend on the specific problem. In
realistic applications, the present problem needs to be solved self-consistently, which
makes a totally different story.
Acknowledgements. Firstly the author would like to thank Prof. Johnson for
his wonderfully inspiring lectures, desperately hard problem sets and frustratingly
super hard quiz. And also I want to thank Dr. Keivan Esfarjani in Nanoengineering
group at MIT for helpful discussions.

REFERENCES

[1] E. R. Davidson, Super-Matrix Methods, Comput. Phys. Comm., 53 49 (1989).

[2] Z. Bai et al., Template for the Solution of Algebraic Eigenvalue Problems: a Practical Guide,
SIAM, online book URL: http://web.eecs.utk.edu/ dongarra/etemplates/book.html
[3] E. R. Davidson, The Iterative Calculation of a Few of the Lowest Eigenvalues and Corre-
sponding Eigenvectors of Large Real Symmetric Matrices, J. Comput. Phys., 17 87 (1975).
[4] J. Olsen, P. Jørgensen and J. Simons, Passing the One-Billion Limit in Full Configuration-
Interaction (FCI) Calculations, Chem. Phys. Letters, 169 493 (1990).
[5] C. W. Murray, S. C. Racine and E. R. Davidson, Improved Algorithms for the Lowest Few
Eigenvalues and Associated Eigenvectors of Large Matrices, J. Comput. Physics., 103 382
(1992).
[6] J. H. van Lenthe and P. Pulay, A Space-Saving Modification of Davidson’s Eigenvector
Algorithm, J. Comput. Chem., 11 1164 (1990).
[7] G. L. G. Sleijpen, A. G. L. Booten, D. R. Fokkema and H. A. van der Vorst,Jacobi-
Davidson Type Methods for Generalized Eigenproblems and Polynomial Eigenproblems,
BIT, 36 595 (1996).
[8] G. L. G. Sleijpen and H. A. van der Vorst,A Jacobi-Davidson Iteration Method for Linear
Eigenvalue Problems, J. Matrix Anal. Appl., 17 401 (1996).
[9] H. J. J. van Dam, J. H. van Lenthe, G. L. G. Sleijpen and H. A. van der Vorst,An
Improvement of Davidson’s Iteration Method: Applications to MRCI and MRCEPA Cal-
culations, J. Comput. Chem., 17 267 (1996).
[10] M. L. Leininger, C. D. Sherrill, W. D. Allen and H. F. Schaefer III,Systematic Study
of Selected Diagonalization Methods for Configuration Interaction Matrices, J. Comput.
Chem., 22 1574 (2001).
[11] R. Shepard, A. F. Wagner, J. L. Tilson and M. Minkoff,The Subspace Projected Approx-
imate Matrix (SPAM) Modification of the Davidson Method, J. Comput. Phys., 172 472
(2001).
[12] E. R. Davidson,Comments on the Kalamboukis tests of the Davidson algorithm, J. Phys. A:
Math. Gen., 13 L179 (1980).
[13] E. R. Davidson,Monster Matrices: Their Eigenvalues and Eigenvectors, Comput. in Phys., 7
519 (1993).
[14] B. Liu,The Simultaneous Expansion Method for the Iterative Solution of Several of the Lowest
Eigenvalues and Corresponding Eigenvectors of Large Real-symmetric Matrices, Technical
Report LBL-8158, Lawrence Berkeley Laboratory, University of California, Berkeley (1978).
[15] M. Crouzeix, B. Philippe and M. Sadkane,The Davidson Method, J. Sci. Comput., 15 62
(1994).
[16] R. B. Morgan and D. S. Scott,Generalizations of Davidson’s Method for Computing Eigen-
values of Sparse Symmetric Matrices, J. Sci. Stat. Comput., 7 (1986), pp. 817.
[17] M. C. Payne, M. P. Teter, D. C. Allan, T. A. Arias and J. D. Joannopoulos,Iterative
Minimization Techniques for ab initio Total-Energy Calculations: Molecular Dynamics
and Conjugate Gradients, Rev. Mod. Phys., 64 1045 (1992).
[18] M. P. Teter, M. C. Payne and D. C. Allan,Solution of Schrödinger’s Equation for Large
Systems, Phys. Rev. B, 40 12255 (1989).

Numerical Methods For Least Squares Problems, Second Edition
No ratings yet
Numerical Methods For Least Squares Problems, Second Edition
510 pages
Chapter 4
No ratings yet
Chapter 4
27 pages
Chap 3
No ratings yet
Chap 3
166 pages
Jamshid Ghaboussi, Xiping Steven Wu-Numerical Methods in Computational Mechanics-Taylor & Francis Group, CRC Press (2017) PDF
100% (1)
Jamshid Ghaboussi, Xiping Steven Wu-Numerical Methods in Computational Mechanics-Taylor & Francis Group, CRC Press (2017) PDF
332 pages
Mid Term Model Question Papers - PDF Final
No ratings yet
Mid Term Model Question Papers - PDF Final
6 pages
Symmetric Eigenvalue Problem
No ratings yet
Symmetric Eigenvalue Problem
8 pages
Numerical Methods For Eigenvalue Problems (PDFDrive)
No ratings yet
Numerical Methods For Eigenvalue Problems (PDFDrive)
217 pages
Csir Net Mathematics Info
No ratings yet
Csir Net Mathematics Info
22 pages
MAEconomics
100% (1)
MAEconomics
41 pages
Exposé Davidson Jacobi
No ratings yet
Exposé Davidson Jacobi
17 pages
Full Chapter Finite Mathematics and Applied Calculus 8E Stefan Waner PDF
No ratings yet
Full Chapter Finite Mathematics and Applied Calculus 8E Stefan Waner PDF
53 pages
Full Text PDF
No ratings yet
Full Text PDF
29 pages
Wu Convergence Analysis On Iterative Methods
No ratings yet
Wu Convergence Analysis On Iterative Methods
20 pages
Full Stuck Software Developer
No ratings yet
Full Stuck Software Developer
45 pages
Power Method and Deflation
No ratings yet
Power Method and Deflation
16 pages
Preshius Project
No ratings yet
Preshius Project
39 pages
Slides Ch8 Bài 3. Bài toán trị riêng - Phương pháp lũy thừa
No ratings yet
Slides Ch8 Bài 3. Bài toán trị riêng - Phương pháp lũy thừa
44 pages
Saad Krylov Subspace Methods For Solving Large Unsymmetric Linear Systems
No ratings yet
Saad Krylov Subspace Methods For Solving Large Unsymmetric Linear Systems
23 pages
Chapter02 - 2024-2025 Num - Ana
No ratings yet
Chapter02 - 2024-2025 Num - Ana
23 pages
R18 B.Tech. Aeronautical Engg. Syllabus Jntu Hyderabad
No ratings yet
R18 B.Tech. Aeronautical Engg. Syllabus Jntu Hyderabad
119 pages
Chap3 Ho
No ratings yet
Chap3 Ho
14 pages
LecN16 R
No ratings yet
LecN16 R
11 pages
All Chapter Questions
No ratings yet
All Chapter Questions
42 pages
Bjorck 1988
No ratings yet
Bjorck 1988
12 pages
The New Iteration Methods For Solving Absolute Value Equations
No ratings yet
The New Iteration Methods For Solving Absolute Value Equations
14 pages
GMRES Saad
No ratings yet
GMRES Saad
14 pages
Chapter 10
No ratings yet
Chapter 10
6 pages
Cours 4 Part 2
No ratings yet
Cours 4 Part 2
11 pages
COMP1045 Exam Paper 2021
No ratings yet
COMP1045 Exam Paper 2021
4 pages
Structural Inference in Cointegrated Vector Autoregressive Models
No ratings yet
Structural Inference in Cointegrated Vector Autoregressive Models
197 pages
Numerical Methods For Large Eigenvalue Problems
100% (1)
Numerical Methods For Large Eigenvalue Problems
285 pages
Christmas Exam Chapters
No ratings yet
Christmas Exam Chapters
6 pages
LecN15 R
No ratings yet
LecN15 R
6 pages
Lecture Notes
No ratings yet
Lecture Notes
337 pages
Station Iter
No ratings yet
Station Iter
11 pages
UGBS 301 Course Outline - 2023
No ratings yet
UGBS 301 Course Outline - 2023
4 pages
Chap1 Ho
No ratings yet
Chap1 Ho
4 pages
0610206v3 PDF
No ratings yet
0610206v3 PDF
13 pages
CS365 Optimization Techniques Module4
No ratings yet
CS365 Optimization Techniques Module4
30 pages
Assignment Template
0% (1)
Assignment Template
18 pages
Legendre Polynomials: D PX N PX X NDX
No ratings yet
Legendre Polynomials: D PX N PX X NDX
17 pages
Posholi LE Asssignment 3
No ratings yet
Posholi LE Asssignment 3
7 pages
Assignment 2: Use Arrays To Structure The Raw Data and To Perform Data Comparison & Operations
No ratings yet
Assignment 2: Use Arrays To Structure The Raw Data and To Perform Data Comparison & Operations
6 pages
Chino Poisson 1d
No ratings yet
Chino Poisson 1d
8 pages
GLCM PDF
No ratings yet
GLCM PDF
7 pages
Practice 16 17 18
No ratings yet
Practice 16 17 18
13 pages
Preconditioning Eigenvalues and Some Comparison of Solvers: Ronald B. Morgan
No ratings yet
Preconditioning Eigenvalues and Some Comparison of Solvers: Ronald B. Morgan
15 pages
The Linear Algebra Curriculum Study Group Recommendations - Moving Beyond Concept Definition
No ratings yet
The Linear Algebra Curriculum Study Group Recommendations - Moving Beyond Concept Definition
20 pages
Householder's Method For Approximating Eigenvalues: Azhi Sabir Mohammed & Rebwar Mohammed WSW
No ratings yet
Householder's Method For Approximating Eigenvalues: Azhi Sabir Mohammed & Rebwar Mohammed WSW
69 pages
Analytical Mechanics of Space Systems: Errata List For The AIAA Education Series Text Book
No ratings yet
Analytical Mechanics of Space Systems: Errata List For The AIAA Education Series Text Book
11 pages
Chap 20 Slides
No ratings yet
Chap 20 Slides
28 pages
Improving The Accuracy of Computed Eigenvalues and Eigenvectors
No ratings yet
Improving The Accuracy of Computed Eigenvalues and Eigenvectors
23 pages
Krylov Subspace Methods For Solving Large Unsymmetric Linear Systems (Saad)
No ratings yet
Krylov Subspace Methods For Solving Large Unsymmetric Linear Systems (Saad)
22 pages
Improving The Accuracy of Computed Singular Values
No ratings yet
Improving The Accuracy of Computed Singular Values
8 pages
Math 1010 December 2014 Exam
No ratings yet
Math 1010 December 2014 Exam
13 pages
Newton Raphson
No ratings yet
Newton Raphson
16 pages
Fast Sparse Matrix Multiplication
No ratings yet
Fast Sparse Matrix Multiplication
11 pages
Rayleigh Quotients and Inverse Iteration: Restriction To Real Symmetric Matrices
No ratings yet
Rayleigh Quotients and Inverse Iteration: Restriction To Real Symmetric Matrices
12 pages
Matrix Algorithms Volume II Eigensystems TQW - Darksiderg
0% (1)
Matrix Algorithms Volume II Eigensystems TQW - Darksiderg
490 pages
Conjugate Gradient Method
No ratings yet
Conjugate Gradient Method
50 pages
Gmres Fom Versus QMR Bicg
No ratings yet
Gmres Fom Versus QMR Bicg
24 pages
Notes On The Symmetric QR Algorithm: 1 Subspace Iteration
No ratings yet
Notes On The Symmetric QR Algorithm: 1 Subspace Iteration
21 pages
A Two-Step Jacobi-Type Iterative M e T H o D: Computers Math. Applic. Vol. 34, No. 1, Pp. 1-9, 1997
No ratings yet
A Two-Step Jacobi-Type Iterative M e T H o D: Computers Math. Applic. Vol. 34, No. 1, Pp. 1-9, 1997
9 pages
Solvingsingular Linear Equation
No ratings yet
Solvingsingular Linear Equation
49 pages
Decision Tree
No ratings yet
Decision Tree
18 pages
ECON1202 Quantitative Analysis For Business and Economics 2014
No ratings yet
ECON1202 Quantitative Analysis For Business and Economics 2014
17 pages
Mathematical Properties of Stiffness Matrices CE 131 - Theory of Structures Henri Gavin Fall, 2002
No ratings yet
Mathematical Properties of Stiffness Matrices CE 131 - Theory of Structures Henri Gavin Fall, 2002
8 pages
TG2 Acc115
No ratings yet
TG2 Acc115
12 pages
Shifting Method
No ratings yet
Shifting Method
9 pages
Gujarat Technological University
No ratings yet
Gujarat Technological University
2 pages
Panju Article On Eigenvalues
No ratings yet
Panju Article On Eigenvalues
10 pages
Saint 03 Header
No ratings yet
Saint 03 Header
6 pages
Variational and Weighted Residual Methods
No ratings yet
Variational and Weighted Residual Methods
26 pages
Eigenvalues and Eigenvectors: T 1 N 1 N T T
No ratings yet
Eigenvalues and Eigenvectors: T 1 N 1 N T T
5 pages
Basic Iterative Methods For Solving Linear Systems PDF
No ratings yet
Basic Iterative Methods For Solving Linear Systems PDF
33 pages
Image Compression Using DCT Implementing Matlab
91% (11)
Image Compression Using DCT Implementing Matlab
23 pages
Galerkin Methods
100% (1)
Galerkin Methods
7 pages
Gmres Siam
No ratings yet
Gmres Siam
14 pages
Appendix: 9.1 Functionals and Functional Derivatives
No ratings yet
Appendix: 9.1 Functionals and Functional Derivatives
4 pages
Eigenvalue Problems (Inverse Power Iteration With Shift Routine)
No ratings yet
Eigenvalue Problems (Inverse Power Iteration With Shift Routine)
15 pages
Lecture 23
No ratings yet
Lecture 23
38 pages
1 Iterative Methods For Linear Systems 2 Eigenvalues and Eigenvectors
No ratings yet
1 Iterative Methods For Linear Systems 2 Eigenvalues and Eigenvectors
2 pages
DiagonalIzation Matrix
No ratings yet
DiagonalIzation Matrix
4 pages
Midterm Solutions: 1: Schur, Backsubstitution, Complexity (20 Points)
No ratings yet
Midterm Solutions: 1: Schur, Backsubstitution, Complexity (20 Points)
4 pages
Multiple Integrals, A Collection of Solved Problems
From Everand
Multiple Integrals, A Collection of Solved Problems
Steven Tan
No ratings yet
Real Variables with Basic Metric Space Topology
From Everand
Real Variables with Basic Metric Space Topology
Robert B. Ash
5/5 (1)
Worked Examples in Mathematics for Scientists and Engineers
From Everand
Worked Examples in Mathematics for Scientists and Engineers
G. Stephenson
No ratings yet
A-level Maths Revision: Cheeky Revision Shortcuts
From Everand
A-level Maths Revision: Cheeky Revision Shortcuts
Scool Revision
3.5/5 (8)
Calculus-II (Mathematics) Question Bank
From Everand
Calculus-II (Mathematics) Question Bank
Mohmmad Khaja Shareef
No ratings yet

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.

David Diagonalization

Uploaded by

David Diagonalization

Uploaded by

DAVIDSON DIAGONALIZATION METHOD AND ITS

APPLICATION TO ELECTRONIC STRUCTURE CALCULATION∗

Davidson algorithm is a subspace iterative diagonalization method, which ap-

1. Introduction. Large-scale symmetric eigenvalue problems arise from a vari-

is guaranteed to be achieved in the next iteration (in exact arithmetic).

(1.6) δi = −(Hii − ρ)−1 ri

(1.7) xnew = (H − ρ)−1 x

(1.8) x + δ = (H − ρI)−1 x/ǫ

After operating xT (H − ρI)−1 on both sides on the left, ǫ can be solved as

(1.10) (ρI − H)(x + δ) ≈ ǫx

(1.12) δ = (D − ρI)−1 (−r + ǫx)

By enforcing the orthogonality relation xT δ = 0, ǫ can be solved as

(1.14) (I − xxT )(H − ρI)(I − xxT )δ = −r

(1.15) = PHP + PHQ + QHP + QHQ

(1.16) HSP AM = (BH̄BT + BWT Q + QWXT ) + QH1 Q

(1.17) HSP AM x⊥ = w1 + B(WT x⊥ − BT w1 )

where w1 = H1 x⊥ is the inexpensive matrix-vector product. Here the David-

Convergence Curve of DPR (N=1000) Convergence Curve of Davidson (N=5000)

Normalized norm of residue

(a) N = 1000 (b) N = 5000

2.2. Non-Diagonally-Dominant Matrices. Figure 2.3a demonstrates how

DPR convergence curve (N=1000) IIGD convergence curve (N=1000)

Normalized norm of the residue

(a) DPR (b) IIGD

Fig. 2.2: An example of slow-converging DPR and performance of IIGD applied to

(a) DPR (b) IIGD

Normalized norm of residue

(a) GJD (b) RQII

(a) GJD (b) RQII

Simultaneous Convergen Curve for Blocked Davidson (DPR) (N=1000)

3. Application to a “Quasi-Realistic” Problem. First-principles calcula-

to the reciprocal vector G. In a realistic problem (Density Functional Theory based

(3.3) xnew = xold cos θ + rcg sin θ

(3.4) E(θ) = Eavg + A1 cos (2θ) + B1 sin (2θ)

DPR convergence curve for test Hamiltonian (N=100)

Normalized norm of residue

27 + 18λ + 12λ2 + 8λ3

Convergenc curve for TPA non−linear CG with/without Preconditioning

Fig. 3.2: Performance of non-linear CG gets improved a lot by preconditioning (N =

Given the previous discussions interpreting Davidson as a precondioned residue

Convergence curves for DPR with/without TPA preconditioning

Fig. 3.3: Performance of DPR is improved significantly by TPA preconditioning (N =

Convergence curves for preconditioned DPR and CG (N=3001)

[1] E. R. Davidson, Super-Matrix Methods, Comput. Phys. Comm., 53 49 (1989).

You might also like

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.