Molecular Modelling USTH
Molecular Modelling USTH
Hamiltonian operator, Ĥ , acts upon the wavefunction, ψ, to generate the evolution of the wavefunction
The Schrodinger equation gives the quantized energies, E, of the system and gives the form of the
Photo-switches are one type of molecular machines, a class of molecules that can be
switched between at least two distinct thermodynamically stable forms by the
application of an external stimulus. hence are ideal tools for different research areas
spanning from chemical biology to smart materials
Applications
Applications
Biochemistry
Lecture 2
Molecular Mechanics
Molecular mechanics methods are based on the following principles:
・Nuclei and electrons are lumped into atom-like particles
Atom-like
・Atom-like particles are spherical (radii obtained from measurements or theory) and
・Interactions determine the spatial distribution of atom-like particles and their energies.
fi
Bond stretch energy
• The bond stretch term describes energy changes of bond lengths with respect to their
equilibrium values
Examples:
• C-C bond:
ai= 50 kcal/mol/Å2
lo =1.5 Å2
• C=C bond:
ai= 100 kcal/mol/Å2
lo =1.3 Å2
• The force eld parameters of bond stretch are usually parametrised from spectroscopy
fi
fi
Bond angle energy
• The bond angle term describes energy changes of bond angles with respect to their
equilibrium values
• The force eld parameters of bond angle are usually parametrised from spectroscopy
fi
fi
Torsional angle energy
• The torsional angle term describes energy changes associated with rotations around
chemical bond
• The force eld parameters of torsional angle are usually parametrised from spectroscopy
fi
fi
Van der Waals energy
• The van der Waals term describes the repulsive and attractive inter-atomic forces.
• Force eld parameters: εij and σij
• The force eld parameters of van der Waals are usually parametrised from
small molecular crystals
fi
fi
Electrostatic energy
U= +
+
+ +
+
The Force Field
These total potential functions together with the data (parameters) required to describe
the behavior of different kinds of atoms and bonds, is called a “FORCE FIELD”.
The molecular mechanics “FORCE FIELD” relates the motions, and energies of
motions of atoms within the molecule. The force eld is used to govern how the
parts of a molecule relate to each other, that is, how each atom or group of atoms is
affected by its environment, and how these forces contribute to the structure of the
molecule.
Many different kinds of force- elds have been developed over the years.
Some include additional energy terms that describe other kinds of deformations.
Some force- elds account for coupling between bending and stretching in adjacent
bonds in order to improve the accuracy of the mechanical model.
fi
fi
fi
Popular molecular mechanics force elds
AMBER (Assisted Model Building and Energy Re nement) - widely used for proteins
and DNA
CHARMM - originally developed at Harvard, widely used for both small molecules and
macromolecules
GROMACS - The force eld optimized for the package of the same name GROMOS
- A force eld that comes as part of the GROMOS (GROningen MOlecular Simulation
package), a general-purpose molecular dynamics computer simulation package for the
study of biomolecular systems.
GROMOS force eld (A-version) has been developed for application to aqueous or
apolar solutions of proteins, nucleotides and sugars. However, a gas phase version (B-
version) for simulation of isolated molecules is also available
fi
Force Fields Parameters
In addition to the functional form of the potentials, a force eld typically de nes a set
of parameters for each of a number of atom or particle types that correspond to
different atoms and bonding patterns in commonly simulated molecules.
The parameter set includes values for atomic mass and partial charge for individual
atoms, and equilibrium bond lengths and angles for pairs, triplets, and quadruplets of
bonded atoms.
Example:
AMBER force eld
parameters of Phenylalanine
residue
fi
fi
fi
Molecular Mechanics Force Fields:
Parameterization
1. PartialCharge
HF/6-31G* RESP (derive charges to reproduce ab initio ESP)
fi
Force Field Comparison: Examples
Force Field Comparison: Examples
Force Field Comparison: Examples
Lecture 3
Polarizable Molecular Mechanical Force Field
Polarizable Molecular Mechanical Force Field
Polarizable Molecular Mechanical Force Field
Polarizable Molecular Mechanical Force Field
The use of Coulomb’s law with xed atomic charges to treat the electrostatic
interactions is a major simpli cation in current force elds.
Describe electrostatic interaction in a more physical fashion
Suitable to study heterogeneous systems, dielectric continually changed
system, highly charged systems.
0
∑
Vpol = − 1/2 μpEp
p
fi
fi
fi
Some Polarizable Force Fields
Comparison of Polarizable and Additive Force Fields
Comparison of Polarizable and Additive Force Fields
Lecture 5
Implicit Solvent
Why Solvent ?
Many reactions take place in solution
Consider
Water models:
SPC/E, TIP3P, TIP4P, TIP5P
Can we ignore solvent in simulations?
We can’t ignore solvent, but consider solvent implicitly
Strategy to construct an implicit solvent model
1 1 1 qiqj
ΔGelec = − ( −
2 ϵin ϵout ) rij2 + rirj exp(−rij2 /4rirj)
Here we used Estate and Esolvent=0, and two-body and four-body Ehb (OPEP)
The Coarse-grained OPEP Model
The Coarse-grained OPEP Model
PEP-FOLD 1 RESULTS
52 peptides and 13 miniproteins of 9-23 and 27-49 amino acids: RMSd = 2.3 and 3.7 Å
Maupetit et al., Nucleic Acid Res (2009) and J. Comput Chem. (2010)
Lecture 7
Molecular Dynamics Simulation
Molecular dynamics simulation
• No explicit electrons
• No polarisation, electron transfer or correlation
• Conformational energies for ground state
• No chemistry
• Not variational bound
Molecular dynamics simulation: the Principle
Potential functions
Bonded interactions
Non-bonded interactions
Bond stretch
Electrostatic
Torsional angle
fi
fi
Bond stretch energy
• The bond stretch term describes energy changes of bond lengths with respect to their
equilibrium values
Examples:
• C-C bond:
ai= 50 kcal/mol/Å2
lo =1.5 Å2
• C=C bond:
ai= 100 kcal/mol/Å2
lo =1.3 Å2
• The force eld parameters of bond stretch are usually parametrised from spectroscopy
fi
fi
Bond angle energy
• The bond angle term describes energy changes of bond angles with respect to their
equilibrium values
• The force eld parameters of bond angle are usually parametrised from spectroscopy
fi
fi
Torsional angle energy
• The torsional angle term describes energy changes associated with rotations around
chemical bond
• The force eld parameters of torsional angle are usually parametrised from spectroscopy
fi
fi
Van der Waals energy
• The van der Waals term describes the repulsive and attractive inter-atomic forces.
• Force eld parameters: εij and σij
• The force eld parameters of van der Waals are usually parametrised from
small molecular crystals
fi
fi
Electrostatic energy
• The total potential energy of a system is the sum of all energy terms
U= + +
+ +
• Popular force elds such as OPLS, CHARMM, AMBER have the same
potential functional form, but different in the parameters
• In the TP course, you will use and learn more about the AMBER force eld
fi
fi
Molecular dynamics
• Having described the potential energy of the system, the force acting on
the i-th atom is calculated from the potential energy as:
• Use this force to calculate the next position of the i-th atom
The Verlet Algorithm
• Our molecule may have steric clashes, inappropriate geometry, atoms are not
proper arranged…, therefore molecular structure is not energetically favourable.
• Before beginning dynamics, energy minimization ensures that the system nds
appropriate starting structure. The molecule is relaxed during this process
fi
Energy minimisation
Energy minimisation
fi
fi
Energy minimisation: steepest descent
Energy minimisation: conjugate gradient
Mimic experimental conditions
• Suppose that an experiment was done at temperature T=300 K. How can the simulation
mimic this experimental condition?
==> The initial velocity of atoms is generated from the Maxwellian distribution for the
required temperature T=300 K
Mimic experimental conditions: thermostat
Thermostat:
• The velocity of each atom is rescaled during the simulation (Berendsen thermostat):
Δt T(t)
Vnew = vold (1 + [ − 1])
τ T0
•Popular thermostats:
Berendsen thermostat,
Nose-Hoover thermostat
Bussi-Donadio-Parrinello thermostat
Barostat:
• The simulation box sizes are rescaled during the simulation (Berendsen thermostat)
βΔt
Lnew = Lold(1 − [P(t) − P0])1/3
τp
•Popular thermostats:
Berendsen thermostat
Parrinello-Rahman thermostat
• We are interested in experimental results done in bulk environment which are not
affected by the system surfaces.
• Surface effects are eliminated using the so called "periodic boundary conditions”
• PBC makes it possible to approximate an in nite system by using a small part (unit cell)
No PBC With PBC
• A particle that leaves the cell on the one side is replaced by a copy entering the cell
on the opposite side.
fi
Treat long-range interactions
Treat long-range interactions
Choice of time step
Method for increasing time step
• One way to increase the time step is to eliminate the fastest motions from the system.
• The so-called SHAKE algorithm is widely used to effectively x fast vibrating bonds
and angles at their equilibrium values.
fi
Lecture 8
Enhanced Sampling Methods
Conformational sampling: the problem
5 - 20 kJ/mol
Good conformational sampling
fi
How to obtain good sampling?
Strategy 2: Coarse-grained models
ails
det
ultra coarse-grained
coarse-grained
cale
m e s
, t i
size
tem
all-atom sys
How to obtain good sampling?
Strategy 3: Accelerate the motion of molecule (how?)
500 K Rep3
temperature
exchange MD
400 K Rep2
MD
exchange
300 K Rep1
MD
simulation time
normal sampling
enhanced sampling
500 K f3
temperature
e c t
re j
exc
e 400 K f2 400 K f2 ha
ang MD MD
ng
e
ch
ex
300 K f1 400 K f2
MD MD
simulation time
At one temperature, ST is just a normal MD: no problem with large
systems...
but how to choose fm prior the simulation ?
Simulated tempering: 37-residue WW domain
Simulated tempering
• AMBER99SB*-ILDN
Starting • 48 temperatures: 340-485 K
• ST: 1 μs
Simulated tempering
• AMBER99SB*-ILDN
Starting • 20 temperatures: 310-400 K
• ST: 1 μs
+ =
Metadynamics
•Metadynamics is a technique for enhancing conformational sampling in
molecular dynamics simulations along some collective variables.
A. Laio & M. Parrinello. Proc. Natl. Acad. Sci. 99, 12562 (2002)
fi
The partition function
Boltzmann factor
An example
U1 exp(−Ui /kBT )
Pi =
U0 2
∑i exp(−Ui /kBT )
2 kcal/mol
0
r
exp(−0/kBT ) exp(−2/kBT )
P1 = P2 =
exp(−0/kBT ) + exp(−2/kBT ) exp(−0/kBT ) + exp(−2/kBT )
At temperature T = 300 K, kBT = 0.6 kcal/mol. What are the values of P1 and P2?
Macrostate
Average of a quantity in practice
Brief Theory of Monte Carlo Simulation
Potential
Always accept accept with probability
E E’
E’ E
0 0
r r
Transition probability from state E to state E’:
1 if E′ − E ⩽ 0,
{e
W(E − > E′) = −(E′ − E)
kBT
if E′ − E > 0




fi
Metropolis criterion
The Metropolis criterion is used to decide if a new con guration is accepted or not
fi
Molecular dynamics versus Monte Carlo
Practice Course
Molecular dynamics simulation with GROMACS
Why GROMACS?
(Groningen Machine for Chemical Simulations)
• All basic techniques which you learn from this simulation can be used to
simulate any other systems.
Basic steps
of MD
MD simulation of Trialanine
mkdir ALA3
cp ala3.pdb ./ALA3
[ bonds ]
Structure of the topology le
; ai aj funct c0 c1 c2 c3 Indices for calculation of bond
1 2 1
stretching potential energy
1 3 1
[ pairs ]
; ai aj funct c0 c1 c2 c3
1 8 1 1-4 pairs for Lennard-Jones and
1 9 1 Coulomb interactions
[ angles ]
; ai aj ak funct c0 c1 c2 c3
2 1 3 1 Indices for calculation of angle
2 1 4 1 bending potential energy
[ dihedrals ]
; ai aj ak al funct c0 c1 c2 c3 c4
2 1 5 6 9
2 1 5 7 9 Indices for calculation of dihedral
[ dihedrals ]
angle bending potential energy
; ai aj ak al funct c0 c1 c2 c3
5 13 11 12 4
11 15 13 14 4 Indices for calculation of improper
15 23 21 22 4
dihedral angle bending potential
; Include Position restraint file energy
#ifdef POSRES
#include "posre.itp"
#endif
#include "amber99sb-ildn.ff/tip3p.itp"
Topology file for water molecule
#ifdef POSRES_WATER
; Position restraint for each water oxygen
[ position_restraints ]
; i funct fcx fcy fcz
1 1 1000 1000 1000
#endif
[ system ]
; Name
Generated by trjconv : Protein in water t= 0.00000
[ molecules ]
; Compound #mols
Protein 1
fi
MD simulation of Trialanine
•Create a box around our peptide that we can later fill with solvent.
•The peptide is centered in the simulation box with a minimum distance to
any wall of 1.0 nm.
Input Output
Coordinate file Coordinate file
vmd ala3-box.gro
•Add water to the box that we just generated using the command
•Tool: gmx solvate
vmd ala3-solvated.gro
•All parameters are listed in a .mdp file (files em.mdp and eq.mdp provided).
MD simulation of Trialanine
Step 7: Energy minimization
•This will relax some unnatural distances between atoms which are
accidentally too close to each other during the setup.
•GROMACS uses the gmx grompp tool to combine the coordinate (ala3-solvated.gro)
and topology (ala3.top) files with parameters from the (em.mdp) file to
generate a single input file (em.tpr) file for the minimization.
MD simulation of Trialanine
•All files are merged into a single file with tool: gmx grompp
Input Output
file Coordinates file
Output
(Last structure)
Energy file
Output
Log file
Output
Trajectory file
MD simulation of Trialanine
Exercise 1:
To see how the potential energy of the system changes during the
minimization, you can extract the potential energy from file (em.edr)
using the command energy:
xmgrace potential.xvg
• How does the potential energy behave? Report the initial and final energy values.
•Try with the conjugate gradient method by specifying: integrator = cg in the em.mdp file.
Then, run the energy minimization again, and compare the potential energy with that obtained
by “steep” minimiser. Which algorithm is more efficient?
MD simulation of Trialanine
Step 8: Equilibration
•Give the system a desired temperature and pressure.
•Relax the simulation box to a desired density
•GROMACS uses the gmx grompp tool to combine the energy minimised coordinate
(em.gro) and topology (ala3.top) files with parameters from the (eq.mdp) file
to generate a single input file for the equilibration.
MD simulation of Trialanine
Step 8: Equilibration
•Give the system a desired temperature and pressure.
•Relax the simulation box to a desired density
•GROMACS uses the gmx grompp tool to combine the energy minimised coordinate
(em.gro) and topology (ala3.top) files with parameters from the (eq.mdp) file
to generate a single input file for the equilibration.
MD simulation of Trialanine
•All files are merged into a single file with tool: gmx grompp
Input
file Output Output
Energy file Coordinates file
Output (Last structure)
Log file
Output
Trajectory file
MD simulation of Trialanine
•Visualize the temperature, pressure and energy using the gmx energy tool:
Exercise 2:
Exercise 3: You will run a production simulation with the following requirements:
1. Starting from the equilibrium structure obtained from Step 8.
2. The time length of the trajectory is 1 ns
3. Use the velocity-verlet algorithm for integration
4. Because the simulation box was adjusted to the correct size in the
equilibrium step, we do not need a barostat (turn off barostat).
5. We use the v-rescale thermostat.
6. We need a simulation at room temperature of 300 K.
7. Because atoms already had velocities from equilibration step, we do not
need to generate velocities.
•After modification of file (eq.mdp) taking into account 1-7, you save the
file with name (md.mdp), and follow simulation steps:
You can visualize the trajectory by using the vmd command of VMD:
vmd md-traj.gro
xmgrace rmsd.xvg
You will see that the RMSD changes in time, indicating the peptide
explores different structures.
[ distance ]
x y
where x and y are the indices of the first and last Cα atoms in the coordinate file Then, use the gmx dista
tool:
The generated data is saved in table-form into the distance.xvg file and can be plotted:
xmgrace distance.xvg
Case Study: Amyloid β peptide
•Initial structure of the peptide is taken from Protein Data Bank (PDB)
https://www.rcsb.org
with the PDB code: 1zOq
trapped
here
At 300 K:
KBT ≈ 2.5 kJ/mol
5 - 20 kJ/mol
Good conformational sampling
At 500 K:
KBT ≈ 4.2 kJ/mol
4 - 10 kJ/mol
500 K Rep3
temperature
exchange MD
400 K Rep2
MD
exchange
300 K Rep1
MD
simulation time
T0 = 300 K
T1 = 302 K
T2 = 305 K
T3 = 309 K
T4 = 313 K
T5 = 318 K
T6 = 323 K
…
…
T20 = 400 K
•To run REMD with GROMACS, you first create different parameter files which
are only different in temperatures.
•Take the md.mdp file you used in previous simulation, and rename it to md0.mdp,
md1.mdp, md2.mdp and md3.mdp etc. Then change the temperature:
You will see that the RMSD changes more at higher temperature replicas,
[ distance ]
x y
where x and y are the indices of the first and last Cα atoms in the coordinate file.
Then, use the gmx distance tool:
The generated data is saved in table-form into the distance.xvg file and can be plotted:
ails
det
Coarse-grained model
All-atom model
cale
me s
, t i
size
stem
sy
Today job
construct and simulate this system using coarse-grained force field
The coarse-grained MARTINI force field
The lipid and peptide of our system
We use the CHARMM-GUI server to
construct the system and input files
www.charmm-gui.org
Then choose: membrane builder ==> bilayer builder
Upper layer
Peptide
Lower layer
Please download this file
Having obtained coordinator le, map le and topology le, you can
perform simulation with GROMACS as you did for ala3 peptide
Provided by
CHARMM-GUI
CHARMM-GUI
After equilibration
Real simulation
Analyze the simulated trajectories
[ system ]
; name
Martini system
[ molecules ]
; name number
PROA_P 1
DOPC 100
W 619
NA 8
CL 12
https://www.drugdesign.org/chapters/molecular-dynamics/