0% found this document useful (0 votes)
104 views54 pages

Docking and Scoring

This document discusses molecular docking, which involves predicting how ligand molecules bind to protein receptors. It covers: - Types of docking such as rigid/lock-and-key docking and flexible/induced fit docking. - Popular docking tools like DOCK, AutoDock, GOLD, and GLIDE that use algorithms like genetic algorithms and Monte Carlo sampling. - Factors involved in docking like search algorithms, scoring functions, and high throughput virtual screening to identify potential drug candidates. - Scoring functions aim to rank poses based on factors like shape complementarity, hydrogen bonding, and hydrophobic interactions. Knowledge-based scoring functions are derived from protein-ligand complex statistics.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
104 views54 pages

Docking and Scoring

This document discusses molecular docking, which involves predicting how ligand molecules bind to protein receptors. It covers: - Types of docking such as rigid/lock-and-key docking and flexible/induced fit docking. - Popular docking tools like DOCK, AutoDock, GOLD, and GLIDE that use algorithms like genetic algorithms and Monte Carlo sampling. - Factors involved in docking like search algorithms, scoring functions, and high throughput virtual screening to identify potential drug candidates. - Scoring functions aim to rank poses based on factors like shape complementarity, hydrogen bonding, and hydrophobic interactions. Knowledge-based scoring functions are derived from protein-ligand complex statistics.
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 54

MOLECULAR

DOCKING
Content…

• What is docking ?
• Docking Tools
• Types of Docking
• Kinds of Docking
• Search Algorithm
• Scoring function
• High Throughput Screening, Virtual Screening & Docking
WHAT IS DOCKING….?
DOCKING TOOLS
Docking Software Docking Algorithm
• DOCK Shape fitting
• AutoDock Lamarckian algorithm,
Genetic algorithm
• GOLD Genetic Algorithm
• GLIDE Monte Carlo sampling
• LigandFit Monte Carlo sampling
Types of docking

Lock and Key\Rigid Docking – In rigid docking, both the


internal geometry of the receptor and ligand is kept fixed and
docking is performed.
 Induced fit\Flexible Docking - An enumeration on the
rotations of one of the molecules (usually smaller one) is
performed. Every rotation the surface cell occupancy and
energy is calculated; later the most optimum pose is selected
Rigid Docking
• Historically the first approaches.
• Protein and ligand are fixed.
• Search for the relative orientation of the
two molecules with lowest energy.
• Protein-Protein Docking
• Both molecules usually considered
rigid
• First apply steric constraints to limit
search space and the examine
energetics of possible binding
conformations
Flexible docking
• Protein-Ligand Docking
• Flexible ligand, rigid-
receptor
• Search space much larger
• Either reduce flexible ligand
to rigid fragments
connected by one or several
hinges, or search the
conformational space using
monte-carlo methods or
molecular dynamics
Kinds of Docking

• Bound docking
• Unbound docking
• Global docking
• Local docking
Bound docking and Unbound
docking
•The complex structure is known.
• Individually determined
The receptor and the ligand in the protein structures are
complex are pulled apart and used.
reassembled. •In the unbound docking,
which is a significantly more
•In bound docking the goal is to difficult problem, the
reproduce a known complex where starting coordinates are
the starting coordinates of the taken from the unbound
individual molecules are taken from molecules
the crystal of the complex
Global docking

• The general problem includes a search for the


location of the binding site and a search to figure out
the exact orientation of the ligand in the binding site.
A program that do both makes a Global docking
• Global docking is more demanding in terms of
computational time and the results are less accurate
Local docking

• Sometimes the location of the binding site is known.


In this case we only need to orient the ligand in the
binding site. In this case the problem is called Local
docking
Methodological advances

• Inverse docking-small molecules of interest are


dock into library of receptor.

• Covalent docking-it is used to study the covalent


character between ligand and receptor. It provides
stronger binding affinity that prolongs the duration
of biological effects
Search Algorithm

 Determine all possible optimal conformation for a given complex


(protein-ligand/ protein-protein)
 Calculate the energy of resulting complex & of each individual
interactions.
Conformational search strategies include-
• Systematic method
• Random method
• Simulation method
Systematic Search
• it uses incremental construction and conformational search
databases
• This search algorithm explores all the degree of freedom in a
molecule.
• Ligands are often incremenatlly grown into the active site.
• Step wise or incremental search can be accomplished in
different ways
• While docking various molecular fragments into the active
site region and linking them covalently or alternatively by
dividing dock ligands into rigid (core fragment) and by
flexible(side chain)
Systematic Search Contd…

• Once the rigid core is defined they are dock into the
active site.
Flexible regions are added in an incremental fashion.
Another method of systematic search is use of
library of pre-generated conformations.
library conformations are typically only calculated
once and the search problem is therefore reduced to
rigid body docking procedure.
Random search
• This method operate by making random change to either
single ligand or population of ligand.
• A newly obtained ligand is evaluated on the bases of pre
defined probability function.
• Basic idea is to take into consideration of already explored
area of conformation space.
• To determine if a molecular conformation is accepted or
not, the root mean square value is calculated between
current molecular coordinates and every previously recorded
conformations.
• Random search uses two algorithms-
Monte Carlo algorithm
Genetic algorithm
Simulation Search
• It uses algorithms like molecular dynamics and energy
minimization.
• In this approach, proteins are typically held rigid, and the
ligand is allowed to freely explore their conformational space.
• The generated conformations are then docked successively
into the protein, and an MD simulation consisting of
a simulated annealing protocol is performed.
• This is usually supplemented with short MD energy
minimization steps, and the energies determined from the
MD runs are used for ranking the overall scoring. Although
this is a computer-expensive method (involving potentially
hundreds of MD runs).
Scoring Function

• The evaluation and ranking of predicted ligand conformations


is a crucial aspect of structure-based virtual screening.
• Scoring functions implemented in docking programs make
various assumptions and simplifications in the evaluation of
modeled complexes
• They do not fully account for a number of physical
phenomena that determine molecular recognition — for
example, entropic effects.

contd…
• Affinity scoring functions are applied to the energetically
best pose or n best poses found
for each molecule, and comparing the affinity scores for
different molecules gives their
relative rank-ordering.

• Essentially, following types or classes of scoring functions


are currently applied:
1. Force-field-based scoring
2. Empirical scoring functions
3. Knowledge-based scoring functions
4. Consensus scoring
5. Shape & Chemical Complementary Scores
Classes of scoring function
• Broadly speaking, scoring functions can be divided into the
following classes:
• Forcefield-based
• Based on terms from molecular mechanics forcefields
• GoldScore, DOCK, AutoDock
• Empirical
• Parameterised against experimental binding affinities
• ChemScore, PLP, Glide SP/XP
• Knowledge-based potentials
• Based on statistical analysis of observed pairwise
distributions
• PMF, DrugScore, ASP
Terms in Scoring Functions
Shape & Chemical Complementary
Scores
• Divide accessible protein surface into zones:
– Hydrophobic
– Hydrogen-bond donating
– Hydrogen-bond accepting
• Do the same for the ligand surface
• Find ligand orientation with best complementarity score
Empirical scoring functions
Böhm’s empirical scoring
function
• This scoring function is an empirical scoring function
• Empirical = incorporates some experimental data
• The coefficients (∆G) in the equation were determined using
multiple linear regression on experimental binding data for 45
protein–ligand complexes
• Although the terms in the equation may differ, this general
approach has been applied to the development of many
different empirical scoring functions
contd…
Böhm’s empirical scoring
function
• In general, scoring functions assume that the free
energy of binding can be written as a linear sum of
terms to reflect the various contributions to binding.
• Bohm’s scoring function included contributions
from hydrogen bonding, ionic interactions, lipophilic
interactions and the loss of internal conformational
freedom of the ligand.
Here,

• The ∆G values on the right of the equation are all constants.


• ∆Go is a contribution to the binding energy that does not
directly depend on any specific interactions with the protein
• The hydrogen bonding and ionic terms are both
dependent on the geometry of the interaction, with large
deviations from ideal geometries (ideal distance R, ideal angle
α) being penalized.
Knowledge-based Scoring
Function
• Knowledge-based scoring functions are designed to
reproduce experimental structures rather than binding
energies.
• Free energies of molecular interactions are derived from
structural information on Protein-ligand complexes contained
in PDB.
• Boltzmann-Like Statistics of Interatomic
Contacts suggests:

[
P (s p , s l )= Pref exp - bF (s p , s l )

]
Distribution of interatomic distances is converted
into energy functions by inverting Boltzmann’s law.
Knowledge-based potentials
For example, creating the distributions of ligand carbonyl oxygens to protein
hydroxyl groups:

Ligand Ligand

Ligand O
O O

HO
HO HO Protein

Protein Protein

(imagine the minimum at 3.0Ang)


(Invented) distribution of a particular pairwise interaction

1200
Number of observations

1000

800

600

400

200

0
5

5
0

5
0.

1.

2.

3.

4.

Distance (Angstrom)
Force Field based Scoring
• Molecular mechanics force fields usually quantify the sum of
two energies, the receptor–ligand interaction energy and
internal ligand energy(such as steric strain induced by binding).
• Most force field scoring functions only consider a single
Protein conformation, which makes it possible to omit the
Calculation of internal protein energy, which greatly simplifies
Scoring.
contd…
Nonbonding interactions (ligand-protein):

-van der Waals


-electrostatics
Amber force field
CONSENSUS SCORING
• Consensus scoring combines information from different
scores to balance errors in single scores and improve the
Probability of identifying ‘true’ ligands.
• An exemplary implementation of consensus scoring is
X-CSCORE60, which combines GOLD-like, DOCK-like,
ChemScore, PMF and FlexX scoring functions.
High Throughput Screening, Virtual
Screening & Docking
High Throughput Screening
• Popular approach to target validation.
• Process of testing a large no. of diverse chemical structures to
identify ‘HITS’.
Benefits of HTS:
• Allows screening of thousand of compounds on repeatable
basis.
• More effective drugs can be developed at fast rate.
• Ability to optimize the compound lead selection and
eliminating compounds that do not show measurable activity.
• Reduces time and cost effective.
VIRTUAL SCREENING
• Computational technique used in drug discovery to search
libraries of small molecules in order to identify those
structures which are most likely to bind to a drug target,
typically a protein receptor or enzyme.
• Virtual screening uses computer based methods to discover
new ligands on the basis of biological structures.
• There are two broad categories of screening techniques:
I. Ligand-based and
II. Structure-based
Docking
• Ligand-Protein Docking
Steps:
Step 1: Preparation of Input files
Step 2: Grid Preparation
Step 1: Preparation of input files:
 Ligand preparation:
• Assign charges
• Define rotatable bonds
• Rename aromatic carbons
• Merge non-polar hydrogens
• Write .pdbqt ligand file
• Ligands can be obtained from various databases
like ZINC, PubChem or can be sketched using tools like
Chemsketch
• While selecting the ligand, the LIPINSKY’S RULE OF 5
should be applied.
 Protein preparation:
• -Add essential hydrogens
-Load charges
-Merge lone-pairs
-Add solvation parameters
-Write .pdbqt protein file
• PDB structures often contain water molecules
In general, all water molecules are removed except where it is
known that they play an important role in coordinating to
the ligand.
• PDB structures are missing all hydrogen atoms.
Many docking programs require the protein to have explicit
hydrogens.
contd...
Step 2: Docking Preparation – Grid
 Ligand-protein interaction
energies are pre-calculated and
then used as a look-up table
during simulation
 Grid maps are constructed
based on atoms of interest in
ligand.
Key points…
• rmsd/lb (RMSD lower bound) and rmsd/ub (RMSD upper
bound), differing in how the atoms are matched in the distance
calculation:
• rmsd/ub matches each atom in one conformation with itself
in the other conformation, ignoring any symmetry
• rmsd/lb is defined as follows: rmsd/lb(c1, c2) =
max(rmsd'(c1, c2), rmsd'(c2, c1))
• polar hydrogens are needed in the input structures to correctly
type heavy atoms as hydrogen bond donors.
Professor Dr. Baseri

Director: Mohamad Hosein Lotfi

You might also like

pFad - Phonifier reborn

Pfad - The Proxy pFad of © 2024 Garber Painting. All rights reserved.

Note: This service is not intended for secure transactions such as banking, social media, email, or purchasing. Use at your own risk. We assume no liability whatsoever for broken pages.


Alternative Proxies:

Alternative Proxy

pFad Proxy

pFad v3 Proxy

pFad v4 Proxy