Professional Documents
Culture Documents
Michel Cuendet
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 1
Plan
• Introduction
• Setting up a simulation
• Connection to
statistical mechanics
• Usage of MD simulation
Why we do simulation
simulation
In some cases, experiment is :
1. impossible Inside of stars
Weather forecast
2. too dangerous Flight simulation
Explosion simulation
3. expensive High pressure simulation
Windchannel simulation
4. blind Some properties cannot be
observed on very short time-scales
and very small space-scales
Molecular modeling
What is Molecular Modeling?
Computational tools
• Quantum Mechanics (QM)
Electronic structure (Schrödinger) 10-100 atoms
– ACCURATE 10-100 ps
– EXPENSIVE ! small system
Types of phenomena
Goal : simulate/predict processes such as
1. polypeptide folding
thermodynamic equilibria
2. biom
biomol olec
ecul
ular
ar as
asso
soci ciat
atiion
governed by weak
3. part
pa rtit
itio
ioni
ning
ng be
betw
twee
eenn so
solvlven
ents
ts
(non-bonded) forces
4. memb
me mbrarane
ne//mi
mice
cell
llee fo
form
rmat atio
ionn
5. chemic
chem ical
al rea
react
ctio
ions
ns,, enzy
enzyme
me cat
catal
alys
ysis
is
6. enzyme catalysis chemical transformations
7. phot
photoc
oche
hemi
mica
call rac
racti
tion
ons,
s, ele
elect
ctro
ronn tra
transf
nsfer
er governed by strong forces
characteristics (1-4):
- degrees of freedom: atomic (solute + solvent) classical MD
- equations of motion: classical dynamics
- governing theory: statistical mechanics
characteristics (5-7): quantum MD
- degrees of freedom: electronic, nuclear
- equations of motion: quantum dynamics
- governing theory: quantum statistical mechanics
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 6
Complexation Partitioning
Plan
• Introduction
• Setting up a simulation
• Connection to
statistical mechanics
• Usage of MD simulation
Degrees of freedom:
freedom:
atoms as elementary
particles + topology
topology
Forces or
interactions
between atoms Boundary conditions
conditions
MOLECULAR
MODEL
system
Force field = temperature
physico-chemical
Methods to generate
generate pressure
knowledge
walls
configurations of external forces
atoms: Newton
Goals -ofDefinition
classicalof(semi-empirical)
empirical potentialforce
energyfields
functions V( rr ) to model the
molecular interactions
- These functions need to be differentiable in order to compute the
=-! V( rr )
forces acting on each atom: F =-
r
2 r 0
=
E bond K b (r " r0 )
Type: CT1-CT1-CT3
Angles
2 !0
E angle K" (" # " 0 )
=
E = K 1 + cos n #
dihedral "
[ ( " $ )]
Type: OC-OC-CT1-CC
Improper angles
O O
C
C
H N R
"0
2
2
E improper K " (# $ # 0 )
=
,% r ( % r ( /
,
% r (
% r ( /
r m : distance at min r m
=
1 6
2 "
Type: CT3-CT3
Repulsive :
Pauli exclusion principle
1
" 12
r
# Attractive:
% induced dipole / induced dipole
EVdW
1
"# 6
r
r
m
Electrostatic interactions
Derived Interactions
Some from
result interactions
the twoare often referred
interactions to as particular
previously described,interactions, but they
i.e. the electrostatic
and the van der Waals interactions.
'(
1) Hydrogen bonds (Hb) & A
2) Hydrophobic effect
/) r ,
12
) rm , 2 6
qiq j
+ # ( 1+ . " 2+ . 4 + #
m
>
* r -
i j 0
* r - 3 i j 4 5( ( r
> 0
For a system with 1500 atoms
Introduction of cutoff
S(r) differentiable
cutonnb cutnb
Effect of cutoff
No cutoff 8 Å cutoff
Elec Elec
VdW VdW
Total Total
2 2
2
12 6
qiq j
E = K b ( r r ) + K ( K [1 + cos( n K ( ) + 4 r r + 4 r
bonds
0
) + dihedrals
angles 0
)] + impropers
0
i j >
i j
> 0
Solvation
Fundamental influence on the structure, dynamics and
thermodynamics of biological molecules
Effect through:
+
• Solvation of charge
%=1
• Screening of charge - charge interactions %=80
qi q j -
=
E elec 4 "# 0# ( r) r
• Hydrogen bonds between water molecules and
polar functions of the solute
Implicit solvation
Screening constant
" = Nr
N=4,8. The dielectric constant is a function of atom distance.
Mimic screening effect of solvent. Simple, unphysical but efficient.
• Poisson Boltzmann
B oltzmann (PB) equation.
{ }
" • # ( r )"$ ( r ) % & 'sinh
'sinh $ ( r ) [ ] = %4 '( ( r ) f(r) : electrostatic potential,
r(r) : charge density
solvation energy
Others: EEF1, SASA, etc...
Simply modelled
proportional to thebysolvent
a non-polar solvation
accessible (free)
surface areaenergy
(SASA)term,
:
) = 0.00542 kcal mol1 Å2
"G np ,
SASA + b
SASA
solv = # kcal mol-1
b = 0.92 kcal
Solvent accessible
Surface (SASA)
Connolly
(Contact)
Solvent accessible
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 24
Limitations of classical MD
Problems Solutions
1) Fixed set of atom types
2) No electronic polarization:
- fixed partial charges allow Fluctuating charges treated as
•
for conformational polarization
polarization
but not electronic polarization Charges
•
dynamicalonparameters
springs representing
e- clouds
• QM-MM
• Full QM simulations
values
- no bond creation or deletion
r
Centre of mass
A
A1 – A4
Centre of mass B
B1 – B4
Centre of mass C
C1 – C4
Centre of mass D
D1 – D4
Centre of mass
W
W1 – W4
• CHARMM
CHARMM (E / I; AA / UA), Amber
www.charmm.org
• Gromacs
www.gromacs.org Amber, Gromos, OPLS - (all E)
• NAMD
CHARMM, Amber, Gromos, ...
www.ks.uiuc.edu/Research/namd
E = explicit solvent AA = all atom
I = implicit solvent UA = united atom (apolar H omitted)
Plan
• Introduction
• Setting up a simulation
• Connection to
statistical mechanics
• Usage of MD simulation
28
Molecular Dynamics Simulation Michel Cuendet EMBL 2008
1) Topological properties:
2) Structural properties:
3) Energetical properties:
a force field
field describing
describing the force acting on each atom of the molecules
MD flowchart
Initial coordinates (X-ray, NMR)
Temp. Ok?
Rescale velocities Structure Ok?
Analysis
Calculation of trajectory
trajectoryvalues
of macroscopic
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 http://www.ch.embnet.org/MD_tutorial/ 30
Coordinate files
Cartesian coordinates
(x,y,z)
ATOM 1 N VAL 1 -0.008 -0.022 -0.030 1.00 0.00 PEP
ATOM 2 HT1 VAL 1 -0.326 0.545 0.778 1.00 0.00 PEP
ATOM 3 HT2 VAL 1 -0.450 -0.956 -0.084 1.00 0.00 PEP
ATOM 4 HT3 VAL 1 -0.172 0.566 -0.876 1.00 0.00 PEP
ATOM 5 CA VAL 1 1.477 -0.077 0.073 1.00 0.00 PEP
ATOM 6 HA VAL 1 1.777 -0.598 0.971 1.00 0.00 PEP
1
2 1
1 N
N 1
1 C
C 1
1 *CA
*CA 1
1 CB
HA 1.4896
1.4896 105.11 -118.25
105.11 117.88 111.68
108.30 1.5353
1.0807
3 1 N 1 CA 1 CB 1 CG1 1.4896 108.86 176.05 110.92 1.5421
4 1 CG1 1 CA 1 *CB 1 CG2 1.5421 110.92 121.67 110.44 1.5454
5 1 CG1 1 CA 1 *CB 1 HB 1.5421 110.92 -118.15 109.36 1.1177
6 1 CA 1 CB 1 CG1 1 HG11 1.5353 110.92 56.96 111.11 1.1099
7 1 HG11 1 CB 1 *CG1 1 HG12 1.1099 111.11 -119.80 110.
110.60
60 1.1134
8 1 HG11 1 CB 1 *CG1 1 HG13 1.1099 111.11 120.81 110.60 1.1103
It is possible to calculate
from missing
the existing cartesian
ones and the ICcoordinates
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 31
The residue
and the topology
standard file (RTF)
topology of
topology contains the atom types
of residues.
No. atom
type mass atom
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 32
sequence .
Easy construction of the protein topology from the sequence.
R O R O
R
Residue definition in
NH NH NH
NH NH CHARMM
O R O R
O
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 33
Total charge
RESI ALA 0.00
GROUP Atom name
ATOM NN NH1 -0.47 ! |
ATOM HN H 0.31 ! HN-N
ATOM CA CT1 0.07 ! | HB1 Atom type
ATOM HA HB 0.09 ! | /
GROUP ! HA-CA--CB-HB2
ATOM CB CT3 -0.27 ! | \ Atom charge
ATOM HB1 HA 0.09 ! | HB3
ATOM HB2 HA 0.09 ! O=C
ATOM HB3 HA 0.09 ! | Bond
GROUP !
ATOM C C 0.51 Improper
ATOM O O -0.51
BOND CB CA N HN N CA angle
BOND C CA C +N CA HA CB HB1 CB HB2 CB HB3
DOUBLE O C
IMPR N -C CA HN C CA +N O
Previous residue
DONOR HN N
ACCEPTOR O C Next residue
IC -C CA *N HN 1.3551 126.4900 180.0000 115.4200 0.9996
IC -C
-C N CA C 1.3551 126.4900 180.0000 114.4400 1.5390
IC N CA C +N
+ 1.4592 114.4400 180.0000 116.8400 1.3558
IC +N CA *C O 1.3558 116.8400 180.0000 122.5200 1.2297
IC CA
IC N
C
C
+N +CA
*CA CB
1.5390 116.8400 180.0000
1.4592 114.4400 123.2300
126.7700
111.0900
1.4613
1.5461 IC
IC N C *CA HA 1.4592 114.4400 -120.4500 106.3900 1.0840
IC C CA CB HB1 1.5390 111.0900 177.2500 109.6000 1.1109
IC HB1 CA *CB HB2 1.1109 109.6000 119.1300 111.0500 1.1119
IC HB1 CA *CB HB3 1.1109 109.6000 -119.5800 111.6100 1.1114
Patching residues
Treat protein
To make special
N- and features.
C-termini:
PRES NTER 1.00 ! standard N-terminus
GROUP ! use in generate statement
ATOM N NH3 -0.30 !
ATOM HT1 HC 0.33 ! HT1
ATOM HT2 HC 0.33 ! (+)/
ATOM HT3 HC 0.33 ! --CA--N--HT2
ATOM CA CT1 0.21 ! | \
ATOM HA HB 0.10 ! HA HT3
DELETE ATOM HN
BOND HT1 N HT2 N HT3 N
DONOR HT1 N
DONOR HT2 N
DONOR HT3 N
IC HT1 N CA C 0.0000 0.0000 180.0000 0.0000 0.0000
IC HT2 CA *N HT1 0.0000 0.
0.0000
0000 120.0
120.0000
000 0.000
0.0000
0 0.0000
IC HT3 CA *N HT2 0.0000 0.
0.0000
0000 120.0
120.0000
000 0.000
0.0000
0 0.0000
Contains
Bonds:
field parameters.
force field parameters. file : par_all22_prot.inp
: par_all22_prot.inp
BONDS
!
!V(bond) = Kb(b - b0)**2
!
!Kb: kcal/mole/A**2
!b0: A
Kb
!
!atom type Kb b0
!
!Carbon Dioxide
CST OST 937.96 1.1600 ! JES
!Heme to Sulfate (PSUL) link
SS FE 250.0 2.3200 !force constant a guess
!equilbrium bond length optimized to reproduce
!CSD survey values of
!2.341pm0.01 (mean, standard error)
!adm jr., 7/01
r 0
C C 600.000 1.3350 ! ALLOW ARO HEM
! Heme vinyl substituent (KK, from propene (JCS))
CA CA 305.000 1.3750 ! ALLOW ARO
! benzene, JES 8/25/89
CE1 CE1 440.000 1.3400 !
! for butene; from propene, yin/adm jr., 12/95
CE1 CE2 500.000 1.3420 !
! for propene, yin/adm jr., 12/95
CE1 CT2 365.000 1.5020 !
! for butene; from propene, yin/adm jr., 12/95
CE1 CT3 383.000 1.5040 !
! for butene, yin/adm jr., 12/95
CE2 CE2 510.000 1.3300 !
(PSF))
The protein structure file (PSF
It Residues
•
contains and
all the information
segments. Howneeded for future
the system simulations :
is divided into :
simulations
residues and segments.
• Atom information. Names, types, charges, masses.
• Bond, angle, dihedral and improper dihedral lists
• Electrostatic groupings. How some numbers of atoms are grouped
for the purpose of calculating long range electrostatic
A segment
segment is
is a group of molecules, for example:
one single protein
•
a collection of ions
•
a ligand
•
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 38
Energy landscape
"
&
3 N Spatial coordinates
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 40
Finding minimumenergy
given a potential energyfunction:
conformations
2
" E " E
= 0 and > 0
2
" x i " x i
Used to:
•
"
•
relieve strain in experimental
find (energetically)
conformations
stable states
*
MINI
EXTERN>
---------- 6.18507
--------- -41.85983
--------- 0.00000
--------- 0.00000
--------- 0.00000
---------
MINI> 200 -28.09805 0.00062 0.00826 0.00005
MINI INTERN> 0.92928 3.80437 0.58415 2.18679 0.06957
MINI EXTERN> 6.19000 -41.86221 0.00000 0.00000 0.00000
---------- --------- --------- --------- --------- ---------
mo tion:
Newton’s law of motion: MD algorithm
dE i
Fi = m ia i = " calculate
dr i forces F ( t )
t = t + " t
t
In discete Verlet
Example:
time : integration
algrorithm algorithm.
update
1 2 r ( t + " t
t )
r ( t + " t
t ) = r ( t ) + v ( t ) # " t
t + a( t ) # " tt
2
1 2
" # t
r ( t " t ) = r ( t ) " v ( t ) $ # tt + a( t ) $ # tt
2 (t ~ 1 fs = 10 -15 s
F ( t ) 2
r ( t + " t
t ) = 2 r( t ) # r (t # " t t ) + $ " tt
m
Propagation of time: position at time t+dt is a determined by position at time t and t-dt, and by
the acceleration at time t (i.e., the forces at time t)
The equations of motion are determinis
deterministic,
tic, e.g., the positions and the velocities at time zero
determine the positions and velocities at all other times, t.
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 43
!------ MD simulation
OPEN WRITE UNIT 31 CARD NAME traj/dyna.rs
UNIT traj/dyna.rst
t ! Restart file Control output
OPEN WRITE UNIT 32 FILE NAME traj/dyna.dc
UNIT traj/dyna.dcd
d ! Coordinates ffile
ile
OPEN WRITE UNIT 33 CARD NAME traj/dyna.en
UNIT traj/dyna.ene
e ! Energy file
Output of MD simulation
[...]
[...]
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 45
i 2mi
˙
p F = F (( r)
In practice, constant energy dynamics is not often used for two reasons :
1) Inaccuracies of the MD algorithm tend to heat
to heat up
up the
the system
We can couple the system to a heat reservoir to absorb the excess heat
cutoffs
constraints
barostat
Plan
• Introduction
• Setting up a simulation
• Connection to
statistical mechanics
• Usage of MD simulation
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 47
Thermodynamic
Thermodynamic ensembles
A macroscopic state is
state is described by :
Z is
is the partition function,
=
" # Ej P = 1
i
Z
$ e
j
such that :
"
i
Cave: if state (1) and (2) are composed of several microscopic states, ,E - ,G
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 49
Z = $ e
" # E i
1
O = " O e i
# $ E i
...
Z i
Expectation Value
"
# "ln( Z ) &
E = ( )
ln Z = U p = kT % ( A = "kT ln( Z )
"# $ "V ' N,T
Helmoltz free energy
Internal Energy Pressure
Macroscopic
Expectation value
value E2, P2 ~ e-" E
E 2
1 # $ E i
O = " e
O i E3, P3 ~ e-" E
E 3
Z
" # E i
Where Z = $ e
E4, P4 ~ e-" E
E 4
is the partition function
,P ~ e-" E
E 5
E5 5
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 51
Ergodic Hypothesis
The ergodic hypothesis is that the ensemble averages used
averages used to compute expectation
values can be replaced by time averages over
averages over the simulation.
Ergodicity
O = O
ensemble time
%
1 1
$ O(r, p) e " # E ( r, p )
drdp =
$ O( t )dt
Z % 0
the molecular modelling technique. The sampling should reach all important
minima and explore them with the correct probability,
- NVE si
simulations " Microcanonic ensemble " P
P = cst.
- NVT si
simulations " Canonical ensemble P(E) = e-" E
" P(E)
- NPT si
simulations " Isothermic-isobaric ensemble P(E) = e-" ( ( E+PV)
" P(E)
" #
E
Note that the Bolzmann weight e is not present in the time average because
it is assumed that conformations are sampled from the right probability.
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 52
E NVE simulation
simulation in a
local minimum
"
*
3 N Spatial coordinates
1 ? 1
%
O ensemble
=
Z
$ O( r, p) e " # E ( r, p )
drdp =
%
$ O( t ) dt = O time
0
function E(r,p)
1) Accurate energy function E(r,p)
2) Appropriate algorithm,
algorithm, which - generates the right ensemble
- samples efficiently
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 53
1) Microcanonical ensemble (constant N,V,E )
sampling is obtained by simple integration
of the Newtonian dynamics:
- Verlet, Leap-Frog, Velocity Verlet, Gear NVE
2) Canonical ensemble (constant N,V,T )
sampling is obtained using thermostats : T
1) Berendsen: scaling of velocities to
(infinite E
obtain an exponential relaxation of the reservoir)
NV
temperature to T
2) Nose-Hoover : additional degree of
freedom coupled to the physical system
acts as heat bath. fixed P
3) Isothermic-isobaric ensemble (constant N,P,T )
In addition to the thermostat, the volume of the T
system is allowed to fluctuate, and is regulated (infinite E
by barostat algorithms. N reservoir)
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 54
friction term
temperature regulation
pi
ṙri
˙ = p˙ i = F i ( r) + " pi + # ( t )
mi
This equation
solvent can for
in implicit example
solvent simulateThe
simulations. the friction and stochastic
temperature is adjustedeffect
via # of
andthe$ ,
using the dissipation-fluctuation
theorem.
The stochastic term can improve barrier crossing and hence sampling.
LD does not reproduce
reproduce dynamical properties
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 56
To
newinsure Boltzmann
conformer. Let C sampling, additional
be the initial
be criteria
conformer andneed thetorandomly
C' the be appliedmodified:
on the
Plan
• Introduction
• Setting up a simulation
• Connection to
statistical mechanics
• Usage of MD simulation
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 58
Entropic effects : Not only energy minima are of importance but whole range of
x-values with energies ~k BT
energy The free energy (F)
U(x)
governs the system
½ kBT S(x1) S(x2)
F = U - TS
U(x2)
Biological molecules
specific processes exhibit
occur; a wide range of time scales over which
for example:
Loop Motions
Domain Motions
•
Subunit motions
•
Types of problems
• Protein stability
• Conformational changes
• Protein folding
• Molecular recognition: proteins, DNA, membranes, complexes
• Ion transport in biological systems
Drug Design
• Structure determination: X-ray and NMR
http://www.ch.embnet.org/MD_tutorial/
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 61
Historical perspective
Theoretical milestones:
Newton (1643-1727):
(1643-1727): Classical eequations
quations of m
motion: F(t)=m a(t)
otion: F(t)=m
Boltzmann(1844-1906): Foundations of statistical mechanics
Schrödi
ödinger (1887-1961): Quantum mechanical eq. of motion: --ihih 1t 2(t)=H(t) 2(t)
Metr
Metrop
opol
olis
is (1
(195
953)
3):: Fi
Firs
rstt Mo
Mont
ntee Ca
Carl
rloo (M
(MC)
C) simu
simula
lati
tion
on of a liqu
liquid
id
(hard spheres) L
i
Wood (1957): First MC simulation with Lennard-Jones potential
q
u
i
Alder (1957): First Molecular Dynamics (MD) simulation of
d
s
a liquid (hard spheres)
Car-
Car-Pa
Parr
rrin
inel
ello
lo(1
(198
985)
5):: Fi
Firs
rstt fu
full
ll Q
QM
M si
simu
mula
lati
tion
onss
Kollmann(1986): First Q
QM
M-MM ssiimulations
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 62
System sizes
BPTI (VAC),
(VAC), bovine pancreatic
trypsin inhibitor without solvent
BPTI, bovine pancreatic
BPTI,
trypsin inhibitor with solvent
RHOD, photosynthetic reaction
RHOD,
center of Rhodopseudomonas
viridis
HIV-1,, HIV-1 protease
HIV-1
ES,
ES, estrogen–DNA
STR,, streptavidin
STR
DOPC,, DOPC lipid bilayer
DOPC
RIBO,, ribosome
RIBO
Solid curve,
curve, Moore’s law
doubling every 28.2 months.
Dashed curve,
curve, Moore’s law
doubling every 39.6 months.
K.Y. Sanbonmatsu, C.-S. Tung, Journal of Structural Biology 157(3) : 470, 2007
Molecular Dynamics Simulation - Michel Cuendet - EMBL 2008 63
1 million atoms
Simulation time : 50 ns
Exemple : Aquaporin
Exemple : Aquaporin