Polaris(MD)
Polarisation and Simulation (Molecular Dynamics)

This is the beta version of the POLARIS(MD) website that is still under construction.

Here we review in more detail the theoretical underpinnings of Polaris(MD) and some of the technology involved.

  1. The TCPE class of polarizable force fields
  2. TCPE/2013 force field for water
  3. Modeling strong hydrogen bonds and metal/ligand interactions
  4. Extrapolating bulk phase hydration enthalpies of single organic ions from droplet simulations
  5. A polarizable coarse grained approach to model the solvent water
  6. The Multiple Time Steps approach r-RESPAp
  7. A Fast Multipole Method to model non-periodic systems
  8. POLARIS(MD) efficiency

The TCPE class of polarizable force fields

The TCPE class of force fields is based on the decomposition of the molecular interaction energy into six main energy components: an intramolecular relaxation $U^\mathrm{rel}$, a repulsive $U^\mathrm{rep}$, a Coulombic $U^\mathrm{qq’}$, a polarization $U^\mathrm{pol}$ based on an induced point dipole moment approach, a dispersion $U^\mathrm{disp}$, and a short-range anisotropic many-body $U^\mathrm{mbp}$ energy term.

The intramolecular relaxation energy term is a sum of three major components: the standard 1-2 stretching $U^\mathrm{st}$ and bending 1-3 $U^\mathrm{bend}$ harmonic energy terms , and the 1-4 torsional energy $U^\mathrm{tor}$ defined as $$ U^\mathrm{tor} = \sum_{k=1}^{3} V_k \Big(1+\cos(k\phi -\phi_0) \Big) $$ The repulsive term is a pairwise potential defined as: $$ U^\mathrm{rep} = \sum_{i,j} A_{ij}\exp{(-B_{ij}r_{ij})} $$ where $r_{ij}$ is the distance between two atomic centers $i$ and $j$, and $A_{ij},B_{ij}$ are two adjustable parameters. The electrostatic term is a standard Coulombic potential: $$ U^\mathrm{qq'} = \sum_{i,j} \frac{q_iq_j}{4\pi\varepsilon_0r_{ij}} $$ The static charges $q_i$ are located on atomic centers. The polarization term $U^\mathrm{pol}$ is based on the introduction of a new set of degrees of freedom, the point induced dipole moments $\mathbf{p}_i$, located on atomic centers that obey $$ \mathbf{p}_i = \alpha_i\cdot\left(\mathbf{E}_i^q + \sum_{j=1}^{N^*_\mu}{\mathbf{T}_{ij}\cdot\mathbf{p}_j}\right). $$ where $\alpha_i$ is the isotropic polarizability of the polarizable center $i$, $\vec E_i^q$ is the electric field generated by the static charges surrounding the center $i$, and $\mathrm{T}_{ij}$ is the dipolar tensor defined as $$ \mathrm{T}_{ij} \cdot \mathbf{p}_j = 3 \frac{(\mathbf{p}_j \cdot \mathbf{r}_j) \mathbf{r}_i}{r_{ij}^5} - \frac{\mathbf{p}_j}{r_{ij}^3} $$ Both the static charge electric field and the dipolar tensor include a short range Thole-like damping component. The $U^\mathrm{pol}$ energy is defined as $$ U^\mathrm{pol} = \frac{1}{2}\sum_{i} \frac{\mathbf{p}^2_i}{\alpha_i}-\sum_{i} \mathbf{p}_i \cdot \mathbf{E}_i^{q} - \frac{1}{2} \sum_{i,j} \mathbf{p}_i \cdot \mathbf{T}_{ij} \cdot \mathbf{p}_j $$ $U^\mathrm{disp}$ is a basic dispersion pairwise potential: $$ U^\mathrm{disp} = -\sum_{i,j} \Big( \frac{\sigma_{ij}}{r_{ij}} \Big)^6 $$ where $\sigma_{ij}$ is an adjustable parameters expressed in Å (kcal mol$^{-1}$)$^{-6}$. That term is not used to model all non electrostatic/polarization interactions between non-bonded pairs of atoms (i.e. separated by more than 2 chemical bonds). For particular interactions, like hydrogen bonds (HBs), strong hydrogen bonds (SHBs) or metal/ligand (MLs) ones, the TCPE class of force fields consider a more sophisticated short range many body potential $U^\mathrm{mbp}$: $$ U^\mathrm{mbp} = \sum f(r)g(\Theta) $$ For hydrogen bonded systems, the sum runs over all HB/SHB pairs, $ r $ and $ r_e $ are the HB/SHB length and its equilibrium value, respectively. $ \Theta $ is a set of specific intermolecular angles. For SHBs, $ \Theta $ reduces to the sole angle $ \psi^1 = \ce{\angle{X^- \cdots H-O}}$. For HBs, it is a set of two angles $ (\psi^1,\psi^2) $, namely $ \ce{\angle{O \cdots H-O}} $ and the angle between the bisector of the water angle $\ce{\angle{HOH}}$ and the hydrogen of the second water molecule involved in a HB, respectively. The functions $ f $ and $ g $ are defined as $$ f(r) = \displaystyle D_e \exp\left(-\frac{(r-r_e)^2}{\gamma_r}\right), \\ g(\Theta\{ \psi^l\}) =\displaystyle \prod_l \exp\left(-\frac{(\psi^l-\psi^l_e)^2}{\gamma_\psi^l}\right) $$ For a given HB (SHB), the intensity of its $U^\mathrm{mbp}$ term is modulated by the chemical environment of the water molecule (anion) accepting the water hydrogen. This is achieved by taking $D_e$ as a linear function of the local density $n_b$ of the water oxygen (HB) or of the $\ce{O-H}$ bonds (SHB) in the vicinity of hydrogen acceptor species $$D_e = d_e(1-\xi n_b)$$ $d_e$ and $\xi$ being two adjustable parameters and the local density $n_b$ is computed as $$ n_b =\sum \exp\left(-\frac{(r-r_e)^2}{\gamma'_{rt}}\right) $$ in which, $ r $ and $ r_e $ are defined as for function $f$. $\gamma'_{rt}$ is a parameter adjusted to take into account only water molecules in the first hydration sphere of the hydrogen acceptor species and the sum runs over all the molecules surrounding it (apart from the local one considered explicitly in the present function $f$). $U^\mathrm{mbp}$ is a short-range energy term that is smoothly zeroed for distances $r$ included between 5.75 and 6.25 angstrom. We discussed this energy term as mainly mimicking charge-transfer effects in SHBs even if it also accounts for halide/water dispersion. For HBs see our discussions below. Lastly to model heavy ion/ligand interactions, a specific $U^\mathrm{mbp}$ energy term for which the above angular components is removed. In other words, for a given heavy ion, like $\ce{Cm^3+}$ and $\ce{Th^{4+}}$, its $U^\mathrm{mbp}$ energy term is defined as $$ f(r) = \displaystyle D_e \exp\left(-\frac{(r-r_e)^2}{\gamma_r}\right), $$ the function $D_e$ being defined as above. Here the sum runs on all the neighboring ligand molecule. For instance, when ligand = water, the sum runs on the neighboring water oxygens.

TCPE/2013 force field for water

The rigid and many-body water model TCPE/2013 is based on a water interaction energy term $U^{ww}$corresponding to the following decomposition $$ U^\mathrm{ww} = U^\mathrm{rep} + U^\mathrm{qq'} + U^\mathrm{pol} + U^\mathrm{hb} $$ here $U^\mathrm{hb}$ is the energy term $U^\mathrm{mbp}$ defined above to model HB networks. The TCPE/2013 parameters have been assigned to reproduce quantum ab initio computations at least at the MP2/CBS limit level of theory regarding small water hydrogen bonded clusters comprising up to 6 molecules, as well as the water/water interaction energy in cation first hydration shells comprising up to 10 water molecules. However, we refined the $\gamma_r$ of the term $U^\mathrm{hb}$ to accurately reproduce the liquid water density at ambient conditions (and only at ambient conditions). The relation between the $U^\mathrm{hb}$ term and the water ab initio dispersion energy (the data are taken from Ref JCP 128 (2008) 094313) can be assessed in the following figure, where the water dimer interaction energy is plotted as a function of the water oxygen/oxygen elongation distance:
In that figure, $U^\mathrm{hb}$ is plotted in green, ab initio dispersion in blue and the difference between $U^\mathrm{hb}$ and the ab initio dispersion term is shown in dashed line (right axis). The TCPE/2013 reliability to model liquid water may be assessed in particular on the following Figure where the water density on a wide range of temperature and pressure conditions are plotted. In particular, we note TCPE/2013 to predict a maximum in the water density at about 283 K for a pressure P = 1 atm. We note also that a water density maximum is observed for all the pressure conditions up to 1 000 atm and that this maximum is linearly shifted to lower temperature, in agreement with experiment.
That figure is taken Réal et al. (2013).

Modeling strong hydrogen bonds and metal/ligand interactions

To model weak charge redistribution phenomena between two chemical species, like those occurring in strong hydrogen bonds or in the interactions between a heavy cation and water molecules, we use the two specific $U^\mathrm{mbp}$ potentials described in the above section. For strong hydrogen bonds, the $U^\mathrm{shb}$ term includes a angular dependence, whereas for heavy cation/neutral ligand interaction, the corresponding $U^\mathrm{tc}$ term is purely radial.

Using the strong hydrogen bond term $U^\mathrm{shb}$, we developed a force field able to model the hydration of the halides $\ce{F^-}$ to $\ce{As^-}$, as well as the hydration of linearly alkylated carboxylates. Regarding the halide $\ce{As^-}$ we showed its diffusion coefficient in aqueous phase at ambient conditions to be close to the $\ce{I^-}$ one, in agreement with recent experimental findings, see Ref. Inorganic Chemistry 57 (2018) 4926. Regarding linearly alkylated carboxylates, we showed their propensity for water droplet and liquid water interfaces to increase with the length of their alkyl chain. At water interfaces, the anionic head $\ce{COO^-}$ remains solvated while the alkyl chain points outside of the water system interface, see the following figure where three MD simulation snapshots of methanoate, butanoate and pentanoate carboxylates interacting with a 300 water droplet is shown

That figure is taken from Houriez et al. (2015).

Regarding heavy cations, we developed a force field based on the energy term $U^\mathrm{tc}$ that allows to investigate the hydration process of two heavy cations, namely $\ce{Cm^{3+}}$ and $\ce{Th^{4+}}$. See Réal et al. (2010) and Réal et al. (2013).

Extrapolating bulk phase hydration enthalpies of single organic ions from droplet simulations

By performing molecular dynamics simulations of water droplet systems embedded in a large enough cavity to prevent evaporation phenomena without perturbing the droplet dynamics, we extrapolate the bulk limit values of ion/water interaction energies $\Delta U^{iw}(\infty)$ according to the power law function $$\Delta U^{iw}(n) = \Delta U^{iw}(\infty) + \frac{\xi}{n^{1/3}}$$ here $\Delta U^{iw}(n)$ is the ion/water interaction energy of a droplet system of size $n$ and $\xi$ is an adjustable parameter. The following plot shows an example of linearly extrapolating the $ \Delta U^{iw}(\infty)$ values for linearly alkylated carboxylates from water droplet data (the droplet molecular size ranges from 50 to 10 000). These data are taken from Coles et al. (2016).
We estimate then the bulk water deformation energy $\Delta U^{ww}(\infty)$ due to the ion presence from bulk water simulation. For a mono atomic ion, its hydration enthalpy is $$\Delta H_\mathrm{hyd} = \Delta U^{iw}(\infty) + \Delta U^{ww}(\infty) - \mathrm{RT}$$ For a poly atomic ion (like carboxylate, ammonium and guanidinium organic ions), we add a third component, the difference in the ion intramolecular relaxation energy between bulk water and vacuum, $\Delta U^{rel}$, to the above sum when the ion is not simulated as a rigid species.

To assess the reliability of ion hydration enthalpy values computed according to that protocol, we estimate the proton hydration enthalpy from the above ion hydration enthalpy and by considering well accepted experimental data, like the enthalpy component of the ion proton affinity in gas phase, the hydration enthalpy of the neutral species (protonated or unprotonated one) corresponding to the ion, and the dissociation data corresponding to the protonation/deprotonation reaction of that ion in bulk water. For instance, the proton hydration enthalpy can be computed from data corresponding to the carboxylate $\ce{C^-}$ according to $$ \Delta H_{q \rightarrow aq} (\mathrm{H^+}) = \Delta H_{g \rightarrow aq} (\mathrm{CH}) + \Delta H_{aq,dissociation} (\mathrm{CH}) - \Delta H_{g \rightarrow aq}(\mathrm{C^-}) + \Delta H_{g,prot} (\mathrm{C^-}). $$ From methylated ammonium droplet simulations, we estimate $\Delta H_{g \rightarrow aq} (\mathrm{H^+})$ to be about 272-273 kcal mol$^{-1}$ in between the two most accepted experimental values, i.e. 271.6 and 274.9 kcal mol$^{-1}$. For carboxylates, our former droplet simulations yield a $\Delta H_{g \rightarrow aq} (\mathrm{H^+})$ value of 268 kcal mol $^{-1}$ clearly outside the latter experimental range of values. However, we identify recently a drawbacks of the water model TCPE/2013 in accurately modeling large sized anion first hydration shell structures. Remediating that drawback allows us to predict now a proton hydration enthalpy values from mono atomic halide anions (from $\ce{F^-}$ to $\ce{I^-}) $ and carboxylate $\ce{CH_3COO^-}$ droplet data in line with the above ammonium-based estimate, about 273 kcal mol$^{-1}$.

Lastly, for two ions for which experimental data are not available or are considered as puzzling, like the halide $\ce{As^-}$ and the organic cation guanidinium, our droplet-based simulation protocol yields the single ion hydration enthalpy values of 59 and 83 kcal mol$^{-1}$ for both these ions, respectively. These values are close to those computed for $\ce{I^-}$ and $\ce{NH_4^+}$, respectively. See also the following snapshot extracted from a MD simulation and showing the guanidinium ion interacting at the water droplet/air interface taken from Houriez et al. (2017).

A polarizable coarse grained approach to model the solvent water

Besides all atoms force fields, POLARIS(MD) allows also the use of an efficient polarizable coarse grained approach to model the solvent water at ambient conditions. That approach relies on considering polarizable pseudo particles whose isotropic polarizability obeys a Clausius-Mosotti relation (see also Masella et al. 2008). $$ \alpha_{pp} = \frac{\epsilon_s - 1}{4\pi \rho \epsilon_s}$$ here, $\epsilon_s$ and $\rho$ are the solvent dielectric constant and density. A priori, a pseudo particle can be used to model a single three atomic water molecule. However the pseudo particle polarisability depends on the pseudo particle volume. That allowed us to propose a multi-scale coarse grained scheme where solute/solvent interactions are handled by larger and larger pseudo-particles as the solute/solvent distance increases, as shown on the following figure

In such a pseudo particle approach, the solute/solvent and solvent/solvent interactions are truncated for large distances, the approach complexity scales as O(N). More details may be found in Masella et al. (2013). Lastly, we assign the parameters tied to particle/particle interactions to reproduce the hydrophobic effects as defined by the Lum-Chandler-Weeks theory (see Masella et al. 2011). Combined with the TCPE force fields to handle intra solute interactions, our CG approach has been shown to accurately model ion pairs in water and the solvent effect on the conformation of small peptides. We have also shown its ability to efficiently model hydrated proteins

The Multiple Time Steps approach r-RESPAp

A multiple time step (MTS) algorithm devoted to polarizable force fields based on the induced dipole moment approach is implemented in POLARIS(MD). The MTS scheme relies on the introduction of two sets of induced dipole moments, the 'full' (or the exact ones) computed by accounting for all the electric field components, and the short range dipole set that depends on the sole short range components of the electric field. For instance, when considering a polarizable center embedded in bulk water, the short range dipole set results from the electric field generated by all the water molecules lying within less than a cut off distance from it (typically the cut off distance is set to 8 Angstrom). Denoting $\Delta t$ the time step used to solve the Newtonian equations of motion, i.e. the time interval between two successive computations of the fast varying intramolecular forces, like repulsion or dispersion, the short range dipole set is iteratively converged at each $\Delta t$, whereas the 'full' dipole set is iteratively converged each $m\Delta t$, $m$ being an integer so that $m\Delta t \leq 5$ fs.

We show the efficiency of that MTS scheme to depend on the number of iterations to converge to 'full' dipole moments each $m$ molecular dynamics iterations. However, using basic predictors, we are able to converge these dipoles within less than 8 iterations. That allows speed up factor larger than 3 when simulating a bulk phase system comprising about 3 000 atoms, using periodic boundary conditions and standard Particle Mesh Ewald sum techniques, as compared to converge the full dipole set each MD iteration. More details may be found in (Masella 2006).

A Fast Multipole Method to model non-periodic systems

The fast multipole method allowing to approximate long-range electrostatic interactions in $O(N)$ time instead of $O(N^2)$ as required by a direct summation approachi has been implemented in POLARIS(MD).. Using this approach, large systems with hundreds of thousands to millions of atoms may be simulated efficiently. First introduced almost three decades ago by Greengard & Rohklin (1989), FMM has found uses in many applications of physics, from astrophysics to molecular dynamics. The FMM implementation introduced to POLARIS(MD) also allows for computing polarization effects in the TCPEp model (Coles and Masella 2015) and explicitly conserves linear momentum. The current version of FMM in POLARIS(MD) is parallelized with OpenMP Tasks and supports distributed computing via MPI.

POLARIS(MD) efficiency

In the following figures, the POLARIS(MD) efficiency (expressed in ns per day) to perform a molecular dynamics on a INTEL SKYLAKE 8168 CPU (24 cores and running at 2.7 Ghz, the code is compiled with INTEL FORTRAN version 17.0.6.256) is plotted as a function of the OPENMP threads. The output sampling interval is set to 1 000 MD iterations for all the simulations discussed below.

Here the system simulated is the protein 1PGB (855 atoms) embedded in a first level coarse grained solvent cubic box (Masella et al. 2013) and comprising about 5500 pseudo-particles (see below). The box dimension is about 55 Angstrom. The simulation is performed in the NVT ensemble with the temperature monitored by the GGMT thermostat (Liu 2000). The point induced dipole moments are iteratively solved until to reach a mean dipole difference between two successive iterations smaller than 10$^{-6}$ Debye and a maximum difference for a single dipole of $15\times10^{-6}$ Debye. XH bonds and HXH bond angles are constrained using the RATTLE procedure (the convergence criterium is 10$^{-6}$). We use the standard Verlet scheme to integrate the Newtonian equations of motion (and not the Multiple Time Step scheme described above). The integration time step is 2 fs.

Here the system simulated is cubic box comprising 1 000 water molecules using the force field TCPE/2013 (Réal 2013). The simulation is performed in the NPT ensemble using the Nose-Hoover themostat/barostat (Martyna et al. 1996). With use infinite periodic conditions and the "full" Coulombic and polarization atomic forces are computed using the SPME approach (Toukmaji 2000) using a grid size of 1 Angstrom, 8th order interpolation spline polynomials, and a direct sum precision of 10$^{-6}$. The Fast Fourier Transform are computed using the FFTW 3 library. No surface term is considered here. The point induced dipole moments are iteratively solved until to reach a mean dipole difference between two successive iterations smaller than 10$^{-6}$ Debye and a maximum difference for a single dipole of $20\times10^{-6}$ Debye. OH bonds and HOH bond angles are constrained using the RATTLE procedure (the convergence criterium is 10$^{-6}$ Angstroms). We use the r-RESPAp scheme to integrate the Newtonian equations of motion with a short range time step of 1 fs and long range time step of 5 fs.

Regarding the FFM approach implemented in POLARIS(MD), the following right plot shows the strong scaling of a single electrostatic calculation made with FMM (from computations performed on the Hazelhen supercomputing system, Stuttgard, Germany), using the Satellite Panicum Mosaic Virus capsid with 550,620 atoms (the capsid 3D structure solvated in a solvent box comprising about 1.8 M water pseudo-particles is shown on the following left figure). The central plot shows the linear scaling of the FFM energy function computational time as a function of the atomic size of water droplets up to a 2 M water droplet..