Login| Sign Up| Help| Contact|

Patent Searching and Data


Title:
ANALYSIS OF BIOMOLECULAR SOLVATION SITES BY THE 3D-RISM THEORY.
Document Type and Number:
WIPO Patent Application WO/2014/162615
Kind Code:
A1
Abstract:
This invention is a method for equilibrium solvation-site analysis for biomolecules. The method utilizes 3D-RISM calculations to quickly obtain equilibrium solvent distributions without either necessity of simulation or limits of solvent sampling. The analysis of these distributions extracts highest likelihood poses of solvent as well as localized entropies, enthalpies and solvation free energies. As a test system we used a structure of HIV-1 protease bound to KNI-272 where excellent structural and thermodynamic data is available for comparison. The results, obtained within minutes, show systematic agreement with available experimental data. Further, our results are in good agreement with established simulation-based solvent analysis methods. This method can be used not only for visual analysis of active site solvation but also for virtual screening methods and experimental refinement.

Inventors:
HIRATA FUMIO (JP)
SINDHIKARA DANIEL JON (JP)
Application Number:
PCT/JP2013/060996
Publication Date:
October 09, 2014
Filing Date:
April 05, 2013
Export Citation:
Click for automatic bibliography generation   Help
Assignee:
MOLECULAR DESIGN FRONTIER CO LTD (JP)
HIRATA FUMIO (JP)
International Classes:
G16B15/00
Other References:
KOVALENKO A. ET AL.: "Three-dimensional density profiles of water in contact with a solute of arbitrary shape: a RISM approach", CHEMICAL PHYSICS LETTERS, vol. 290, no. 1, 1998, pages 237 - 244
ANNA YERSHOVA ET AL.: "Generating Uniform Incremental Grids on SO(3) Using the Hopf Fibration", THE INTERNATIONAL JOURNAL OF ROBOTICS RESEARCH, vol. 29, no. 7, 2010
THEMIS LAZARIDIS: "Inhomogeneous Fluid Approach to Solvation Thermodynamics.", 1. THEORY, J. PHYS. CHEM. B, vol. 102, 1998, pages 3531 - 3541
TAKASHI IMAI ET AL.: "Partial Molar Volume of Proteins Studied by the Three-Dimensional Reference Interaction Site Model Theory", J. PHYS. CHEM. B, vol. 109, 2005, pages 6658 - 6665
THEMIS LAZARIDIS ET AL.: "Orientational correlations and entropy in liquid water", J. CHEM. PHYS., vol. 105, no. 10, 1996, pages 4294 - 4316
DANIEL J. SINDHIKARA ET AL.: "Placevent: An Algorithm for Prediction of Explicit Solvent Atom Distribution-Application to HIV-1 Protease and F-ATP Synthase", JOURNAL OF COMPUTATIONAL CHEMISTRY, vol. 33, no. 18, 2012, pages 1536 - 1543
HIROTO TADANO ET AL.: "Application and Performance Evaluation of the Volumetric Parallel 3D-FFT to 3D-RISM on Massively Parallel Cluster", THE SPECIAL INTEREST GROUP TECHNICAL REPORTS OF IPSJ, vol. 122, 15 December 2009 (2009-12-15), pages 1 - 6
YUTAKA MARUYAMA ET AL.: "Acceleration 3D-RISM Calculations Using Graphics Processing Unit", SACSIS2010 SYMPOSIUM ON ADVANCED COMPUTING SYSTEMS AND INFRASTRUCTURES, vol. 2010, no. 5, 20 May 2010 (2010-05-20), pages 97 - 98
Download PDF:
Claims:
CLAIMS

[Claim 1]

A method to postprocess solvent distributions from 3D-RISM, which a) locates solvation-sites b) performs a 6 dimensional search for solvent poses, c) determines the optimal solvent pose using a 3D-RISM-based weight-function, d) calculates the site first order excess translational entropy using 3D-RISM distribution, e) calculates the site first order excess rotational entropy using the orientational distribution function based on the orientational search, f) calculates the site partial molar volume using 3D-RISM total and direct correlation functions, g) calculates the site solvation free energy using 3D-RISM total and direct correlation functions, h) calculates the site average solute-solvent interaction potential using the solvent interaction potential and 3D-RISM distribution function.

[Claim 2]

A computer program to perform the methodology and calculations stated in Claim 1 and outputs the resulting thermodynamic quantities and locations of solvation sites in a file format easily readable by standard molecular visualization and/or molecular analysis software.

Description:
DESCRIPTION

Title of Invention

Analysis of biomolecular solvation sites by the 3D-RISM theory.

Technical Field

This invention is an algorithm and software for investigating solvation sites in biomolecules. It is relevant to the fields of experimental refinement and pharmaceutical design.

Background Art Solvent plays several critical roles in the function of biomolecules. In particular, microscopic effects of solvent such as hydrogen bonding and binding affinity augmentation by the "solvent-displacement effect" are necessary for critical analysis of ligand binding. Minute knowledge of such effects is necessary for reliable prediction, explanation and design of molecular systems. Thus it is no wonder increasing attention is paid by the pharmaceutical- design community to solvent molecules inside active site of protein.

Experimental methodologies for analyzing atomistic water behavior can be extremely useful but are often hindered by limited spatial resolution. Theoretically, sophisticated analysis of water based on simulation has been development for over a decade. In 1998, Lazaridis derived inhomogeneous fluid theory in two landmark papers. 1 ' 2 Several years later, Young et al developed a solvation site based approach, commonly called "Watermap,"[PTLl-3] to the forefront by applying Lazaridis' theory towards identification of displaceable water sites to enhance ligand binding. 3 ' 4 Similarly, recent work by Nguyen et al uses, rather, a grid-based application of Lazaridis' theory. 5

Although the "explicit-solvent" simulations, on which this analysis is based, enable incredible precision, the sampling problem limits reliable analysis to smaller systems depending on the available computational budget. Even for extremely expensive simulations though, it is questionable whether sufficient sampling of water and ions has occurred in buried regions of the solute where the solvent exchange rate is very slow. 6'9 Citation List

Patent Literature

PTLl: USP 7,970,581

PTL2: USP 7,970,580

PTL3: USP 7,756,674

Non Patent Literature

NPLl : Lazaridis, T. Inhomogeneous Fluid Approach to Solvation Thermodynamics. 2. Applications to Simple Fluids. JPhys Chem B 102, 3542-3550 (1998).

NPL2: Lazaridis, T. Inhomogeneous fluid approach to solvation thermodynamics. 1. Theory. JPhys Chem B 102,

3531-3541 (1998).

NPL3: Young, T. T., Abel, R. R., Kim, B. B., Berne, B. J. B. & Friesner, R. A. R. Motifs for molecular

recognition exploiting hydrophobic enclosure in protein-ligand binding. Proc. Natl. Acad. Sci. U.S.A. 104,

808-813 (2007).

NPL4: Abel, R., Young, T., Farid, R., Berne, B. J. & Friesner, R. A. Role of the Active-Site Solvent in the

Thermodynamics of Factor Xa Ligand Binding. J. Am. Chem. Soc. 130, 2817-2831 (2008).

NPL5: Nguyen, C. N., Kurtzman Young, T. & Gilson, M. K. Grid inhomogeneous solvation theory: Hydration structure and thermodynamics of the miniature receptor cucurbit[7]uril. J Chem Phys 137, 044101 (2012).

NPL6: Deng, Y. & Roux, B. Computation of binding free energy with molecular dynamics and grand canonical

Monte Carlo simulations. J Chem Phys 128, 115103-115103-8 (2008).

NPL7: Michel, J. & Essex, J. W. Prediction of protein-ligand binding affinity by free energy simulations:

assumptions, pitfalls and expectations. J Comput Aided Mol Des 24, 639-658 (2010).

NPL8: Luccarelli, J., Michel, J., Tirado-Rives, J. & Jorgensen, W. L. Effects of Water Placement on Predictions of Binding Affinities for p38a MAP Kinase Inhibitors. Journal of Chemical Theory and Computation 6,

3850-3856 (2010).

NPL9: Mobley, D. L. Let's get honest about sampling. J Comput Aided Mol Des 1-3 (2012). doi: 10.1007/s 10822- 011-9497-y

NPL10:Beglov, D. & Roux, B. An integral equation to describe the solvation of polar molecules in liquid water. J Phys Chem B 101, 7821-7826 (1997).

NPL1 l:Kovalenko, A. & Hirata, F. Three-dimensional density profiles of water in contact with a solute of

arbitrary shape: a RISM approach. Chemical Physics Letters 290, 237-244 (1998).

NPL12:Imai, T., Hiraoka, R., Kovalenko, A. & Hirata, F. Water Molecules in a Protein Cavity Detected by a Statistical-Mechanical Theory. J. Am. Chem. Soc. 127, 15334-15335 (2005).

NPL13:Imai, T., Hiraoka, R., Kovalenko, A. & Hirata, F. Locating missing water molecules in protein cavities by the three-dimensional reference interaction site model theory of molecular solvation. Proteins 66, 804-813 (2006).

NPL14:Yokogawa, D., Sato, H. & Sakaki, S. The position of water molecules in Bacteriorhodopsin: A three- dimensional distribution function study. Biophysical Journal 147, 112-116 (2009).

NPL15:Sindhikara, D. J., Yoshida, N. & Hirata, F. Placevent: an algorithm for prediction of explicit solvent atom distribution-application to HIV-1 protease and F-ATP synthase. J Comput Chem 33, 1536-1543 (2012). NPL16:Stumpe, M. C, Blinov, N., Wishart, D., Kovalenko, A. & Pande, V. S. Calculation of local water densities in biological systems: a comparison of molecular dynamics simulations and the 3D-RISM-KH molecular theory of solvation. JPhys Chem B 115, 319-328 (2010).

NPL17:Phongphanphanee, S., Yoshida, N. & Hirata, F. The potential of mean force of water and ions in aquaponn channels investigated by the 3D-RISM method. Biophysical Journal 147, 107-111 (2009).

NPL18: Yoshida, N., Phongphanphanee, S., Maruyama, Y., Imai, T. & Hirata, F. Selective Ion-Binding by Protein Probed with the 3D-RISM Theory. J. Am. Chem. Soc. 128, 12042-12043 (2006).

NPL19:Sindhikara, D. J., Yoshida, N., Kataoka, M. & Hirata, F. Solvent penetration in photoactive yellow protein R52Q mutant: A theoretical study. Journal of Molecular Liquids 164, 120-122 (2011).

NPL20:Palmer, D. S., Frolov, A. I., Ratkova, E. L. & Fedorov, M. V. Towards a universal method for calculating hydration free energies: a 3D reference interaction site model with partial molar volume correction. J. Phys.: Condens. Matter 22, 492101 (2010).

NPL21 :Imai, T., Oda, K., Kovalenko, A., Hirata, F. & Kidera, A. Ligand mapping on protein surfaces by the 3D- RISM theory: toward computational fragment-based drug design. J. Am. Chem. Soc. 131, 12430-12440 (2009).

NPL22:Imai, T. et al. Functionality mapping on internal surfaces of multidrug transporter AcrB based on

molecular theory of solvation: implications for drug efflux pathway. JPhys Chem B 115, 8288-8295 (2011).

NPL23:Nikolic, D., Blinov, N., Wishart, D. S. & Kovalenko, A. 3D-RISM-Dock: a new fragment-based drug design protocol. Journal of Chemical Theory and Computation (2012). doi: 10.1021 /ct300257v

NPL24:Kiyota, Y., Yoshida, N. & Hirata, F. A New Approach for Investigating the Molecular Recognition of

Protein: Toward Structure-Based Drug Design Based on the 3D-RISM Theory. Journal of Chemical Theory and Computation 7, 3803-3815 (2011).

NPL25:Mitchell, J. C. Sampling rotation groups by successive orthogonal images. SIAM Journal on Scientific

Computing O, 525-547 (2008).

NPL26:Yershova, A., Jain, S., LaValle, S. M. & Mitchell, J. C. Generating Uniform Incremental Grids on SO(3)

Using the Hopf Fibration. The International Journal of Robotics Research 29, 801-812 (2010).

NPL27:Hirata, F. Molecular theory of solvation. (2003).

NPL28:Imai, T., Kovalenko, A. & Hirata, F. Partial Molar Volume of Proteins Studied by the Three-Dimensional

Reference Interaction Site Model Theory†. JPhys Chem B 109, 6658-6665 (2005). NPL29:Lazaridis, T. & Karplus, M. Orientational correlations and entropy in liquid water. JChem Phys 105, 4294-^316 (1996).

NPL30:Adachi, M. et al. Structure of HIV-1 protease in complex with potent inhibitor KNI-272 determined by high-resolution X-ray and neutron crystallography. Proc. Natl. Acad. Sci. U.S.A. 106, 4641-4646 (2009). NPL31 :Hornak, V. et al. Comparison of multiple Amber force fields and development of improved protein

backbone parameters. JComput Chem 65, 712-725 (2006).

NPL32:Gaussian 09, Revision A. 02, Gaussian. Inc., Wallingford, CT2, 4 (2009).

NPL33: Wang, J., Wolf, R. M., Caldwell, J. W., Kollman, P. A. & Case, D. A. Development and testing of a

general amber force field. JComput Chem 25, 1157-1174 (2004).

NPL34:Word, J. M., Lovell, S. C, Richardson, J. S. & Richardson, D. C. Asparagine and glutamine: using

hydrogen atom contacts in the choice of side-chain amide orientation. J. Mol. Biol. 285, 1735-1747 (1999). NPL35:Dupradeau, F.-Y. et al. The R.E.D. tools: advances in RESP and ESP charge derivation and force field library building. Phys. Chem. Chem. Phys. 12, 7821 (2010).

NPL36:Luchko, T. et al. Three-dimensional molecular theory of solvation coupled with molecular dynamics in Amber. Journal of Chemical Theory and Computation 6, 607-624 (2010).

NPL37:Case, D. A. et al. AMBER 12. University of California (2012). at <http://ambermd.org>

NPL38:Li, Z. & Lazaridis, T. Thermodynamic Contributions of the Ordered Water Molecule in HIV-1 Protease. J.

Am. Chem. Soc. 125, 6636-6637 (2003).

NPL39:Baldwin, E. T. et al. Structure of HIV-1 protease with KNI-272, a tight-binding transition-state analog containing allophenylnorstatine. Biophysical Journal 3, 581-590 (1995).

Summary of Invention

The invention is an algorithm and related software to analyze the local thermodynamics and structural properties of solvation sites in biomolecules using the 3D-RISM theory. The invention produces reliable spatial and thermodynamic properties of solvation sites readily usable for pharmaceutical design.

Technical Problem

In order to perform rational drug design, much information is needed about the interactions that affect drug binding. One important factor is the role of solvent sites in the active site. These sites are impossible to characterize experimentally. Conventional techniques such as implicit solvation are fundamentally useless with this regard. Postprocessing of explicit solvent simulations may be useful if it weren't for the fact that these simulations are severely hindered by the computational time to sample solvent. [PTL 1-3] Consequently such techniques are either extremely time consuming or simply unreliable. Thus there is no reliable, efficient method for performing analysis of solvation sites, severely limiting pharmeceutical design potential.

Solution to Problem

This invention utilizes a methodology, 3D-RISM, which quickly obtains reliable, converged solvent structural data for biomolecules, then postprocesses the data in order to clearly characterize solvation sites. The resulting information, including site locations, orientations, and thermodynamic properties can be directly used in pharmaceutical design efforts such as virtual screening pharmacaphore modeling.

Advantageous Effects of Invention

This invention will allow much faster analysis of biomolecular sites in various solvent conditions. The analysis will be a boon for pharmeceutical design efforts. The invention showed systematic agreement with high-resolution experimental data and with simulation-based water analysis techniques. Our calculations on HIV-1 protease took just under 11 minutes total on an 8-core workstation. This method can directly take into account cosolvent and ionic effects on solvent structure and thermodynamics, which cannot be realized by more conventional methods such as the continuum solvent models and molecular simulation. Thus we assert that our method is both extremely practical and reasonably accurate method for solvation-site analysis.

Brief Description of Drawings

Figure 1. shows a flowchart of the main aspects in the context of expected use of this invention.

Description of Embodiments The invention is embodied as a computer program written in Python. The program performs file search, reading, analysis and writing.

DETAILED DESCRIPTION

This invention circumvents the solvent sampling problem by using the three-dimensional reference interaction site model (3D-RISM) theory. 10,11 3D-RISM attains complete atomistic sampling of solvent, including ions, by utilizing an integral approach. 3D-RISM has been successful in locating water in proteins compared to experiment 12"15 and simulation, 16 ion locations and pathways, hydration free energies, fragment poses, " and drug poses, as well as many more applications less relevant to this work. With current implementations, the equilibrium solvent distribution of a biologically relevant system can be calculated in minutes to hours.

3D-RISM first utilizes RISM calculations to find the susceptibility function of the solvent based on specified atomic interaction potentials and mixture concentrations. The solvent susceptibility is then utilized in the 3D-RJSM equations, including the atomic solvent-solute interaction potential. The 3D-RISM equations coupled with an appropriate closure relation are iterated until self-consistency resulting in the 3D solvent distribution function g(r), the total correlation function h(r), the direct correlation c(r), as well as other distributions and properties.

To characterize solvation sites, the location of the site must first be identified. We find the sites using the maximum probability in: P( T )n, — P( T )n-i * (1 - Θ(|Γ - r™"* - R\)) ^ Here, P( T )o = oi?( r )c, n is the iteration number, and R is a present exclusion radius (here lA). This algorithm is iterated until a cutoff is reached, here we cutoff at a number of solvent molecules placed. The resulting discrete distribution represents high occupancy sites whose properties may be of interest in solvation site analysis. The embodiment employs the

Successive Orthogonal Images (SOI) approach, 6 which produces more uniform coverage of rotational space. A uniform distribution is obtained by 1) creating a uniform distribution of points on the unit sphere, V 1 , 2) For each point on the sphere create a set of uniform vectors, V 2 , also within the unit sphere which are orthogonal to the vector pointing from the center to that point, 3) trivially determine an orthogonal vector to the two previous vectors, 4) determine the rotation, M, expressed in Euler angles corresponding to the three orthogonal vectors based on an arbitrary reference geometry. The resulting set rotations exists within SO(3) and represents a uniform set of rotations. For efficiency in computation we find the optimal angular difference, o, and sizes of \V l I and l^ 2 l for a fixed number of rotations. Equation 3.1 in reference 28 shows that i^ 2 l = 4 /α 2 and Ι^Ί = 2TT/G thus we set a = (8n 2 /N rot ) Vhere N T ot = I V 2 \ |V X |. We take the solvent atom closest to the center of geometry (Oxygen for water) as a fixed, "anchor" for rotations. For each position of an anchor, the rotational probability distribution function can be described as:

= Z~ l g 1 (r anchor + Γ 7 , Ω )

yesites (2) Here Γ ,ω is the relative position of the solvent-site Ί given rotation ω from some arbitrary original geometry and Z is the partition function. The solvation-site is defined as the 6D conformational space available by moving the anchor atom within a 1.0 A radius sphere centered on the originally identified location, r n . The highest likelihood pose lies at the maximum of P(w\ r ) where l r - T n \≤ R.

The implementation characterizes "solvation sites" using integrations over the solvation site volume, Vn. The excess chemical potential or solvation free energy is evaluated simply using the total and direct correlation functions for many 3D-RISM closures including the KH- closure which we use here. 11,27

Δμ™ = Pok B T∑f v [i ( r)) 2 e (- r)) - ) ~ ^ 'tew]

For the partial molar volume, the adapted Kirkwood-Buff equations toward 3D-RISM

CALCULATION DETAILS

System geometries tested for this implementation were taken from the PDB: HIV-1 protease (2ZYE). 30 Protein interaction parameters were taken from the ff99SB parameter set. 31 KNI-272 was parameterized using Gaussian 09 32 using HF 6-31G* basis set and GAFF parameters 33 using antechamber 3 with RED IV 35 parameterization. Protonation states were taken from available deuterium data from the experimental structure. Alanine Dipeptide structure and all parameters were prepared using tLEaP. Pure water at 55.5 M using modified SPC parameters. 36 3D-RISM calculations were run using the AmberTools rism3d.snglpnt.MPI module. 36 ' 37 All calculations were performed on an HP Z 100 8-core Intel 3.2 GHz Xeon workstation. The 3D-RISM calculation on HIV-1 protease and Alanine Dipeptide using the KH closure took 6.5 minutes and 39 seconds respectively. Postprocessing was performed using a Python script, which took 4.3 minutes to run in serial.

RESULTS

The embodiment was tested by comparing experimental waters to their nearest predicted solvation site and compared the RMSD between the pair of poses with—TS. Linear fits give r - -0.47 and r = -0.67 for -TS tra ns and -TS rot respectively. The strongest correlation was with the total contribution of entropy to the free energy, -TS rot where r = -0.76.

-TStrans had a r =-0.48 against distance between experimental and predicted oxygens. The experimental B-factor also had a correlation of r - 0.46 with the 0-0 distance.

The data suggest that there is a reasonable correlation between "structural predictability" and entropy.

Next, quantitative comparisons of thermodynamic quantities to those obtained by simulation-based analysis were obtained. In 2003, Li and Lazaridis 38 studied an isolated water bound between the flaps and the inhibitor KNI-272 bound from of HIV- 1 protease using structure 1HPX 39 and the simulation-based approach. In Table 1, their data is compared to the results of this implementation. For this water, the estimates for solvation free energy agreed to less than 1 kcal/mol. The entropic contribution was underpredicted by 1-2 kcal/mol. The solvent-solute interaction energy was also under-predicted by about 7 kcal/mol. While the differences may be partly attributed to force field differences and solute structures, other factors are likely due to limitation of methodological errors for generating the initial data (sampling error in simulation and closure error in 3DRISM).

Although Lazaridis' work calculates the total solvation free energy by addition of explicit terms, this implementation calculates only the first order terms in energy and entropy but calculates the solvation free energy directly. The PV term in the free energy for most the sites including the flap water, this term was nearly zero.

The site in the vicinity of KNI-272, the "flap" water, despite its steep entropic penalty, favorably contributes to the solvation free energy. The results included less favorable waters nearby the inhibitor (e.g. water 354 with Δ(7=-4.96 kcal/mol) as well as other sites in the hinge region as high as -0.93, and near the termini as high as +1.65 kcal/mol. Such water sites may be reasonable targets for displacement in drug-design efforts.

Industrial Applicability

This invention can be used by pharmeceutical scientists and molecular design scientists in order to better understand the microscopic solvent effects in their system, enabling intelligent design.