Journal list menu
- Issue
ChemMedChem: Volume 13, Issue 6
Special Issue:Cheminformatics in Drug Discovery
464-645March 20, 2018
Export Citations
Cover Pictures
Front Cover: Chemogenomic Active Learning's Domain of Applicability on Small, Sparse qHTS Matrices: A Study Using Cytochrome P450 and Nuclear Hormone Receptor Families (ChemMedChem 6/2018)
- Pages: 464
- First Published: 25 March 2018
The Front Cover shows an emerging method for data mining ligand–target bioactivity matrices known as Chemogenomic Active Learning. As ligand–target pairs labeled as actives and inactives are systematically picked one-by-one, a set of decision trees serving as rules to explain active and inactive bioactivity is built. In this article, Rakers et al. show how the method can effectively model nuclear hormone receptor (NHR) and cytochrome P450 (CYP450) family-wide ligand–target interaction with only a fraction of available data. An actively learned model can be updated after new ligands are synthesized and assayed for activity, converging on hit discovery and optimization more effectively than brute-force screening. Cover artwork by Christin Rakers and J.B. Brown. More information can be found in the Full Paper by J.B. Brown et al. on page 511 in Issue 6, 2018 (DOI: 10.1002/cmdc.201700677).
Cover Feature: AquaMMapS: An Alternative Tool to Monitor the Role of Water Molecules During Protein–Ligand Association (ChemMedChem 6/2018)
- Pages: 465
- First Published: 25 March 2018
The Cover Feature shows the different behavior of structural water molecules and bulk water. We have developed the AquaMMapS tool, which is able to identify stationary hydration sites close to the protein surface starting from molecular dynamics simulation. Root-mean-square fluctuation (i.e., a 1.4 Å cutoff) is employed to discriminate stationary from nonstationary water molecules. Space is organized into a grid, where cells crossed by stationary water molecules are selected and their occupancy computed along the simulation. Moreover, an empirical scoring function, the AquaMMapScore, has been developed to evaluate the penalty of a ligand displacing a stationary water hot spot. More information can be found in the Full Paper by Stefano Moro et al. on page 522 in Issue 6, 2018 (DOI: 10.1002/cmdc.201700564).
Cover Feature: CHIPMUNK: A Virtual Synthesizable Small-Molecule Library for Medicinal Chemistry, Exploitable for Protein–Protein Interaction Modulators (ChemMedChem 6/2018)
- Pages: 466
- First Published: 25 March 2018
The Cover Feature shows three chipmunks involved in the creation, analysis, and clustering of the synthesizable virtual molecule library CHIPMUNK. Nearly 100 million compounds were generated with in silico reactions on accessible building blocks, and their descriptor profile was analysed. The library was clustered together with molecules from other public libraries in order to relate it to the known chemical space and to divide the huge library into manageable subsets. It serves as an idea generator and covers the chemical space beyond the rule of five and of protein–protein as well as protein–ligand interaction modulators. Artwork provided by Melanie Wilkesmann. More information can be found in the Full Paper by Oliver Koch et al. on page 532 in Issue 6, 2018 (DOI: 10.1002/cmdc.201700689).
Editorial
Special Issue: Cheminformatics in Drug Discovery
- Pages: 467-469
- First Published: 25 March 2018
Digital Drug Discovery: Guest Editors Andreas Bender (University of Cambridge) and Nathan Brown (BenevolentAI) present the 20 articles included in this Special Issue on Cheminformatics in Drug Design. As they summarize each article, they also discuss the common themes within in silico drug discovery that these papers represent.
Reviews
Caveat Usor: Assessing Differences between Major Chemistry Databases
- Pages: 470-481
- First Published: 16 February 2018
The three databases of PubChem, ChemSpider, and UniChem capture the majority of open chemical structure records. Collectively, they constitute a massively enabling resource for cheminformatics, chemical biology, and drug discovery. It is important for users to have at least some appreciation of differential content to enable utility judgments for the tasks at hand. This turns out to be challenging. By comparing the three resources in detail, this review assesses their differences, some of which are not obvious.
Minireviews
Exploring Structure–Activity Relationships with Three-Dimensional Matched Molecular Pairs—A Review
- Pages: 482-489
- First Published: 06 December 2017
Better in 3D: Exploration of activity cliffs in structure–activity relationship series is an excellent source of new insight. Three-dimensional matched molecular pairs (3D MMP) are an intuitive way to analyze existing data and to build models based on resulting observations. We present a concise summary of models and applications of recent years. Shown is a 3D MMP as published by Posy et al., J. Chem. Inf. Model. 2013, 53, 1576–1588 (reprinted with permission, © 2013 American Chemical Society).
Concepts
Rationalizing Promiscuity Cliffs
- Pages: 490-494
- First Published: 11 October 2017
Promiscuity cliff: Shown is an exemplary promiscuity cliff comprising two structural analogues that were extensively tested in screening assays and displayed an unexpectedly large difference in the number of targets (PD) they were active against. The compound on the right was consistently inactive in all assays.
Communications
Kinome-Wide Profiling Prediction of Small Molecules
- Pages: 495-499
- First Published: 23 May 2017
Kinome-wide prediction of kinase inhibitors: The capabilities of proteochemometric (PCM) models to make large-scale predictions on the entire kinome was explored. The combination of a compound fingerprint with a protein fingerprint 1) improves the activity prediction for each kinase relative to individually trained models and 2) enables prediction of the activity of compounds for the entire kinome, including cancer-related resistance mutations.
Comparative Molecular Dynamics Simulation of Aggregating and Non-Aggregating Inhibitor Solutions: Understanding the Molecular Basis of Promiscuity
- Pages: 500-506
- First Published: 23 October 2017
Some truth behind false positives: Promiscuous inhibitors show up in enzyme assays as false positives, mainly because of aggregation. Molecular dynamics (MD) simulations were conducted for known aggregators and non-aggregators, revealing the forces and physicochemical properties behind the phenomenon of aggregation. This study also investigated the potential use of MD simulations as predictive tools in this regard.
IChem: A Versatile Toolkit for Detecting, Comparing, and Predicting Protein–Ligand Interactions
- Pages: 507-510
- First Published: 11 October 2017
Taming complexity: IChem is a suite of software dedicated to the analysis and comparison of three-dimensional molecular objects. It converts an intricate three-dimensional information into much simpler fingerprints or graphs, thereby enabling high-throughput comparisons and fueling machine learning models for predicting important features like protein–protein interfaces, druggable cavities, interaction patterns, and binding poses.
Full Papers
Very Important Paper
Chemogenomic Active Learning's Domain of Applicability on Small, Sparse qHTS Matrices: A Study Using Cytochrome P450 and Nuclear Hormone Receptor Families
- Pages: 511-521
- First Published: 06 December 2017
Learning for big data: Chemogenomic active learning represents an alternative to current strategies of dumping large-scale databases into “black box” machine learning by automatically leveraging a reduced number of informative, retraceable ligand–target pairs. Recent reports have shown its efficiency on big datasets, and herein we complementarily assess its prediction performance in sparse data scenarios and applicability to de-orphanization tasks.
AquaMMapS: An Alternative Tool to Monitor the Role of Water Molecules During Protein–Ligand Association
- Pages: 522-531
- First Published: 29 November 2017
Water watch: The AquaMMapS tool was developed to individuate zones occupied by stationary water molecules during MD simulations. Individuation of stationary water voxels in a protein binding site can be exploited for drug design. AquaMMapScoring was developed to evaluate the penalty associated with different ligand substituents, considering electronegativity and capability of forming hydrogen bonds and stability of the water displaced.
Very Important Paper
CHIPMUNK: A Virtual Synthesizable Small-Molecule Library for Medicinal Chemistry, Exploitable for Protein–Protein Interaction Modulators
- Pages: 532-539
- First Published: 01 February 2018
CHIPMUNK is a library containing 95 million molecules derived from in silico reactions. It covers novel chemical space and is suited for the design of new protein–ligand and protein–protein interaction inhibitors extending the chemical space beyond the rule of five. It will therefore assist future drug design projects. One unique feature are clustered subsets that contain the target space based on ChEMBL data.
Mapping of the Available Chemical Space versus the Chemical Universe of Lead-Like Compounds
- Pages: 540-554
- First Published: 20 November 2017
A molecular atlas: A robust generative topographic map of fragment-like chemical space has been optimized. It accommodates more than 40 million molecules with no more than 17 heavy atoms, from the theoretically enumerated GDB-17 and real-world PubChem/ChEMBL databases. It serves as a library comparison tool to highlight biases in real-world molecules versus possible species from GDB-17. Specific patterns, proper to some libraries and absent from others, are highlighted.
Consensus Predictive Model for Human K562 Cell Growth Inhibition through Enalos Cloud Platform
- Pages: 555-563
- First Published: 01 December 2017
Predictive model: A large dataset of K562 biological inhibitors is computationally treated to identify compounds that possibly have therapeutic action against β-thalassemia. A predictive computational model for K562 inhibition is developed and validated. The model facilitates fast and reliable virtual screening of new molecules and is freely available online.
Very Important Paper
Hit Dexter: A Machine-Learning Model for the Prediction of Frequent Hitters
- Pages: 564-571
- First Published: 29 December 2017
Hit Dexter: False-positive assay signals triggered by badly behaving compounds continue to pose a major challenge to experimental screening. A free web service, called Hit Dexter, is able to identify such compounds with high accuracy, enabling chemists to make better-informed decisions on their hit compounds.
In Silico Prediction of Compounds Binding to Human Plasma Proteins by QSAR Models
- Pages: 572-581
- First Published: 22 October 2017
In silico strategies: We used data curation, descriptor selection, machine learning algorithms, consensus modeling techniques, diverse validation strategies, and applicability domain analysis to develop quantitative structure–activity relationship (QSAR) models of compound plasma protein binding. Experimental data uncertainty was also assessed, helping us form reasonable expectations for potential models.
Cross-Classified Multilevel Modelling of the Effectiveness of Similarity-Based Virtual Screening
- Pages: 582-587
- First Published: 06 November 2017
We describe the use of cross-classified multilevel modelling to analyse the results of similarity-based virtual screening searches using 2D fingerprints. We show that the choice of fingerprint is more important than the choice of similarity coefficient, and that multiple reference structures need to be used in benchmark studies such as this.
Comparison of Maximum Common Subgraph Isomorphism Algorithms for the Alignment of 2D Chemical Structures
- Pages: 588-598
- First Published: 23 October 2017
Finding the MCS (maximum common substructure) is a computationally difficult challenge, for which several attempts have been made to improve both the search speeds, and quality, of reported solutions. This article describes challenging benchmarks for a series of MCS algorithms, and reports the MCS type and algorithm combinations which generally yield the fastest and most sensible results in a chemoinformatic problem domain.
Matched Molecular Pair Analysis on Large Melting Point Datasets: A Big Data Perspective
- Pages: 599-606
- First Published: 26 June 2017
An MMP for MPs: A matched molecular pair analysis was used to examine the change in melting point (ΔMP) between molecules in a set of ∼275 000 compounds. We found many cases in which the ΔMP correlates with changes in functional groups and simple descriptors, such as number of hydrogen bond donors and acceptors. We observed that this method remains stable and scales well with larger datasets, indicating its utility as a simple privacy-preserving technique to analyze large proprietary databases and share findings.
Bioisosteric Replacements Extracted from High-Quality Structures in the Protein Databank
- Pages: 607-613
- First Published: 03 January 2018
Aiding lead optimisation: A data set of high-quality protein–ligand complexes spanning 121 protein targets was analysed for the presence of bioisosteric fragments. A pairwise analysis of all ligands for each target was carried out. The ligands were fragmented and a pair of fragments considered bioisosteric if they occupy a similar volume of the protein binding site. Only a small number of the bioisosteric pairs were found to be common to two or more targets.
3D-e-Chem: Structural Cheminformatics Workflows for Computer-Aided Drug Discovery
- Pages: 614-626
- First Published: 16 January 2018
Reusable structural cheminformatics for drug discovery: Data analytics platforms such as KNIME enable the creation of robust cheminformatics protocols that can easily be transferred, reused, and extended. 3D-e-Chem provides a set of KNIME nodes and reconfigurable workflows for data-driven drug discovery including: ligand-based target prediction, structure-based bioactivity data mapping and ligand scaffold replacement, and ligand repurposing.
Specific Noncovalent Interactions Determine Optimal Structure of a Buried Ligand Moiety: QM/MM and Pure QM Modeling of Complexes of the Small-Molecule CD4 Mimetics and HIV-1 gp120
- Pages: 627-633
- First Published: 16 January 2018
Cavity filling: Desolvation and σ-hole/dispersion interactions are the driving forces between the small-molecule CD4 mimetics halogenated phenyl motif, called region 1, and residues F376–N377 in the HIV-1 gp120 envelope protein. In this study, we demonstrate that only quantum-mechanics-based methods can capture those non-standard effects, opening a new perspective in the rational optimization of region 1 to improve binding affinity.
In Silico Studies Designed to Select Sesquiterpene Lactones with Potential Antichagasic Activity from an In-House Asteraceae Database
- Pages: 634-645
- First Published: 11 January 2018
Attention to the neglected: Chagas disease is a neglected tropical disease that affects more than eight million people in the Americas. Using a set of 1306 sesquiterpene lactones (SLs) obtained from SistematX, ligand- and structure-based virtual screening were performed. Afterward, these two methodologies were combined to select potential antichagasic SLs for the three parasitic forms of T. cruzi, establishing a possible mechanism of action.