A principal component regression model for predicting phytochemical binding to the H. pylori CagA protein
College of Science
Network Modeling Analysis in Health Informatics and Bioinformatics
Helicobater pylori is an important causative factor in the pathogenesis of multiple gastrointestinal diseases. One of the factors responsible for the virulence of H. pylori is the cagA protein, which can interfere with a number of cellular signaling processes once this protein is transferred inside the host cell. Thus, inhibiting the interaction of the cagA protein with the host cell membrane using small molecular inhibitors appears to be a promising pharmacological strategy. In this study, a predictive model for the binding free energy of natural compounds towards the cagA protein is presented. The formulated model which is built on principal component—multiple linear regression demonstrates reliable accuracy (r2test = 0.92, RMSEtest = 0.483), while only requiring five independent variables for the prediction. It was further noted that topological descriptors had the greatest influence on the generated principal components which served as the predictors. The created regression model can help promote and accelerate the discovery of natural compounds as cagA binders for the development of anti-H. pylori agents. © 2020, Springer-Verlag GmbH Austria, part of Springer Nature.
Digitial Object Identifier (DOI)
Janairo, J. B. (2020). A principal component regression model for predicting phytochemical binding to the H. pylori CagA protein. Network Modeling Analysis in Health Informatics and Bioinformatics, 9 (1) https://doi.org/10.1007/s13721-020-00252-9
Biology | Chemistry
Molecules—Computer-aided design; Machine learning; Binding energy; Linear free energy relationship