دورية أكاديمية
FWHT-RF: A Novel Computational Approach to Predict Plant Protein-Protein Interactions via an Ensemble Learning Method
العنوان: | FWHT-RF: A Novel Computational Approach to Predict Plant Protein-Protein Interactions via an Ensemble Learning Method |
---|---|
المؤلفون: | Jie Pan, Li-Ping Li, Chang-Qing Yu, Zhu-Hong You, Zhong-Hao Ren, Jing-Yu Tang |
المصدر: | Scientific Programming, Vol 2021 (2021) |
بيانات النشر: | Hindawi Limited |
سنة النشر: | 2021 |
المجموعة: | Directory of Open Access Journals: DOAJ Articles |
مصطلحات موضوعية: | Computer software, QA76.75-76.765 |
الوصف: | Protein-protein interactions (PPIs) in plants are crucial for understanding biological processes. Although high-throughput techniques produced valuable information to identify PPIs in plants, they are usually expensive, inefficient, and extremely time-consuming. Hence, there is an urgent need to develop novel computational methods to predict PPIs in plants. In this article, we proposed a novel approach to predict PPIs in plants only using the information of protein sequences. Specifically, plants’ protein sequences are first converted as position-specific scoring matrix (PSSM); then, the fast Walsh–Hadamard transform (FWHT) algorithm is used to extract feature vectors from PSSM to obtain evolutionary information of plant proteins. Lastly, the rotation forest (RF) classifier is trained for prediction and produced a series of evaluation results. In this work, we named this approach FWHT-RF because FWHT and RF are used for feature extraction and classification, respectively. When applying FWHT-RF on three plants’ PPI datasets Maize, Rice, and Arabidopsis thaliana (Arabidopsis), the average accuracies of FWHT-RF using 5-fold cross validation were achieved as high as 95.20%, 94.42%, and 83.85%, respectively. To further evaluate the predictive power of FWHT-RF, we compared it with the state-of-art support vector machine (SVM) and K-nearest neighbor (KNN) classifier in different aspects. The experimental results demonstrated that FWHT-RF can be a useful supplementary method to predict potential PPIs in plants. |
نوع الوثيقة: | article in journal/newspaper |
اللغة: | English |
تدمد: | 1058-9244 1875-919X |
العلاقة: | http://dx.doi.org/10.1155/2021/1607946Test; https://doaj.org/toc/1058-9244Test; https://doaj.org/toc/1875-919XTest; https://doaj.org/article/1536ae9ceac24dc6be74a7e3647c85b5Test |
DOI: | 10.1155/2021/1607946 |
الإتاحة: | https://doi.org/10.1155/2021/1607946Test https://doaj.org/article/1536ae9ceac24dc6be74a7e3647c85b5Test |
رقم الانضمام: | edsbas.B3FEA625 |
قاعدة البيانات: | BASE |
تدمد: | 10589244 1875919X |
---|---|
DOI: | 10.1155/2021/1607946 |