AllerStat: Finding Statistically Significant Allergen-Specific Patterns in Protein Sequences by Machine Learning

التفاصيل البيبلوغرافية
العنوان: AllerStat: Finding Statistically Significant Allergen-Specific Patterns in Protein Sequences by Machine Learning
المؤلفون: Kento Goto, Ichiro Takeuchi, Takumi Yoshida, Norimasa Tamehiro, Kazunari Kondo, Takuto Sakuma, Reiko Adachi, Hiroyuki Hanada
بيانات النشر: Cold Spring Harbor Laboratory, 2021.
سنة النشر: 2021
مصطلحات موضوعية: Synthetic biology, Allergen, Genome editing, Extant taxon, medicine, Brute-force search, Computational biology, Biology, medicine.disease_cause
الوصف: Cutting-edge technologies such as genome editing and synthetic biology allow us to produce novel foods and functional proteins. However, their toxicity and allergenicity must be accurately evaluated. Allergic reactions are caused by specific amino-acid sequences in proteins (Allergen Specific Patterns, ASPs), of which, many remain undiscovered. In this study, we introduce a data-driven approach and a machine-learning (ML) method to find undiscovered ASPs. The proposed method enables an exhaustive search for amino-acid sub-sequences whose frequencies are statistically significantly higher in allergenic proteins. As a proof-of-concept (PoC), we created a database containing 21,154 proteins of which the presence or absence allergic reactions are already known, and the proposed method was applied to the database. The detected ASPs in the PoC study were consistent with known biological findings, and the allergenicity prediction accuracy using the detected ASPs was higher than extant approaches.TeaserWe propose a computational method for finding statistically significant allergen-specific amino-acid sequences in proteins.
الوصول الحر: https://explore.openaire.eu/search/publication?articleId=doi_________::75dab28c791a5deb02a76ba26708ba7eTest
https://doi.org/10.1101/2021.08.17.456743Test
حقوق: OPEN
رقم الانضمام: edsair.doi...........75dab28c791a5deb02a76ba26708ba7e
قاعدة البيانات: OpenAIRE