Cost-based Feature Selection for Network Model Choice
العنوان: | Cost-based Feature Selection for Network Model Choice |
---|---|
المؤلفون: | Louis Raynal, Till Hoffmann, Jukka-Pekka Onnela |
المصدر: | Journal of Computational and Graphical Statistics. :1-10 |
بيانات النشر: | Informa UK Limited, 2023. |
سنة النشر: | 2023 |
مصطلحات موضوعية: | Methodology (stat.ME), FOS: Computer and information sciences, Statistics and Probability, Statistics - Machine Learning, Discrete Mathematics and Combinatorics, Machine Learning (stat.ML), Statistics, Probability and Uncertainty, Statistics - Computation, Statistics - Methodology, Computation (stat.CO) |
الوصف: | Selecting a small set of informative features from a large number of possibly noisy candidates is a challenging problem with many applications in machine learning and approximate Bayesian computation. In practice, the cost of computing informative features also needs to be considered. This is particularly important for networks because the computational costs of individual features can span several orders of magnitude. We addressed this issue for the network model selection problem using two approaches. First, we adapted nine feature selection methods to account for the cost of features. We show for two classes of network models that the cost can be reduced by two orders of magnitude without considerably affecting classification accuracy (proportion of correctly identified models). Second, we selected features using pilot simulations with smaller networks. This approach reduced the computational cost by a factor of 50 without affecting classification accuracy. To demonstrate the utility of our approach, we applied it to three different yeast protein interaction networks and identified the best-fitting duplication divergence model. Comment: 34 pages, 6 figures |
تدمد: | 1537-2715 1061-8600 |
الوصول الحر: | https://explore.openaire.eu/search/publication?articleId=doi_dedup___::c2c71080a6a419feecac796f697e36f7Test https://doi.org/10.1080/10618600.2022.2151453Test |
حقوق: | OPEN |
رقم الانضمام: | edsair.doi.dedup.....c2c71080a6a419feecac796f697e36f7 |
قاعدة البيانات: | OpenAIRE |
تدمد: | 15372715 10618600 |
---|