Academic Journal

C_CART: An instance confidence-based decision tree algorithm for classification.

Bibliographic Details
Title: C_CART: An instance confidence-based decision tree algorithm for classification.
Authors: Yu, Shuang1,2,3 (AUTHOR), Li, Xiongfei1,2 (AUTHOR), Wang, Hancheng4 (AUTHOR), Zhang, Xiaoli1,2 (AUTHOR) zhangxiaoli@jlu.edu.cn, Chen, Shiping3 (AUTHOR)
Source: Intelligent Data Analysis. 2021, Vol. 25 Issue 4, p929-948. 20p.
Subject Terms: *DECISION trees, *ALGORITHMS, CART algorithms, CLASSIFICATION algorithms, REGRESSION trees, CONFIDENCE
Abstract: In classification, the decision tree is a common model because of its simple structure and ease of interpretation. Most decision tree algorithms assume that all instances in a dataset have the same degree of confidence, so they use the same generation and pruning strategies for all training instances. In fact, instances with a greater degree of confidence are more useful than those with a lower degree of confidence in the same dataset. Therefore, instances should be treated differently according to their confidence degrees when training classifiers. In this paper, we investigate the impact and significance of the degree of confidence of instances on the classification performance of decision tree algorithms, taking the classification and regression tree (CART) algorithm as an example. First, the degree of confidence of instances is quantified from a statistical perspective. Then, a modified CART algorithm named C_CART is proposed by introducing the confidence of instances into the generation and pruning processes of the CART algorithm. Finally, we conduct experiments to evaluate the performance of the C_CART algorithm. The experimental results show that our C_CART algorithm can significantly improve generalization performance and avoid the over-fitting problem to a certain extent. [ABSTRACT FROM AUTHOR]
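Note: The abstract does not state how instance confidence is quantified or how it enters the CART generation step, so the following is only an illustrative sketch and not the authors' C_CART method. It assumes each training instance already carries a confidence value in [0, 1] and shows one plausible way such values could influence a split: a Gini impurity in which each instance counts in proportion to its confidence. The function names `weighted_gini` and `split_impurity` are hypothetical.

```python
from collections import defaultdict


def weighted_gini(labels, confidences):
    """Gini impurity where each instance contributes its confidence
    instead of a unit count (an assumed weighting scheme)."""
    total = sum(confidences)
    if total == 0:
        return 0.0
    mass = defaultdict(float)
    for label, conf in zip(labels, confidences):
        mass[label] += conf
    return 1.0 - sum((m / total) ** 2 for m in mass.values())


def split_impurity(left, right):
    """Confidence-weighted average impurity of a binary split.

    `left` and `right` are (labels, confidences) pairs for the two
    child nodes; a lower value indicates a purer split.
    """
    w_left, w_right = sum(left[1]), sum(right[1])
    total = w_left + w_right
    if total == 0:
        return 0.0
    return (w_left / total) * weighted_gini(*left) + (
        w_right / total
    ) * weighted_gini(*right)
```

Under this assumed scheme, an instance with confidence 0 has no influence on the impurity, and setting every confidence to 1 gives the ordinary Gini impurity used by standard CART.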
Copyright of Intelligent Data Analysis is the property of IOS Press and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
Database: Business Source Index
Description
ISSN: 1088-467X
DOI: 10.3233/IDA-205361