A mixed integer linear model for clustering with variable selection.

التفاصيل البيبلوغرافية
العنوان: A mixed integer linear model for clustering with variable selection.
المؤلفون: Benati, Stefano1 Stefano.Benati@unitn.it, García, Sergio2 sergio.garcia-quiles@ed.ac.uk
المصدر: Computers & Operations Research. Mar2014, Vol. 43, p280-285. 6p.
مصطلحات موضوعية: *MATHEMATICAL models, *FUNCTIONAL analysis, MIXED integer linear programming, MATHEMATICAL variables, MEDIAN (Mathematics), SET theory
مستخلص: Abstract: This paper introduces an extension of the p-median problem in which the distance function between units is calculated as the distance sum on the q most important variables out of a set of size m. This model has applications in cluster analysis (for example, in sociological surveys), where analysts have a large list of variables available for inclusion, but only a subset of them (true variables) is appropriate for uncovering the cluster structure. Therefore, researchers must carefully separate the true variables from the other before computing data partitions. Here we show that this problem can be formulated as a mixed integer non-linear optimization model where clustering and variable selection are done simultaneously. Then we provide two different linearizations and compare their performance with the default method of clustering with all the variables (which is a p-median model) on a set of artificially generated binary data, showing that the model based on a radius formulation performs the best. [Copyright &y& Elsevier]
Copyright of Computers & Operations Research is the property of Pergamon Press - An Imprint of Elsevier Science and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Business Source Index
الوصف
تدمد:03050548
DOI:10.1016/j.cor.2013.10.005