دورية أكاديمية

Time- and Space-Efficient Algorithms for Least Median of Squares Regression.

التفاصيل البيبلوغرافية
العنوان: Time- and Space-Efficient Algorithms for Least Median of Squares Regression.
المؤلفون: Souvaine, Diane L., Steele, J. Michael
المصدر: Journal of the American Statistical Association; Sep87, Vol. 82 Issue 399, p795, 8p
مصطلحات موضوعية: DUALITY theory (Mathematics), LEAST squares, MEDIAN (Mathematics), MATHEMATICAL optimization, STATISTICS, ALGORITHMS, REGRESSION analysis, HYPERSPACE, STATISTICAL correlation, ESTIMATION theory, MATHEMATICAL statistics
مستخلص: The least median of squared residuals regression line (or LMS line) is that line y = ax + b for which the median of the residuals |y[sub 1] - ax[sub 1] - b|[sup 2] is minimized over all choices of a and b. If we rephrase the traditional ordinary least squares (OLS) problem as finding the a and b that minimize the mean of |y[sub 1] - ax[sub 1] - b|[sup 2], an see that in a formal sense LMS just replaces a "mean" by a "median." This way of describing LMS regression does not do justice to the remarkable properties of LMS. In fact, LMS regression behaves in ways that distinguish it greatly from OLS as well as from many other methods for robustifying OLS (see, e.g., Rousseeuw 1984). As illustrations given here show, the LMS regression line should provide a valuable tool for studying those data sets in which the usual linear model assumptions are violated by the presence of some (not too small) groups of data values that behave distinctly from the bulk of the data. This feature of LMS regression is illustrated by the fit given in Figure 1 and the residual plots of Figures 2a and 2b. The LMS regression line is an attractive tool for data analysis, but it is not easy to compute. Steele and Steiger (1986) established that the function f(a, b) = median {|y[sub l] - ax[sub l] - b|[sup 2]} have on the order of n[sup 2] local minima, so typical local methods have little hope of finding the global minimum of f. The main objective of this article is to provide algorithms that do minimize f and are efficient in terms of both time and space. Two algorithms are given here that determine the LMS regression line for n points in the plane. Both of these algorithms draw their strength from the systematic use of affine duality, and one objective pursued here is the exposition of the technique of affine duality so that it will become more commonly considered by... [ABSTRACT FROM AUTHOR]
Copyright of Journal of the American Statistical Association is the property of Taylor & Francis Ltd and its content may not be copied or emailed to multiple sites or posted to a listserv without the copyright holder's express written permission. However, users may print, download, or email articles for individual use. This abstract may be abridged. No warranty is given about the accuracy of the copy. Users should refer to the original published version of the material for the full abstract. (Copyright applies to all Abstracts.)
قاعدة البيانات: Complementary Index
الوصف
تدمد:01621459
DOI:10.1080/01621459.1987.10478500