Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14279/2093
Title: | Branch-and-bound algorithms for computing the best-subset regression models | Authors: | Gatu, Cristian Kontoghiorghes, Erricos John |
metadata.dc.contributor.other: | Κοντογιώργης, Έρρικος Γιάννης | Major Field of Science: | Natural Sciences | Field Category: | Computer and Information Sciences | Keywords: | Least squares;Algorithms | Issue Date: | Mar-2006 | Source: | Journal of Computational and Graphical Statistics, 2006, vol. 15, no. 1, pp. 139-156 | Volume: | 15 | Issue: | 1 | Start page: | 139 | End page: | 156 | Journal: | Journal of Computational and Graphical Statistics | Abstract: | An efficient branch-and-bound algorithm for computing the best-subset regression models is proposed. The algorithm avoids the computation of the whole regression tree that generates all possible subset models. It is formally shown that if the branch-and-bound test holds, then the current subtree together with its right-hand side subtrees are cut. This reduces significantly the computational burden of the proposed algorithm when compared to an existing leaps-and-bounds method which generates two trees. Specifically, the proposed algorithm, which is based on orthogonal transformations, outperforms by O(n 3) the leaps-and-bounds strategy. The criteria used in identifying the best subsets are based on monotone functions of the residual sum of squares (RSS) such as R 2, adjusted R 2, mean square error of prediction, and C p. Strategies and heuristics that improve the computational performance of the proposed algorithm are investigated. A computationally efficient heuristic version of the branch-and-bound strategy which decides to cut subtrees using a tolerance parameter is proposed. The heuristic algorithm derives models close to the best ones. However, it is shown analytically that the relative error of the RSS, and consequently the corresponding statistic, of the computed subsets is smaller than the value of the tolerance parameter which lies between zero and one. Computational results and experiments on random and real data are presented and analyzed. | URI: | https://hdl.handle.net/20.500.14279/2093 | ISSN: | 15372715 | DOI: | 10.1198/106186006X100290 | Rights: | © American Statistical Association. | Type: | Article | Affiliation : | University of Cyprus University of London |
Publication Type: | Peer Reviewed |
Appears in Collections: | Άρθρα/Articles |
CORE Recommender
SCOPUSTM
Citations
55
checked on Nov 9, 2023
WEB OF SCIENCETM
Citations
50
52
Last Week
0
0
Last month
0
0
checked on Oct 29, 2023
Page view(s) 5
602
Last Week
0
0
Last month
6
6
checked on Dec 22, 2024
Google ScholarTM
Check
Altmetric
Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.