Efficient algorithms for computing the best subset regression models for large-scale problems

Hofmann, Marc H.; Gatu, Cristian; Kontoghiorghes, Erricos John

doi:10.1016/j.csda.2007.03.017

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/2017

DC Field	Value	Language
dc.contributor.author	Hofmann, Marc H.	-
dc.contributor.author	Gatu, Cristian	-
dc.contributor.author	Kontoghiorghes, Erricos John	-
dc.date.accessioned	2013-01-28T13:17:42Z	en
dc.date.accessioned	2013-05-16T08:22:27Z	-
dc.date.accessioned	2015-12-02T09:33:27Z	-
dc.date.available	2013-01-28T13:17:42Z	en
dc.date.available	2013-05-16T08:22:27Z	-
dc.date.available	2015-12-02T09:33:27Z	-
dc.date.issued	2007-09-15	-
dc.identifier.citation	Computational Statistics and Data Analysis, 2007, vol. 52, no. 1, pp. 16-29.	en_US
dc.identifier.issn	1679473	-
dc.identifier.uri	https://hdl.handle.net/20.500.14279/2017	-
dc.description.abstract	Several strategies for computing the best subset regression models are proposed. Some of the algorithms are modified versions of existing regression-tree methods, while others are new. The first algorithm selects the best subset models within a given size range. It uses a reduced search space and is found to outperform computationally the existing branch-and-bound algorithm. The properties and computational aspects of the proposed algorithm are discussed in detail. The second new algorithm preorders the variables inside the regression tree. A radius is defined in order to measure the distance of a node from the root of the tree. The algorithm applies the preordering to all nodes which have a smaller distance than a certain radius that is given a priori. An efficient method of preordering the variables is employed. The experimental results indicate that the algorithm performs best when preordering is employed on a radius of between one quarter and one third of the number of variables. The algorithm has been applied with such a radius to tackle large-scale subset-selection problems that are considered to be computationally infeasible by conventional exhaustive-selection methods. A class of new heuristic strategies is also proposed. The most important of these is one that assigns a different tolerance value to each subset model size. This strategy with different kind of tolerances is equivalent to all exhaustive and heuristic subset-selection strategies. In addition the strategy can be used to investigate submodels having noncontiguous size ranges. Its implementation provides a flexible tool for tackling large scale models.	en_US
dc.format	pdf	en_US
dc.language.iso	en	en_US
dc.relation.ispartof	Computational Statistics and Data Analysis	en_US
dc.rights	© Elsevier	en_US
dc.rights.uri	http://creativecommons.org/licenses/by-nc-nd/3.0/us/	*
dc.subject	Branch and bound algorithms	en_US
dc.subject	Decision trees	en_US
dc.subject	Regression analysis	en_US
dc.subject	Mathematical models	en_US
dc.title	Efficient algorithms for computing the best subset regression models for large-scale problems	en_US
dc.type	Article	en_US
dc.affiliation	Cyprus University of Technology	en
dc.collaboration	University of Cyprus	en_US
dc.journals	Open Access	en_US
dc.country	Cyprus	en_US
dc.subject.field	Social Sciences	en_US
dc.publication	Peer Reviewed	en_US
dc.identifier.doi	10.1016/j.csda.2007.03.017	en_US
dc.dept.handle	123456789/54	en
dc.relation.issue	1	en_US
dc.relation.volume	52	en_US
cut.common.academicyear	2007-2008	en_US
dc.identifier.spage	16	en_US
dc.identifier.epage	29	en_US
item.grantfulltext	none	-
item.languageiso639-1	en	-
item.cerifentitytype	Publications	-
item.openairecristype	http://purl.org/coar/resource_type/c_6501	-
item.openairetype	article	-
item.fulltext	No Fulltext	-
crisitem.journal.journalissn	0167-9473	-
crisitem.journal.publisher	Elsevier	-
crisitem.author.dept	Department of Finance, Accounting and Management Science	-
crisitem.author.faculty	Faculty of Tourism Management, Hospitality and Entrepreneurship	-
crisitem.author.orcid	0000-0001-9704-9510	-
crisitem.author.parentorg	Faculty of Management and Economics	-
Appears in Collections:	Άρθρα/Articles

CORE Recommender

Show simple item record

SCOPUS^TM
Citations

41

checked on Nov 9, 2023

WEB OF SCIENCE^TM
Citations 50

36

Last Week
0

Last month
0

checked on Oct 29, 2023

Page view(s)

551

Last Week
0

Last month
4

checked on Nov 6, 2024

Google Scholar^TM

Check

Altmetric

This item is licensed under a Creative Commons License

SCOPUSTM Citations

WEB OF SCIENCETM Citations 50

Page view(s)

Google ScholarTM

Altmetric

SCOPUS^TM
Citations

WEB OF SCIENCE^TM
Citations 50

Google Scholar^TM