Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14279/1977
Title: | Graph-based strategies for performing the exhaustive and random k-fold cross-validations | Authors: | Yanev, Petko I. Kontoghiorghes, Erricos John |
Major Field of Science: | Natural Sciences | Field Category: | Computer and Information Sciences | Keywords: | Downdating;Model evaluation;Model selection;QR decomposition;Resampling;Updating | Issue Date: | 2009 | Source: | Journal of Computational and Graphical Statistics, 2009, vol. 18, no. 4, pp. 894-914 | Volume: | 18 | Issue: | 4 | Start page: | 894 | End page: | 914 | Journal: | Journal of Computational and Graphical Statistics | Abstract: | An efficient graph-based strategy for performing the exhaustive k-fold crossvalidation procedure is proposed. All training (and testing) subsets are presented as nodes of a complete weighted graph. The arcs between the nodes indicate the different possibilities for deriving the solution of the destination node given the solution of the source node. The weights of the arcs represent the complexities of (the numerical operations involved in) updating and downdating the corresponding data matrices. The complete graph with arcs connecting every pair of nodes is defined and its properties are investigated. The optimum way of performing the exhaustive k-fold cross-validation is equivalent in deriving the path within the graph that has the minimum computational complexity. Furthermore, a generalization of the complete k-fold cross-validation graph is used to derive new strategies for performing random k-fold cross-validations. The proposed strategies generate additional nodes during the computations, which are part of the generalized graph. The additional nodes represent new models which have not been required initially, but provide additional information about the evaluated model. The advantages and the drawbacks of the proposed strategies are discussed. Numerical results are presented and analyzed. Finally the computation of all nearest neighbors of a given node is also considered. The Fortran 90 source code for the algorithms in the manuscript is available on-line. | URI: | https://hdl.handle.net/20.500.14279/1977 | ISSN: | 15372715 | DOI: | 10.1198/jcgs.2009.08019 | Rights: | © Taylor & Francis | Type: | Article | Affiliation: | Cyprus University of Technology | Affiliation : | Université de Neuchâtel University of Cyprus University of London |
Publication Type: | Peer Reviewed |
Appears in Collections: | Άρθρα/Articles |
CORE Recommender
SCOPUSTM
Citations
2
checked on Nov 9, 2023
WEB OF SCIENCETM
Citations
50
2
Last Week
0
0
Last month
0
0
checked on Oct 29, 2023
Page view(s) 10
532
Last Week
0
0
Last month
1
1
checked on Nov 7, 2024
Google ScholarTM
Check
Altmetric
Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.