Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14279/10120
Title: | Deep network regularization via bayesian inference of synaptic connectivity | Authors: | Partaourides, Charalampos Chatzis, Sotirios P. |
metadata.dc.contributor.other: | Παρταουρίδης, Χαράλαμπος Χατζής, Σωτήριος |
Major Field of Science: | Engineering and Technology | Field Category: | Electrical Engineering - Electronic Engineering - Information Engineering | Keywords: | Bayesian networks;Data mining;Hierarchical systems;Inference engines;Iterative methods | Issue Date: | 1-Jan-2017 | Source: | 21st Pacific-Asia Conference on Knowledge Discovery and Data Mining, Jeju, South Korea, 23 -26 May 2017 | Conference: | Pacific-Asia Conference on Knowledge Discovery and Data Mining | Abstract: | Deep neural networks (DNNs) often require good regularizers to generalize well. Currently, state-of-the-art DNN regularization techniques consist in randomly dropping units and/or connections on each iteration of the training algorithm. Dropout and DropConnect are characteristic examples of such regularizers, that are widely popular among practitioners. However, a drawback of such approaches consists in the fact that their postulated probability of random unit/connection omission is a constant that must be heuristically selected based on the obtained performance in some validation set. To alleviate this burden, in this paper we regard the DNN regularization problem from a Bayesian inference perspective: We impose a sparsity-inducing prior over the network synaptic weights, where the sparsity is induced by a set of Bernoulli-distributed binary variables with Beta (hyper-)priors over their prior parameters. This way, we eventually allow for marginalizing over the DNN synaptic connectivity for output generation, thus giving rise to an effective, heuristics-free, network regularization scheme. We perform Bayesian inference for the resulting hierarchical model by means of an efficient Black-Box Variational inference scheme. We exhibit the advantages of our method over existing approaches by conducting an extensive experimental evaluation using benchmark datasets. | Description: | Advances in Knowledge Discovery and Data Mining, 2017, Pages 30-41 Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) Volume 10234 LNAI, 2017, Pages 30-41 | ISBN: | 9783319574530 | DOI: | 10.1007/978-3-319-57454-7_3 | Rights: | © 2017, Springer | Type: | Conference Papers | Affiliation : | Cyprus University of Technology | Publication Type: | Peer Reviewed |
Appears in Collections: | Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation |
CORE Recommender
SCOPUSTM
Citations
3
checked on Nov 6, 2023
Page view(s) 50
411
Last Week
0
0
Last month
1
1
checked on Jan 7, 2025
Google ScholarTM
Check
Altmetric
Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.