Sparse bayesian recurrent neural networks

Chatzis, Sotirios P.

Παρακαλώ χρησιμοποιήστε αυτό το αναγνωριστικό για να παραπέμψετε ή να δημιουργήσετε σύνδεσμο προς αυτό το τεκμήριο: https://hdl.handle.net/20.500.14279/8190

Τίτλος:	Sparse bayesian recurrent neural networks
Συγγραφείς:	Chatzis, Sotirios P.
metadata.dc.contributor.other:	Χατζής, Σωτήριος Π.
Major Field of Science:	Natural Sciences
Field Category:	Computer and Information Sciences
Λέξεις-κλειδιά:	Recurrent neural networks;RNN;Bayesian regression
Ημερομηνία Έκδοσης:	2015
Πηγή:	European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases, 2015, Porto, Portugal, 7-11 September
Conference:	European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases
Περίληψη:	Recurrent neural networks (RNNs) have recently gained renewed attention from the machine learning community as effective methods for modeling variable-length sequences. Language modeling, handwriting recognition, and speech recognition are only few of the application domains where RNN-based models have achieved the state-of- the-art performance currently reported in the literature. Typically, RNN architectures utilize simple linear, logistic, or softmax output layers to perform data modeling and prediction generation. In this work, for the first time in the literature, we consider using a sparse Bayesian regression or classification model as the output layer of RNNs, inspired from the automatic relevance determination (ARD) technique. The notion of ARD is to continually create new components while detecting when a component starts to overfit, where overfit manifests itself as a precision hyperparameter posterior tending to infinity. This way, our method manages to train sparse RNN models, where the number of effective (“active”) recurrently connected hidden units is selected in a data-driven fashion, as part of the model inference procedure. We develop efficient and scalable training algorithms for our model under the stochastic variational inference paradigm, and derive elegant predictive density expressions with computational costs comparable to conventional RNN formulations. We evaluate our approach considering its application to challenging tasks dealing with both regression and classification problems, and exhibit its favorable performance over the state-of-the-art.
URI:	https://hdl.handle.net/20.500.14279/8190
Type:	Conference Papers
Affiliation:	Cyprus University of Technology
Εμφανίζεται στις συλλογές:	Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation

Αρχεία σε αυτό το τεκμήριο:

Αρχείο	Περιγραφή	Μέγεθος	Μορφότυπος
Chatzis.pdf		176.51 kB	Adobe PDF	Δείτε/ Ανοίξτε

CORE Recommender

Δείξε την πλήρη περιγραφή του τεκμηρίου

Page view(s) 50

398

Last Week
1

Last month
3

checked on 21 Νοε 2024

Download(s) 50

555

checked on 21 Νοε 2024

Google Scholar^TM

Check

Όλα τα τεκμήρια του δικτυακού τόπου προστατεύονται από πνευματικά δικαιώματα

Αρχεία σε αυτό το τεκμήριο:

Page view(s) 50

Download(s) 50

Google ScholarTM

Google Scholar^TM