Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/13499
DC FieldValueLanguage
dc.contributor.authorTolias, Kyriakos-
dc.contributor.authorChatzis, Sotirios P.-
dc.contributor.otherΧατζής, Σωτήριος Π.-
dc.date.accessioned2019-04-11T19:43:59Z-
dc.date.available2019-04-11T19:43:59Z-
dc.date.issued2018-12-25-
dc.identifier.citationIEEE Transactions on Neural Networks and Learning Systems, 2019, vol. 30, no. 8, pp. 2463-2467en_US
dc.identifier.issn21622388-
dc.description.abstractRecent advances in deep learning have brought to the fore models that can make multiple computational steps in the service of completing a task; these are capable of describing long-term dependencies in sequential data. Novel recurrent attention models over possibly large external memory modules constitute the core mechanisms that enable these capabilities. Our work addresses learning subtler and more complex underlying temporal dynamics in language modeling tasks that deal with sparse sequential data. To this end, we improve upon these recent advances by adopting concepts from the field of Bayesian statistics, namely, variational inference. Our proposed approach consists in treating the network parameters as latent variables with a prior distribution imposed over them. Our statistical assumptions go beyond the standard practice of postulating Gaussian priors. Indeed, to allow for handling outliers, which are prevalent in long observed sequences of multivariate data, multivariate t-exponential distributions are imposed. On this basis, we proceed to infer corresponding posteriors; these can be used for inference and prediction at test time, in a way that accounts for the uncertainty in the available sparse training data. Specifically, to allow for our approach to best exploit the merits of the t-exponential family, our method considers a new t-divergence measure, which generalizes the concept of the Kullback-Leibler divergence. We perform an extensive experimental evaluation of our approach, using challenging language modeling benchmarks, and illustrate its superiority over existing state-of-the-art techniques.en_US
dc.formatpdfen_US
dc.language.isoenen_US
dc.relation.ispartofIEEE transactions on neural networks and learning systemsen_US
dc.rights© IEEEen_US
dc.subjectBayes methodsen_US
dc.subjectComputational modelingen_US
dc.subjectData modelsen_US
dc.subjectHidden Markov modelsen_US
dc.subjectLanguage modelingen_US
dc.subjectMemory networks (MEM-NNs)en_US
dc.subjectt-exponential familyen_US
dc.subjectTask analysisen_US
dc.subjectTrainingen_US
dc.subjectUncertaintyen_US
dc.subjectVariational inferenceen_US
dc.titlet-Exponential Memory Networks for Question-Answering Machinesen_US
dc.typeArticleen_US
dc.collaborationCyprus University of Technologyen_US
dc.subject.categoryComputer and Information Sciencesen_US
dc.journalsSubscriptionen_US
dc.countryCyprusen_US
dc.subject.fieldEngineering and Technologyen_US
dc.publicationPeer Revieweden_US
dc.identifier.doi10.1109/TNNLS.2018.2884540en_US
dc.identifier.pmid30596586-
dc.relation.issue8en_US
dc.relation.volume30en_US
cut.common.academicyear2018-2019en_US
dc.identifier.spage2463en_US
dc.identifier.epage2467en_US
item.fulltextNo Fulltext-
item.cerifentitytypePublications-
item.grantfulltextnone-
item.openairecristypehttp://purl.org/coar/resource_type/c_6501-
item.openairetypearticle-
item.languageiso639-1en-
crisitem.journal.journalissn2162237X-
crisitem.journal.publisherIEEE-
crisitem.author.deptDepartment of Electrical Engineering, Computer Engineering and Informatics-
crisitem.author.facultyFaculty of Engineering and Technology-
crisitem.author.orcid0000-0002-4956-4013-
crisitem.author.parentorgFaculty of Engineering and Technology-
Appears in Collections:Άρθρα/Articles
CORE Recommender
Show simple item record

SCOPUSTM   
Citations

3
checked on Mar 14, 2024

WEB OF SCIENCETM
Citations 50

3
Last Week
0
Last month
0
checked on Nov 1, 2023

Page view(s)

327
Last Week
2
Last month
10
checked on May 12, 2024

Google ScholarTM

Check

Altmetric


Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.