t-Exponential Memory Networks for Question-Answering Machines

Tolias, Kyriakos; Chatzis, Sotirios P.

doi:10.1109/TNNLS.2018.2884540

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/13499

DC Field	Value	Language
dc.contributor.author	Tolias, Kyriakos	-
dc.contributor.author	Chatzis, Sotirios P.	-
dc.contributor.other	Χατζής, Σωτήριος Π.	-
dc.date.accessioned	2019-04-11T19:43:59Z	-
dc.date.available	2019-04-11T19:43:59Z	-
dc.date.issued	2018-12-25	-
dc.identifier.citation	IEEE Transactions on Neural Networks and Learning Systems, 2019, vol. 30, no. 8, pp. 2463-2467	en_US
dc.identifier.issn	21622388	-
dc.description.abstract	Recent advances in deep learning have brought to the fore models that can make multiple computational steps in the service of completing a task; these are capable of describing long-term dependencies in sequential data. Novel recurrent attention models over possibly large external memory modules constitute the core mechanisms that enable these capabilities. Our work addresses learning subtler and more complex underlying temporal dynamics in language modeling tasks that deal with sparse sequential data. To this end, we improve upon these recent advances by adopting concepts from the field of Bayesian statistics, namely, variational inference. Our proposed approach consists in treating the network parameters as latent variables with a prior distribution imposed over them. Our statistical assumptions go beyond the standard practice of postulating Gaussian priors. Indeed, to allow for handling outliers, which are prevalent in long observed sequences of multivariate data, multivariate t-exponential distributions are imposed. On this basis, we proceed to infer corresponding posteriors; these can be used for inference and prediction at test time, in a way that accounts for the uncertainty in the available sparse training data. Specifically, to allow for our approach to best exploit the merits of the t-exponential family, our method considers a new t-divergence measure, which generalizes the concept of the Kullback-Leibler divergence. We perform an extensive experimental evaluation of our approach, using challenging language modeling benchmarks, and illustrate its superiority over existing state-of-the-art techniques.	en_US
dc.format	pdf	en_US
dc.language.iso	en	en_US
dc.relation.ispartof	IEEE transactions on neural networks and learning systems	en_US
dc.rights	© IEEE	en_US
dc.subject	Bayes methods	en_US
dc.subject	Computational modeling	en_US
dc.subject	Data models	en_US
dc.subject	Hidden Markov models	en_US
dc.subject	Language modeling	en_US
dc.subject	Memory networks (MEM-NNs)	en_US
dc.subject	t-exponential family	en_US
dc.subject	Task analysis	en_US
dc.subject	Training	en_US
dc.subject	Uncertainty	en_US
dc.subject	Variational inference	en_US
dc.title	t-Exponential Memory Networks for Question-Answering Machines	en_US
dc.type	Article	en_US
dc.collaboration	Cyprus University of Technology	en_US
dc.subject.category	Computer and Information Sciences	en_US
dc.journals	Subscription	en_US
dc.country	Cyprus	en_US
dc.subject.field	Engineering and Technology	en_US
dc.publication	Peer Reviewed	en_US
dc.identifier.doi	10.1109/TNNLS.2018.2884540	en_US
dc.identifier.pmid	30596586	-
dc.relation.issue	8	en_US
dc.relation.volume	30	en_US
cut.common.academicyear	2018-2019	en_US
dc.identifier.spage	2463	en_US
dc.identifier.epage	2467	en_US
item.openairetype	article	-
item.cerifentitytype	Publications	-
item.fulltext	No Fulltext	-
item.grantfulltext	none	-
item.openairecristype	http://purl.org/coar/resource_type/c_6501	-
item.languageiso639-1	en	-
crisitem.author.dept	Department of Electrical Engineering, Computer Engineering and Informatics	-
crisitem.author.faculty	Faculty of Engineering and Technology	-
crisitem.author.orcid	0000-0002-4956-4013	-
crisitem.author.parentorg	Faculty of Engineering and Technology	-
crisitem.journal.journalissn	2162237X	-
crisitem.journal.publisher	IEEE	-
Appears in Collections:	Άρθρα/Articles

CORE Recommender

Show simple item record

SCOPUS^TM
Citations

3

checked on Mar 14, 2024

WEB OF SCIENCE^TM
Citations 50

3

Last Week
0

Last month
0

checked on Nov 1, 2023

Page view(s)

355

Last Week
0

Last month
1

checked on Jan 30, 2025

Google Scholar^TM

Check

SCOPUSTM Citations

WEB OF SCIENCETM Citations 50

Page view(s)

Google ScholarTM

Altmetric

SCOPUS^TM
Citations

WEB OF SCIENCE^TM
Citations 50

Google Scholar^TM