Please use this identifier to cite or link to this item:
https://hdl.handle.net/20.500.14279/9272
Title: | Age interval and gender prediction using PARAFAC2 applied to speech utterances | Authors: | Pantraki, Evangelia Kotropoulos, Constantine L. Lanitis, Andreas |
metadata.dc.contributor.other: | Λανίτης, Ανδρέας | Major Field of Science: | Natural Sciences | Field Category: | Computer and Information Sciences | Keywords: | PARAFAC2;Speaker ageing;Speaker biometrics | Issue Date: | 7-Apr-2016 | Source: | 4th International Workshop on Biometrics and Forensics, 2016, Limassol, Cyprus | DOI: | 10.1109/IWBF.2016.7449694 | Abstract: | Important problems in speech soft biometrics include the prediction of speaker's age or gender. Here, the aforementioned problems are addressed in the context of utterances collected during a long time period. A unified framework for age and gender prediction is proposed based on Parallel Factor Analysis 2 (PARAFAC2). PARAFAC2 is applied to a collection of three matrices, namely the speech utterance-feature matrix whose columns are the auditory cortical representations, the speaker age matrix whose columns are indicator vectors of suitable dimension, and the speaker gender matrix whose columns are proper indicator vectors associated to speaker's gender. PARAFAC2 is able to reduce the dimensionality of the auditory cortical representations by projecting these representations onto a semantic space dominated by the age and the gender concepts, yielding a sketch (i.e., a feature vector of reduced dimensions). To predict speaker's age interval associated to a test utterance, the speech utterance sketch is pre-multiplied by the left singular vectors of the speaker age matrix. To predict the gender of the speaker who uttered any test utterance, the speech utterance sketch is pre-multiplied by the left singular vectors of the speaker gender matrix. In both cases, a ranking vector is obtained that is exploited for decision making. Promising results are demonstrated, when the aforementioned framework is applied to the Trinity College Dublin Speaker Ageing Database. | URI: | https://hdl.handle.net/20.500.14279/9272 | ISBN: | 978-146739448-2 | Rights: | © 2016 IEEE. | Type: | Conference Papers | Affiliation : | Aristotle University of Thessaloniki Cyprus University of Technology |
Publication Type: | Peer Reviewed |
Appears in Collections: | Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation |
CORE Recommender
Page view(s) 20
495
Last Week
0
0
Last month
2
2
checked on Dec 22, 2024
Google ScholarTM
Check
Altmetric
Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.