Emotion recognition and synthesis based on MPEG-4 FAPs
Date Issued
July 2002
Abstract
In the framework of MPEG-4 hybrid coding of natural and synthetic data streams,
one can include teleconferencing and telepresence applications, in which a synthetic
proxy or a virtual agent is capable of substituting the actual user. Such agents can
interact with each other, analyzing input textual data entered by the user and multisensory
data, including human emotions, facial expressions and nonverbal speech.
This not only enhances interactivity, by replacing single media representations with
dynamic multimedia renderings, but also assists human–computer interaction issues,
letting the system become accustomed to the current needs and feelings of the user.
Actual application of this technology [1] is expected in educational environments, 3-D
videoconferencing and collaborative workplaces, online shopping and gaming, virtual
communities and interactive entertainment. Facial expression synthesis and animation
has gained much interest within the MPEG-4 framework; explicit facial animation
parameters (FAPs) have been dedicated to this purpose. However, FAP implementation
is an open research area [2]. In this chapter we describe a method for generating
emotionally enriched human–computer interaction, focusing on analysis and synthesis
of primary [3] and intermediate facial expressions [4]. To achieve this goal we utilize
both MPEG-4 facial definition parameters (FDPs) and FAPs. The contribution of
the work is twofold: it proposes a way of modeling primary expressions using FAPs
and it describes a rule-based technique for analyzing both archetypal and intermediate
expressions; for the latter we propose an innovative model generation framework. In
particular, a relation between FAPs and the activation parameter proposed in classical
psychological studies is established, extending the archetypal expression studies that
the computer society has concentrated on. The overall scheme leads to a parameterized approach to facial expression synthesis that is compatible with the MPEG-4 standard
and can be used for emotion understanding.
one can include teleconferencing and telepresence applications, in which a synthetic
proxy or a virtual agent is capable of substituting the actual user. Such agents can
interact with each other, analyzing input textual data entered by the user and multisensory
data, including human emotions, facial expressions and nonverbal speech.
This not only enhances interactivity, by replacing single media representations with
dynamic multimedia renderings, but also assists human–computer interaction issues,
letting the system become accustomed to the current needs and feelings of the user.
Actual application of this technology [1] is expected in educational environments, 3-D
videoconferencing and collaborative workplaces, online shopping and gaming, virtual
communities and interactive entertainment. Facial expression synthesis and animation
has gained much interest within the MPEG-4 framework; explicit facial animation
parameters (FAPs) have been dedicated to this purpose. However, FAP implementation
is an open research area [2]. In this chapter we describe a method for generating
emotionally enriched human–computer interaction, focusing on analysis and synthesis
of primary [3] and intermediate facial expressions [4]. To achieve this goal we utilize
both MPEG-4 facial definition parameters (FDPs) and FAPs. The contribution of
the work is twofold: it proposes a way of modeling primary expressions using FAPs
and it describes a rule-based technique for analyzing both archetypal and intermediate
expressions; for the latter we propose an innovative model generation framework. In
particular, a relation between FAPs and the activation parameter proposed in classical
psychological studies is established, extending the archetypal expression studies that
the computer society has concentrated on. The overall scheme leads to a parameterized approach to facial expression synthesis that is compatible with the MPEG-4 standard
and can be used for emotion understanding.
Subjects
File(s)![Thumbnail Image]()
Name
Tsapatsoulis.pdf
Size
652.73 KB
Format
Adobe PDF
Checksum (MD5)
56d5363d4db003843b53638be5f660af

