Repository logoCyprus University of Technology
Log In(current)
Ελληνικά
English
  1. Home
  2. Cyprus University of Technology (Research Output)
  3. Δημοσιεύσεις σε συνέδρια /Conference papers or poster or presentation
  4. Stochastic Transformer Networks With Linear Competing Units: Application To End-to-End SL Translation
  • Details

Stochastic Transformer Networks With Linear Competing Units: Application To End-to-End SL Translation

Date Issued
October 10, 2021
Author(s)
Voskou, Andreas  
Panousis, Konstantinos P.  
Kosmopoulos, Dimitrios I.  
Metaxas, Dimitris  
Chatzis, Sotirios P.  
DOI
10.1109/ICCV48922.2021.01173
Abstract
Automating sign language translation (SLT) is a challenging real-world application. Despite its societal importance, though, research progress in the field remains rather poor. Crucially, existing methods that yield viable performance necessitate the availability of laborious to obtain gloss sequence groundtruth. In this paper, we attenuate this need, by introducing an end-to-end SLT model that does not entail explicit use of glosses; the model only needs text groundtruth. This is in stark contrast to existing end-to-end models that use gloss sequence groundtruth, either in the form of a modality that is recognized at an intermediate model stage, or in the form of a parallel output process, jointly trained with the SLT model. Our approach constitutes a Transformer network with a novel type of layers that combines: (i) local winner-takes-all (LWTA) layers with stochastic winner sampling, instead of conventional ReLU layers, (ii) stochastic weights with posterior distributions estimated via variational inference, and (iii) a weight compression technique at inference time that exploits estimated posterior variance to perform massive, almost lossless compression. We demonstrate that our approach can reach the currently best reported BLEU-4 score on the PHOENIX 2014T benchmark, but without making use of glosses for model training, and with a memory footprint reduced by more than 70%.
Funding(s)
aRTIFICIAL iNTELLIGENCE for the Deaf (aiD)  
Subjects

Memory management

Stochastic processes

Gesture recognition

Benchmark testing

Assistive technologie...

Machine learning arch...

Representation learni...

Vision + language

File(s)
Thumbnail Image
Name

Stochastic Transformer Networks.pdf

Size

690.84 KB

Format

Adobe PDF

Checksum (MD5)

f346cafdb15dd4463e4c1dda97e99615

Explore by
  • Collections
  • Research Outputs
  • Researchers
  • Faculty & Departments
  • Theses
  • Patents
  • Projects
  • Journals
  • Conferences
Useful Links
  • Researcher Portfolio Guide
  • Researcher Profile
  • Create an ORCID ID
  • CUT Open Access Author Fund
  • ETDS Guide
Copyright Policies

Use Sherpa/Romeo to find publisher copyright policies

Go
Go
  • SPARC Author Addendum Engine
  • National Open Access Policy in Cyprus
Deposit your work to Ktisis
  • Self-archiving. Please sign in to Ktisis.
  • Email your work to:
    library.dspace@cut.ac.cy
  • Contact your subject librarian

Member of

OpenAIREre3dataOpenDOARCOREDART
Cyprus University of Technology
Library and
Information
Services

Copyright © 2022 - Library and Information Services Feedback - Built with DSpace-CRIS - 4Science

  • Accessibility settings
  • Privacy policy
  • End User Agreement
COAR NotifyCOAR Notify