Competing Mutual Information Constraints with Stochastic Competition-Based Activations for Learning Diversified Representations

Panousis, Konstantinos P.; Antoniadis, Anastasios; Chatzis, Sotirios P.

Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/29868

Title:	Competing Mutual Information Constraints with Stochastic Competition-Based Activations for Learning Diversified Representations
Authors:	Panousis, Konstantinos P.; Antoniadis, Anastasios; Chatzis, Sotirios P.
Major Field of Science:	Engineering and Technology
Field Category:	Electrical Engineering - Electronic Engineering - Information Engineering
Keywords:	Artificial intelligence;Classification (of information);Information theory;Network layers;Stochastic systems
Issue Date:	30-Jun-2022
Source:	Proceedings of the 36th AAAI Conference on Artificial Intelligence (Virtual, Online), 2022, pp. 7931 - 7940
Volume:	36
Start page:	7931
End page:	7940
Conference:	36th AAAI Conference on Artificial Intelligence (AAAI)
Abstract:	This work addresses the long-established problem of learning diversified representations. To this end, we combine information-theoretic arguments with stochastic competition-based activations, namely Stochastic Local Winner-Takes-All (LWTA) units. In this context, we abandon the conventional deep architectures commonly used in representation learning, which rely on non-linear activations, and replace them with sets of locally and stochastically competing linear units. In this setting, each network layer yields sparse outputs, determined by the outcome of the competition between units that are organized into blocks of competitors. The competition mechanism is stochastic: the winner of each block is determined by posterior sampling. We further endow the considered networks with the ability to infer the sub-part of the network that is essential for modeling the data at hand; we impose appropriate stick-breaking priors to this end. To further enrich the information of the emerging representations, we resort to information-theoretic principles, namely the Information Competing Process (ICP). All the components are then tied together under the stochastic Variational Bayes framework for inference. We perform a thorough experimental investigation of our approach using benchmark datasets on image classification. As we experimentally show, the resulting networks yield significant discriminative representation learning abilities. In addition, the introduced paradigm provides a principled mechanism for investigating the emerging intermediate network representations.
URI:	https://hdl.handle.net/20.500.14279/29868
ISBN:	1577358767 978-157735876-3
Rights:	Copyright © Elsevier B.V
Type:	Article
Affiliation:	Cyprus University of Technology
Appears in Collections:	Conference publications / Conference papers or poster or presentation
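
The competition mechanism summarized in the abstract (linear units grouped into blocks, one winner per block chosen by sampling, all other units zeroed) can be illustrated with a minimal NumPy sketch. This is not the authors' implementation. It assumes that the winner probabilities are a softmax over the linear responses within each block, and it draws the winner with the Gumbel-max trick; the function name `stochastic_lwta`, its parameters, and the `temperature` argument are hypothetical names introduced here for illustration. The stick-breaking priors and the ICP objective mentioned in the abstract are not covered.

```python
import numpy as np


def stochastic_lwta(x, weights, n_blocks, block_size, rng, temperature=1.0):
    """Illustrative stochastic LWTA layer (assumed formulation, not the paper's code).

    x:       array of shape (batch, in_dim)
    weights: array of shape (in_dim, n_blocks * block_size)
    Returns the sparse layer output and the per-block winner probabilities.
    """
    batch = x.shape[0]
    # Linear units only: no non-linear activation is applied to the responses.
    h = (x @ weights).reshape(batch, n_blocks, block_size)

    # Assumption: winner probabilities are a softmax over each block's responses.
    logits = h / temperature
    shifted = logits - logits.max(axis=-1, keepdims=True)  # numerical stability
    probs = np.exp(shifted)
    probs /= probs.sum(axis=-1, keepdims=True)

    # Gumbel-max trick: argmax(logits + Gumbel noise) is a sample from softmax(logits).
    noise = rng.gumbel(size=logits.shape)
    winners = np.argmax(logits + noise, axis=-1)  # shape (batch, n_blocks)

    # One-hot mask per block: only the sampled winner passes its response on.
    mask = np.zeros_like(h)
    np.put_along_axis(mask, winners[..., None], 1.0, axis=-1)

    out = (h * mask).reshape(batch, n_blocks * block_size)
    return out, probs


# Usage: 4 inputs of dimension 8, mapped to 3 blocks of 2 competing units each.
rng = np.random.default_rng(0)
x = rng.normal(size=(4, 8))
w = rng.normal(size=(8, 6))
out, probs = stochastic_lwta(x, w, n_blocks=3, block_size=2, rng=rng)
```

Each block of `out` has at most one non-zero entry, which is the sparsity the abstract refers to. Because the winner is sampled rather than taken as the maximum, repeated calls on the same input can activate different units.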

This item is licensed under a Creative Commons License
