Please use this identifier to cite or link to this item: https://ktisis.cut.ac.cy/handle/10488/13701
Title: Filtering Instagram Hashtags through crowdtagging and the HITS algorithm
Authors: Giannoulakis, Stamatios 
Tsapatsoulis, Nicolas 
Keywords: Bipartite graphs;Collective intelligence;Crowdtagging;FolkRank;Hyperlink-induced topic search (HITS) algorithm;Image retrieval;Image tagging;Instagram hashtags
Category: Computer and Information Sciences
Field: Natural Sciences
Issue Date: Jun-2019
Source: IEEE Transactions on Computational Social Systems, 2019, Volume 6, Issue 3, Pages 592 - 603
Journal: IEEE Transactions on Computational Social Systems 
Abstract: Instagram is a rich source for mining descriptive tags for images and multimedia in general. The tags-image pairs can be used to train automatic image annotation (AIA) systems in accordance with the learning by example paradigm. In previous studies, we had concluded that, on average, 20% of the Instagram hashtags are related to the actual visual content of the image they accompany, i.e., they are descriptive hashtags, while there are many irrelevant hashtags, i.e., stop-hashtags, that are used across totally different images just for gathering clicks and for searchability enhancement. In this paper, we present a novel methodology, based on the principles of collective intelligence that helps in locating those hashtags. In particular, we show that the application of a modified version of the well-known hyperlink-induced topic search (HITS) algorithm, in a crowdtagging context, provides an effective and consistent way for finding pairs of Instagram images and hashtags, which lead to representative and noise-free training sets for content-based image retrieval. As a proof of concept, we used the crowdsourcing platform Figure-eight to allow collective intelligence to be gathered in the form of tag selection (crowdtagging) for Instagram hashtags. The crowdtagging data of Figure-eight are used to form bipartite graphs in which the first type of nodes corresponds to the annotators and the second type to the hashtags they selected. The HITS algorithm is first used to rank the annotators in terms of their effectiveness in the crowdtagging task and then to identify the right hashtags per image.
URI: http://ktisis.cut.ac.cy/handle/10488/13701
ISSN: 2329-924X
DOI: 10.1109/TCSS.2019.2914080
Rights: © 2019 IEEE.
Type: Article
Appears in Collections:Άρθρα/Articles

Files in This Item:
File Description SizeFormat
Final Draft.pdfFinal Draft Version2.14 MBAdobe PDFView/Open
Show full item record

Page view(s)

54
Last Week
15
Last month
checked on Jun 14, 2019

Download(s)

10
checked on Jun 14, 2019

Google ScholarTM

Check

Altmetric


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.