Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/33102
DC FieldValueLanguage
dc.contributor.authorAristodimou, Aristos-
dc.contributor.authorDiavastos, Andreas-
dc.contributor.authorConstantinos, Pattichis-
dc.date.accessioned2024-10-15T08:05:56Z-
dc.date.available2024-10-15T08:05:56Z-
dc.date.issued2022-02-16-
dc.identifier.citationHealth Informatics Journal, 2022en_US
dc.identifier.issn17412811-
dc.identifier.urihttps://hdl.handle.net/20.500.14279/33102-
dc.description.abstractDiscretization is a preprocessing technique used for converting continuous features into categorical. This step is essential for processing algorithms that cannot handle continuous data as input. In addition, in the big data era, it is important for a discretizer to be able to efficiently discretize data. In this paper, a new supervised density-based discretization (DBAD) algorithm is proposed, which satisfies these requirements. For the evaluation of the algorithm, 11 datasets that cover a wide range of datasets in the medical domain were used. The proposed algorithm was tested against three state-of-the art discretizers using three classifiers with different characteristics. A parallel version of the algorithm was evaluated using two synthetic big datasets. In the majority of the performed tests, the algorithm was found performing statistically similar or better than the other three discretization algorithms it was compared to. Additionally, the algorithm was faster than the other discretizers in all of the performed tests. Finally, the parallel version of DBAD shows almost linear speedup for a Message Passing Interface (MPI) implementation (9.64× for 10 nodes), while a hybrid MPI/OpenMP implementation improves execution time by 35.3× for 10 nodes and 6 threads per node.en_US
dc.formatpdfen_US
dc.language.isoenen_US
dc.relation.ispartofHealth Informatics Journalen_US
dc.subjectdensity estimationen_US
dc.subjectsupervised discretizationen_US
dc.subjectbig dataen_US
dc.subjectclassificationen_US
dc.subjectdensity-based discretizationen_US
dc.titleA fast supervised density-based discretization algorithm for classification tasks in the medical domainen_US
dc.typeArticleen_US
dc.collaborationUniversity of Cyprusen_US
dc.collaborationUniversitat Politècnica de Catalunyaen_US
dc.subject.categoryComputer and Information Sciencesen_US
dc.journalsSubscriptionen_US
dc.countryCyprusen_US
dc.countrySpainen_US
dc.subject.fieldEngineering and Technologyen_US
dc.publicationPeer Revieweden_US
cut.common.academicyear2021-2022en_US
item.grantfulltextnone-
item.openairetypearticle-
item.openairecristypehttp://purl.org/coar/resource_type/c_6501-
item.fulltextNo Fulltext-
item.cerifentitytypePublications-
item.languageiso639-1en-
crisitem.author.deptDepartment of Electrical Engineering, Computer Engineering and Informatics-
crisitem.author.facultyFaculty of Engineering and Technology-
crisitem.author.orcid0000-0002-7139-4444-
crisitem.author.parentorgFaculty of Engineering and Technology-
Appears in Collections:Άρθρα/Articles
CORE Recommender
Show simple item record

Page view(s)

4
checked on Oct 20, 2024

Google ScholarTM

Check


Items in KTISIS are protected by copyright, with all rights reserved, unless otherwise indicated.