Please use this identifier to cite or link to this item: https://hdl.handle.net/20.500.14279/29880
DC Field | Value | Language
dc.contributor.author | Kleanthous, Christos | -
dc.contributor.author | Christophides, Theodoros | -
dc.contributor.author | Chatzis, Sotirios P. | -
dc.date.accessioned | 2023-07-14T11:50:40Z | -
dc.date.available | 2023-07-14T11:50:40Z | -
dc.date.issued | 2020-10-15 | -
dc.identifier.citation | Proceedings of 1st ACM International Conference on AI in Finance, 2020, pp. 1-8 | en_US
dc.identifier.isbn | 9781450375849 | -
dc.identifier.uri | https://hdl.handle.net/20.500.14279/29880 | -
dc.description.abstract | Tax authorities need to maximize the yield of the limited number of tax audits they can afford to perform each year. Thus, they need to predict the likelihood of a candidate audit resulting in a satisfactory yield; this predictive process is usually referred to as audit case selection. Random Forests (RFs) constitute a standard method for Value Added Tax (VAT) audit case selection. Despite their success, however, their predictive performance is still below the expectations of tax authorities, which need to detect cases of significant audit yield potential in a timely manner. This lackluster performance is mainly attributed to the fact that RFs cannot deal with data that exhibit non-stationarity, multiple modalities, or discontinuities. These are common characteristics of real-world datasets; thus, the inability to properly address them is a prime suspect for undermining the performance of RFs. This work addresses these issues by considering a generative non-parametric Bayesian model with power-law behavior, capable of generating distinct (Bayesian) RFs over the observation space of the modeled data. This way, our approach can capture an indefinite number of distinct classification patterns while effectively handling outliers. The latter advantage is of paramount importance for the effectiveness of the modeling procedure in cases where a few large parts of the observation space can be modeled by a few RF classifiers, yet there is a large number of small parts of the observation space that require distinct RFs to be properly modeled (power-law nature). We provide an efficient algorithm for model inference, based on the variational Bayesian framework, and demonstrate its efficacy using real-world datasets. | en_US
dc.language.iso | en | en_US
dc.rights | Copyright © Elsevier B.V | en_US
dc.rights.uri | http://creativecommons.org/licenses/by-nc-nd/4.0/ | -
dc.subject | Bayesian networks | en_US
dc.subject | Decision trees | en_US
dc.subject | Inference engines | en_US
dc.subject | Mixtures | en_US
dc.subject | Random forests | en_US
dc.subject | Audit selection | en_US
dc.subject | Non-parametric Bayesian mixture model | en_US
dc.title | Power-law mixtures of Bayesian forests for value added tax audit case selection | en_US
dc.type | Conference Papers | en_US
dc.collaboration | Cyprus University of Technology | en_US
dc.subject.category | Mechanical Engineering | en_US
dc.country | Cyprus | en_US
dc.subject.field | Engineering and Technology | en_US
dc.relation.conference | 1st ACM International Conference on AI in Finance | en_US
dc.identifier.doi | 10.1145/3383455.3422515 | en_US
dc.identifier.scopus | 2-s2.0-85118117062 | -
dc.identifier.url | https://api.elsevier.com/content/abstract/scopus_id/85118117062 | -
cut.common.academicyear | 2020-2021 | en_US
dc.identifier.spage | 1 | en_US
dc.identifier.epage | 8 | en_US
item.fulltext | No Fulltext | -
item.cerifentitytype | Publications | -
item.grantfulltext | none | -
item.openairecristype | http://purl.org/coar/resource_type/c_c94f | -
item.openairetype | conferenceObject | -
item.languageiso639-1 | en | -
crisitem.author.dept | Department of Electrical Engineering, Computer Engineering and Informatics | -
crisitem.author.faculty | Faculty of Engineering and Technology | -
crisitem.author.orcid | 0000-0002-4956-4013 | -
crisitem.author.parentorg | Faculty of Engineering and Technology | -
Appears in Collections: Άρθρα/Articles

This item is licensed under a Creative Commons License.