Skip to main content
Article
Enhanced label noise filtering with multiple voting
Applied Sciences (Switzerland)
  • Donghai Guan, Nanjing University of Aeronautics and Astronautics
  • Maqbool Hussain, Sejong University
  • Weiwei Yuan, Nanjing University of Aeronautics and Astronautics
  • Asad Masood Khattak, Zayed University
  • Muhammad Fahim, Innopolis University
  • Wajahat Ali Khan, Kyung Hee University
Document Type
Article
Publication Date
12-1-2019
Abstract

© 2019 by the authors. Label noises exist in many applications, and their presence can degrade learning performance. Researchers usually use filters to identify and eliminate them prior to training. The ensemble learning based filter (EnFilter) is the most widely used filter. According to the voting mechanism, EnFilter is mainly divided into two types: single-voting based (SVFilter) and multiple-voting based (MVFilter). In general, MVFilter is more often preferred because multiple-voting could address the intrinsic limitations of single-voting. However, the most important unsolved issue in MVFilter is how to determine the optimal decision point (ODP). Conceptually, the decision point is a threshold value, which determines the noise detection performance. To maximize the performance of MVFilter, we propose a novel approach to compute the optimal decision point. Our approach is data driven and cost sensitive, which determines the ODP based on the given noisy training dataset and noise misrecognition cost matrix. The core idea of our approach is to estimate the mislabeled data probability distributions, based on which the expected cost of each possible decision point could be inferred. Experimental results on a set of benchmark datasets illustrate the utility of our proposed approach.

Publisher
MDPI AG
Disciplines
Keywords
  • Cost minimization,
  • Mislabeled data filter,
  • Multiple-voting,
  • Optimal decision point,
  • Single-voting
Scopus ID
85076482207
Creative Commons License
Creative Commons Attribution 4.0 International
Indexed in Scopus
Yes
Open Access
Yes
Open Access Type
Gold: This publication is openly available in an open access journal/series
Citation Information
Donghai Guan, Maqbool Hussain, Weiwei Yuan, Asad Masood Khattak, et al.. "Enhanced label noise filtering with multiple voting" Applied Sciences (Switzerland) Vol. 9 Iss. 23 (2019) ISSN: <a href="https://v2.sherpa.ac.uk/id/publication/issn/2076-3417" target="_blank">2076-3417</a>
Available at: http://works.bepress.com/asad-khattak/43/