Ir al contenido

Documat


Does the Order of Attributes Play an Important Role in Classification?

  • Tallón-Ballesteros, Antonio J. [1] ; Simon Fong [2] ; Rocío Leal-Díaz [3]
    1. [1] Universidad de Huelva

      Universidad de Huelva

      Huelva, España

    2. [2] University of Macau

      University of Macau

      RAE de Macao (China)

    3. [3] Universidad de Sevilla

      Universidad de Sevilla

      Sevilla, España

  • Localización: Hybrid Artificial Intelligent Systems. 14th International Conference, HAIS 2019: León, Spain, September 4–6, 2019. Proceedings / coord. por Hilde Pérez García Árbol académico, Lidia Sánchez González Árbol académico, Manuel Castejón Limas Árbol académico, Héctor Quintián Pardo Árbol académico, Emilio Santiago Corchado Rodríguez Árbol académico, 2019, ISBN 978-3-030-29858-6, págs. 370-380
  • Idioma: inglés
  • Enlaces
  • Resumen
    • This paper proposes a methodology to feature sorting in the context of supervised machine learning algorithms. Feature sorting is defined as a procedure to order the initial arrangement of the attributes according to any sorting algorithm to assign an ordinal number to every feature, depending on its importance; later the initial features are sorted following the ordinal numbers from the first to the last, which are provided by the sorting method. Feature ranking has been chosen as the representative technique to fulfill the sorting purpose inside the feature selection area. This contribution aims at introducing a new methodology where all attributes are included in the data mining task, following different sortings by means of different feature ranking methods. The approach has been assessed in ten binary and multiple class problems with a number of features lower than 37 and a number of instances below than 106 up to 28056; the test-bed includes one challenging data set with 21 labels and 23 attributes where previous works were not able to achieve an accuracy of at least a fifty percent. ReliefF is a strong candidate to be applied in order to re-sort the initial characteristic space and C4.5 algorithm achieved a promising global performance; additionally, PART -a rule-based classifierand Support Vector Machines obtained acceptable results.


Fundación Dialnet

Mi Documat

Opciones de artículo

Opciones de compartir

Opciones de entorno