Splitting criteria for ordinal decision trees: an experimental study

  • 2025-07-21 17:19:05
  • Rafael Ayllón-Gavilán, Francisco José Martínez-Estudillo, David Guijo-Rubio, César Hervás-Martínez, Pedro Antonio Gutiérrez
  • 0

Abstract

Ordinal Classification (OC) addresses those classification tasks where thelabels exhibit a natural order. Unlike nominal classification, which treats allclasses as mutually exclusive and unordered, OC takes the ordinal relationshipinto account, producing more accurate and relevant results. This isparticularly critical in applications where the magnitude of classificationerrors has significant consequences. Despite this, OC problems are oftentackled using nominal methods, leading to suboptimal solutions. Althoughdecision trees are among the most popular classification approaches, ordinaltree-based approaches have received less attention when compared to otherclassifiers. This work provides a comprehensive survey of ordinal splittingcriteria, standardising the notations used in the literature to enhance clarityand consistency. Three ordinal splitting criteria, Ordinal Gini (OGini),Weighted Information Gain (WIG), and Ranking Impurity (RI), are compared to thenominal counterparts of the first two (Gini and information gain), byincorporating them into a decision tree classifier. An extensive repositoryconsidering $45$ publicly available OC datasets is presented, supporting thefirst experimental comparison of ordinal and nominal splitting criteria usingwell-known OC evaluation metrics. The results have been statistically analysed,highlighting that OGini stands out as the best ordinal splitting criterion todate, reducing the mean absolute error achieved by Gini by more than 3.02%. Topromote reproducibility, all source code developed, a detailed guide forreproducing the results, the 45 OC datasets, and the individual results for allthe evaluated methodologies are provided.

 

Quick Read (beta)

loading the full paper ...