Multivariate supervised discretization,
a neighborhood graph approach
Fabrice Muhlenbach and Ricco Rakotomalala
ERIC Laboratory
Lumière University -- Lyon II
5 avenue Pierre Mendès-France
F-69676 BRON Cedex -- FRANCE
{fmuhlenb,rakotoma}@univ-lyon2.fr
Abstract
We present a new discretization method in the context of supervised learning.
This method entitled HyperCluster Finder is characterized by its
supervised and polythetic behavior. The method is based on the notion of
clusters and processes in two steps. First, a neighborhood graph
construction from the learning database allows discovering homogenous clusters.
Second, the minimal and maximal values of each cluster are transferred to each
dimension in order to define some boundaries to cut the continuous attribute in
a set of intervals. The discretization abilities of this method are illustrated
by some examples, in particular, processing the XOR problem.