Optimizing logistic regression coefficients for discrimination and calibration using estimation of distribution algorithms

  • V. Robles [1] ; C. Bielza [1] ; P. Larrañaga [1] ; S. González [1] ; L. Ohno-Machado [2]
    1. [1] Universidad Politécnica de Madrid, Madrid, Spain
    2. [2] Harvard University and MIT, USA
  • Published in: Top, e-ISSN 1863-8279, ISSN 1134-5764, Vol. 16, No. 2, 2008, pp. 345-366
  • Language: English
  • DOI: 10.1007/s11750-008-0054-3
  • Abstract
    • Logistic regression is a simple and efficient supervised learning algorithm for estimating the probability of an outcome or class variable. In spite of its simplicity, it has shown very good performance in a range of fields, where it is widely accepted because its results are easy to interpret. The model is usually fitted by maximum likelihood, and the Newton–Raphson algorithm is the most common numerical approach for obtaining the coefficients that maximize the likelihood of the data.

      This work presents a novel approach for fitting the logistic regression model based on estimation of distribution algorithms (EDAs), a tool for evolutionary computation. EDAs are suitable not only for maximizing the likelihood, but also for maximizing the area under the receiver operating characteristic curve (AUC).

      Thus, we tackle the logistic regression problem from a double perspective: likelihood-based to calibrate the model and AUC-based to discriminate between the different classes. Under these two objectives of calibration and discrimination, the Pareto front can be obtained in our EDA framework. These fronts are compared with those yielded by a multiobjective EDA recently introduced in the literature.
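
The two fitting objectives named in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say which EDA variant the paper uses, so the code below assumes a simple univariate Gaussian EDA (UMDA-style). The function names (`gaussian_umda`, `log_likelihood`, `auc`), the population settings, and the synthetic data are all hypothetical and chosen only for illustration.

```python
import numpy as np

def sigmoid(z):
    # Clip to avoid overflow in exp for extreme linear predictors.
    return 1.0 / (1.0 + np.exp(-np.clip(z, -30, 30)))

def log_likelihood(beta, X, y):
    # Calibration objective: Bernoulli log-likelihood of the data.
    p = sigmoid(X @ beta)
    eps = 1e-12
    return float(np.sum(y * np.log(p + eps) + (1 - y) * np.log(1 - p + eps)))

def auc(beta, X, y):
    # Discrimination objective: AUC as the Mann-Whitney statistic,
    # i.e. P(score of a positive > score of a negative), ties count 1/2.
    s = X @ beta
    pos, neg = s[y == 1], s[y == 0]
    diff = pos[:, None] - neg[None, :]
    return float(np.mean((diff > 0) + 0.5 * (diff == 0)))

def gaussian_umda(objective, dim, pop_size=100, n_select=30, n_gen=40, seed=0):
    # Assumed EDA variant: independent Gaussian per coefficient.
    rng = np.random.default_rng(seed)
    mean, std = np.zeros(dim), np.full(dim, 3.0)
    best_beta, best_val = None, -np.inf
    for _ in range(n_gen):
        # 1. Sample a population from the current probabilistic model.
        pop = rng.normal(mean, std, size=(pop_size, dim))
        vals = np.array([objective(b) for b in pop])
        # 2. Truncation selection: keep the best n_select individuals.
        order = np.argsort(vals)[::-1]
        elite = pop[order[:n_select]]
        if vals[order[0]] > best_val:
            best_val, best_beta = float(vals[order[0]]), pop[order[0]].copy()
        # 3. Re-estimate the model from the selected individuals.
        mean = elite.mean(axis=0)
        std = elite.std(axis=0) + 1e-3  # floor keeps sampling non-degenerate
    return best_beta, best_val

# Synthetic data (illustrative only): intercept plus two covariates.
data_rng = np.random.default_rng(1)
n = 300
X = np.column_stack([np.ones(n), data_rng.normal(size=(n, 2))])
true_beta = np.array([-0.5, 2.0, -1.0])
y = (data_rng.random(n) < sigmoid(X @ true_beta)).astype(int)

beta_ll, best_ll = gaussian_umda(lambda b: log_likelihood(b, X, y), dim=3)
beta_auc, best_auc = gaussian_umda(lambda b: auc(b, X, y), dim=3)
print("likelihood-based fit:", beta_ll, best_ll)
print("AUC-based fit:", beta_auc, best_auc)
```

Because the AUC depends only on how the scores rank the observations, it is unchanged by the intercept and by any positive rescaling of the coefficients. Coefficients found by maximizing AUC alone can therefore discriminate well while being poorly calibrated, which is one way to read the abstract's motivation for treating the two objectives jointly.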

