Publication:
Support vector machine tools for multi-class classification problems

Loading...
Thumbnail Image
Identifiers
Publication date
2016-03
Defense date
2016-05-05
Tutors
Journal Title
Journal ISSN
Volume Title
Publisher
Impact
Google Scholar
Export
Research Projects
Organizational Units
Journal Issue
Abstract
Lately, Support Vector Machine (SVM) methods have become a very popular technique in the machine learning field for classification problems. It was originally proposed for classifications of two classes. The e↵ectiveness of this method has not only been shown in hundreds of experiments, but also been proved in theory. In our real life, we usually have more than two classes. Various multi-class models with a single objective have been proposed mostly based on two families of methods: an all-together approach and a combination of binary classifiers. However, most of these single-objective models consider neither the di↵erent costs of di↵erent misclassifications nor the users’ preferences. To overcome these drawbacks, we have two approaches. A direct way is to give di↵erent weights to the penalties in the objective functions. The difficulty for this way is how to choose proper values for the weights. Alternatively, multi-objective approaches have been proposed. However, these multi-objective approaches need to solve a set of large Second-Order-Cone Programs (SOCPs) and gives us weakly Pareto-optimal solutions. This thesis is comprised of two working papers on multi-class SVMs. We summarize the contributions of these two working papers as follows. In the first article, we propose a multi-objective technique that we denominate Projected Multiobjective SVM (PM), which works in a higher dimensional space than the object space. For PM, we can characterize its Pareto-optimal solutions. And for classifications with large numbers of classes, PM significantly alleviates the computational bottlenecks. From our experimental results, and compared with the single-objective multi-class SVMs (based on an all-together method, one-against-all method and one-against-one method), PM obtains comparable values for the training classification accuracies, testing classification accuracies and training time, with the advantage of providing a wider set of options, each of them designed for di↵erent misclassification costs. Compared to other multi-objective methods, PM outperforms them in terms of the out-of-sample quality of the approximation of the Pareto frontier, with a considerable reduction of the computational burden. In the second article, we focus on finding the appropriate values of the weight parameters for the single-objective multi-class SVM which considers all classes in one quadratic program (QP). We propose a partial parametric path algorithm (PPPA) taking advantage of the piecewise linearity of the optimal solutions of the weighted single-objective SVMs with respect to the trade-o↵ parameter C. Compared to the traditional grid search method which needs repeatedly solving the QPs, using PPPA we need to solve only one QP and several linear equations. Thus we can save a lot of computation using PPPA. To systematically explore the di↵erent weights for the misclassification costs, we combine the PPPA with a variable neighborhood search method. Our numerical experiments shows the efficiency and reliability of PPPA.
Description
Keywords
Artificial intelligent, Machine learning, Multiclass multiobjective SVM, Weakly Pareto-optimal solution, Multi-objective approach (PM), Single-objective approach (PPPA)
Bibliographic citation
Collections