Machine Learning Strategies for Large-scale Taxonomies

Prendre des notes

Il n’y a pas de note disponible pour vous pour cette vidéo.

Connectez-vous pour en créer une nouvelle.

Disciplines

Types

Mots clés

perform 302 fle 291 techniques 290 sciences 288 filipé 284 fos 282 dsim 189 gricad 179 lig 169 mathematiques 153 soutenance 151 cpp 137 thèse 137 prepa inp 134 prepa des inp 133 stage 126 mooc 121 uga 97 recherche 94 2a 86

Rohit Babbar / LIG

In the era of Big Data, we need efficient and scalable machine learning algorithms which can perform automatic classification of Tera-Bytes of data. In this thesis, we study the machine learning challenges for classification in large-scale taxonomies. These challenges include computational complexity of training and prediction and the performance on unseen data. In the first part of the thesis, we study the underlying power-law distribution in large-scale taxonomies. This analysis then motivates the derivation of bounds on space complexity of hierarchical classifiers. Exploiting the study of this distribution further, we then design classification scheme which leads to better accuracy on large-scale power-law distributed categories. We also propose an efficient method for model-selection when training multi-class version of classifiers such as Support Vector Machine and Logistic Regression. Finally, we address another key model selection problem in large scale classification!

Concerning the choice between flat versus hierarchical classification from a learning theoretic aspect. The presented generalization error analysis provides an explanation to empirical findings in many recent studies in large-scale hierarchical classification. We further exploit the developed bounds to propose two methods for adapting the given taxonomy of categories to output taxonomies which yield better test accuracy when used in a top-down setup.

Mots clés : soutenance thèse

Ajouté par : Gricad Vidéos
Mis à jour le : 1 janvier 2021 00:00
Chaîne :
- Recherche
Type : Autres
Langue principale : Français

Les commentaires ont été désactivés pour cette vidéo.

Machine Learning Strategies for Large-scale Taxonomies

Informations