Semisupervised Deep Learning for Image Classification with Distribution Mismatch: A Survey

Research output: Contribution to journalArticlepeer-review

21 Scopus citations

Abstract

Deep learning methodologies have been employed in several different fields, with an outstanding success in image recognition applications, such as material quality control, medical imaging, autonomous driving, etc. Deep learning models rely on the abundance of labeled observations to train a prospective model. These models are composed of millions of parameters to estimate, increasing the need of more training observations. Frequently, it is expensive to gather labeled observations of data, making the usage of deep learning models not ideal, as the model might overfit data. In a semisupervised setting, unlabeled data are used to improve the levels of accuracy and generalization of a model with small labeled datasets. Nevertheless, in many situations different unlabeled data sources might be available. This raises the risk of a significant distribution mismatch between the labeled and unlabeled datasets. Such phenomena can cause a considerable performance hit to typical semisupervised deep learning (SSDL) frameworks, which often assume that both labeled and unlabeled datasets are drawn from similar distributions. Therefore, in this article we study the latest approaches for SSDL for image recognition. Emphasis is made in SSDL models designed to deal with a distribution mismatch between the labeled and unlabeled datasets. We address open challenges with the aim to encourage the community to tackle them, and overcome the high data demand of traditional deep learning pipelines under real-world usage settings.

Original languageEnglish
Pages (from-to)1015-1029
Number of pages15
JournalIEEE Transactions on Artificial Intelligence
Volume3
Issue number6
DOIs
StatePublished - 1 Dec 2022
Externally publishedYes

Keywords

  • Deep learning
  • distribution mismatch
  • image classification
  • semisupervised learning

Fingerprint

Dive into the research topics of 'Semisupervised Deep Learning for Image Classification with Distribution Mismatch: A Survey'. Together they form a unique fingerprint.

Cite this