On the Generalization for Transfer Learning: An Information-Theoretic Analysis

X Wu; JH Manton; U Aickelin; J Zhu

Journal article

On the Generalization for Transfer Learning: An Information-Theoretic Analysis

X Wu, JH Manton, U Aickelin, J Zhu

IEEE Transactions on Information Theory | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC | Published : 2024

DOI: 10.1109/TIT.2024.3441574

Download PDF

Abstract

Transfer learning, or domain adaptation, is concerned with machine learning problems in which training and testing data come from possibly different probability distributions. In this work, we give an information-theoretic analysis of the generalization error and excess risk of transfer learning algorithms. Our results suggest, perhaps as expected, that the Kullback-Leibler (KL) divergence D(μ|μ') plays an important role in the characterizations where μ and μ' denote the distribution of the training data and the testing data, respectively. Specifically, we provide generalization error and excess risk upper bounds for learning algorithms where data from both distributions are available in the..

View full abstract

University of Melbourne Researchers

Jonathan Manton Author

Uwe Aickelin Author

Jingge Zhu Author

Grants

Awarded by Defence Science and Technology Group

Funding Acknowledgements

This work was supported in part by Melbourne Research Scholarships (MRS), in part by Australian Defence Science and Technology Group (DSTG) under the scheme of the Artificial Intelligence for Decision Making Initiative 2022, and in part by Australian Research Council under Project DE210101497. An earlier version of this paper was presented at the ISIT2020 Conference [DOI: 10.1109/ISIT44484.2020.9173989].