Deep Learning Model Selection of Suboptimal Complexity
- Authors: Bakhteev O.Y.1, Strijov V.V.1,2
-
Affiliations:
- Moscow Institute of Physics and Technology
- Dorodnicyn Computing Centre
- Issue: Vol 79, No 8 (2018)
- Pages: 1474-1488
- Section: Optimization, System Analysis, and Operations Research
- URL: https://ogarev-online.ru/0005-1179/article/view/150990
- DOI: https://doi.org/10.1134/S000511791808009X
- ID: 150990
Cite item
Abstract
We consider the problem of model selection for deep learning models of suboptimal complexity. The complexity of a model is understood as the minimum description length of the combination of the sample and the classification or regression model. Suboptimal complexity is understood as an approximate estimate of the minimum description length, obtained with Bayesian inference and variational methods. We introduce probabilistic assumptions about the distribution of parameters. Based on Bayesian inference, we propose the likelihood function of the model. To obtain an estimate for the likelihood, we apply variational methods with gradient optimization algorithms. We perform a computational experiment on several samples.
About the authors
O. Yu. Bakhteev
Moscow Institute of Physics and Technology
Author for correspondence.
Email: bakhteev@phystech.edu
Russian Federation, Moscow
V. V. Strijov
Moscow Institute of Physics and Technology; Dorodnicyn Computing Centre
Email: bakhteev@phystech.edu
Russian Federation, Moscow; Moscow
Supplementary files
