Deep Learning Model Selection of Suboptimal Complexity
- 作者: Bakhteev O.Y.1, Strijov V.V.1,2
-
隶属关系:
- Moscow Institute of Physics and Technology
- Dorodnicyn Computing Centre
- 期: 卷 79, 编号 8 (2018)
- 页面: 1474-1488
- 栏目: Optimization, System Analysis, and Operations Research
- URL: https://ogarev-online.ru/0005-1179/article/view/150990
- DOI: https://doi.org/10.1134/S000511791808009X
- ID: 150990
如何引用文章
详细
We consider the problem of model selection for deep learning models of suboptimal complexity. The complexity of a model is understood as the minimum description length of the combination of the sample and the classification or regression model. Suboptimal complexity is understood as an approximate estimate of the minimum description length, obtained with Bayesian inference and variational methods. We introduce probabilistic assumptions about the distribution of parameters. Based on Bayesian inference, we propose the likelihood function of the model. To obtain an estimate for the likelihood, we apply variational methods with gradient optimization algorithms. We perform a computational experiment on several samples.
作者简介
O. Bakhteev
Moscow Institute of Physics and Technology
编辑信件的主要联系方式.
Email: bakhteev@phystech.edu
俄罗斯联邦, Moscow
V. Strijov
Moscow Institute of Physics and Technology; Dorodnicyn Computing Centre
Email: bakhteev@phystech.edu
俄罗斯联邦, Moscow; Moscow
补充文件
