دورية أكاديمية

Automatic layer selection for transfer learning and quantitative evaluation of layer effectiveness.

التفاصيل البيبلوغرافية
العنوان: Automatic layer selection for transfer learning and quantitative evaluation of layer effectiveness.
المؤلفون: Nagae, Satsuki1 (AUTHOR) nagae@cmu.iit.tsukuba.ac.jp, Kanda, Daigo1 (AUTHOR) kanda@cmu.iit.tsukuba.ac.jp, Kawai, Shin1 (AUTHOR) kawai@cmu.iit.tsukuba.ac.jp, Nobuhara, Hajime1 (AUTHOR) nobuhara@cmu.iit.tsukuba.ac.jp
المصدر: Neurocomputing. Jan2022, Vol. 469, p151-162. 12p.
مصطلحات موضوعية: *CONVOLUTIONAL neural networks, *GENETIC algorithms, *GENETIC models
مستخلص: • Automatic layer selection for transfer learning using genetic algorithm. • Evaluate effectiveness of layers for transfer learning numerically. • Transfer learning performance is improved by genetic algorithm. • Proposed effectiveness value correlate with test accuracy in transfer learning. The performance of transfer learning in convolutional neural networks depends on the selection of which layer to update and fix. Because the number of layers is increasing, it is becoming increasingly difficult for humans to select layers. Therefore, in this study, we propose a method to automatically select effective update layers for transfer learning using a genetic algorithm. In our experiments, we conducted transfer learning from InceptionV3 pretrained with ImageNet to Canadian Institute for Advanced Research-100 dataset, The Street View House Numbers dataset and Food-101 dataset. We found that the test accuracy obtained by an ensemble of models selected by the genetic algorithm was greater than that obtained by from-scratch and fine-tuning for all target dataset. The distribution of the layers selected by the genetic algorithm as effective update layers was spread over the entire network. We also employed the optimal transport distance to evaluate whether each convolutional layer is an effective update layer for transfer learning. In our experiments, we compared the layer importance values and the accuracy of transfer learning. The layer importance was then correlated with the test accuracy of transfer learning, and the results demonstrate that the proposed method can quantitatively evaluate how well each network layer can detect general features in the target datasets. [ABSTRACT FROM AUTHOR]
قاعدة البيانات: Academic Search Index
الوصف
تدمد:09252312
DOI:10.1016/j.neucom.2021.10.051