Özet
One of the most efficient methods for model compression is hint distillation, where the student model is injected with information (hints) from several different layers of the teacher model. Although the selection of hint points can drastically alter the compression performance, conventional distillation approaches overlook this fact and use the same hint points as in the early studies. Therefore, we propose a clustering based hint selection methodology, where the layers of teacher model are clustered with respect to several metrics and the cluster centers are used as the hint points. Our method is applicable for any student network, once it is applied on a chosen teacher network. The proposed approach is validated in CIFAR-100 and ImageNet datasets, using various teacher–student pairs and numerous hint distillation methods. Our results show that hint points selected by our algorithm results in superior compression performance compared to state-of-the-art knowledge distillation algorithms on the same student models and datasets.
Orijinal dil | İngilizce |
---|---|
Makale numarası | 119040 |
Dergi | Expert Systems with Applications |
Hacim | 213 |
DOI'lar | |
Yayın durumu | Yayınlandı - 1 Mar 2023 |
Bibliyografik not
Publisher Copyright:© 2022 Elsevier Ltd
Finansman
This work has been supported by Arcelik ITU R&D Center , The Scientific and Technological Research Council of Turkey (TUBITAK) under the grant number 121E378 and ITU Scientific Research Projects Fund under the grant number MOA-2019-42321 .
Finansörler | Finansör numarası |
---|---|
Arcelik ITU R&D Center | |
ITU Scientific Research Projects Fund | MOA-2019-42321 |
Türkiye Bilimsel ve Teknolojik Araştırma Kurumu | 121E378 |