Derin grenme alisma Zamani Eniyileme Y ntemlerinin Tasinabilirliginin Irdelenmesi

Translated title of the contribution: Examining the Portability of Deep Learning Runtime Optimization Methods

Muhammet Alpaslan Tavukcu*, Ayse Yilmazer-Metin

*Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

Abstract

Computational graph level optimizations are highly utilized to reduce the runtime of deep learning models. The effectiveness of these optimization methods depends on their cost (runtime) model and the target runtime environment. In this study, we analyzed the behavior of existing optimization tools cost models across different runtime environments, examining their performance portability. Our analysis demonstrates that optimizations performed using a cost model developed for a specific runtime environment cannot be directly transferred to a different runtime environment. Optimizations that yield performance improvements in one environment may result in performance degradation in another. Based on these findings, we determined that to make optimization methods viable on widely used tools such as ONNX Runtime, the cost model must be aligned with the performance characteristics of the relevant tool. To address this inconsistency, we trained Graph Neural Networks that predict the runtime of computational graphs. We established that when optimization is performed by replacing existing cost models with trained cost predictors, more consistent optimizations can be achieved.

Translated title of the contributionExamining the Portability of Deep Learning Runtime Optimization Methods
Original languageTurkish
Title of host publication33rd IEEE Conference on Signal Processing and Communications Applications, SIU 2025 - Proceedings
PublisherInstitute of Electrical and Electronics Engineers Inc.
ISBN (Electronic)9798331566555
DOIs
Publication statusPublished - 2025
Event33rd IEEE Conference on Signal Processing and Communications Applications, SIU 2025 - Istanbul, Turkey
Duration: 25 Jun 202528 Jun 2025

Publication series

Name33rd IEEE Conference on Signal Processing and Communications Applications, SIU 2025 - Proceedings

Conference

Conference33rd IEEE Conference on Signal Processing and Communications Applications, SIU 2025
Country/TerritoryTurkey
CityIstanbul
Period25/06/2528/06/25

Bibliographical note

Publisher Copyright:
© 2025 IEEE.

Fingerprint

Dive into the research topics of 'Examining the Portability of Deep Learning Runtime Optimization Methods'. Together they form a unique fingerprint.

Cite this