DBM: Delay-sensitive Buffering Mechanism for DNN Offloading Services

Guoliang Gao, Liantao Wu, Yang Yang, Kai Li

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

DNN offloading has become an important supporting technology for edge intelligence. However, most of the existing works do not consider thread scheduling, which can achieve the parallelism of multiple threads in the practical distributed DNN inference system. To address this issue, we discuss the thread scheduling of the computing units participating in offloading in this paper, considering a single-core Central Processing Unit (CPU) and the Round Robin Scheduling (RRS). We deduce the relationship between the blocking of DNN inference-related threads and the Average Task Delay (ATD) and prove that an appropriate buffer setting can reduce blocking times. Theoretical analysis verifies that the buffering mechanism (DBM) can reduce the ATD significantly, and experimental results demonstrate that the DBM-improved DNN offloading can achieve a delay reduction of 14%-71%.

Original languageEnglish
Title of host publicationAPCC 2022 - 27th Asia-Pacific Conference on Communications
Subtitle of host publicationCreating Innovative Communication Technologies for Post-Pandemic Era
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages421-426
Number of pages6
ISBN (Electronic)9781665499279
DOIs
StatePublished - 2022
Externally publishedYes
Event27th Asia-Pacific Conference on Communications, APCC 2022 - Jeju Island, Korea, Republic of
Duration: 19 Oct 202221 Oct 2022

Publication series

NameAPCC 2022 - 27th Asia-Pacific Conference on Communications: Creating Innovative Communication Technologies for Post-Pandemic Era

Conference

Conference27th Asia-Pacific Conference on Communications, APCC 2022
Country/TerritoryKorea, Republic of
CityJeju Island
Period19/10/2221/10/22

Keywords

  • DNN offloading
  • buffering mechanism
  • streaming tasks

Fingerprint

Dive into the research topics of 'DBM: Delay-sensitive Buffering Mechanism for DNN Offloading Services'. Together they form a unique fingerprint.

Cite this