Work-in-Progress: Cooperative MLP-Mixer Networks Inference on Heterogeneous Edge Devices through Partition and Fusion

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

1 Scopus citations

Abstract

As a newly proposed DNN architecture, MLP-Mixer is attracting increasing attention due to its competitive results compared to CNNs and attention-base networks in various tasks. Although MLP-Mixer only contains MLP layers, it still suffers from high communication costs in edge computing scenarios, resulting in long inference time. To improve the inference performance of an MLP-Mixer model on correlated resource-constrained heterogeneous edge devices, this paper proposes a novel partition and fusion method specific for MLP-Mixer layers, which can significantly reduce the communication costs. Experimental results show that, when the number of devices increases from 2 to 6, our partition and fusion method can archive 1.01-1.27x and 1.54-3.12x speedup in scenarios with heterogeneous and homogeneous devices, respectively.

Original languageEnglish
Title of host publicationProceedings - 2022 International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, CASES 2022
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages29-30
Number of pages2
ISBN (Electronic)9781665472968
DOIs
StatePublished - 2022
Event2022 International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, CASES 2022 - Shanghai, China
Duration: 7 Oct 202214 Oct 2022

Publication series

NameProceedings - 2022 International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, CASES 2022

Conference

Conference2022 International Conference on Compilers, Architecture, and Synthesis for Embedded Systems, CASES 2022
Country/TerritoryChina
CityShanghai
Period7/10/2214/10/22

Keywords

  • DNN Inference
  • Edge Intelligence
  • MLP-Mixer

Fingerprint

Dive into the research topics of 'Work-in-Progress: Cooperative MLP-Mixer Networks Inference on Heterogeneous Edge Devices through Partition and Fusion'. Together they form a unique fingerprint.

Cite this