Polynomial-time nested loop fusion with full parallelism

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

5 Scopus citations

Abstract

Data locality and synchronization overhead are two important factors that affect the performance of applications on multiprocessors. Loop fusion is an effective way of reducing synchronization and improving data locality. Traditional fusion techniques, however either cannot address the case when fusion-preventing dependence exists in nested loops, or cannot achieve good parallelism after fusion. This paper gives a significant improvement by presenting several efficient polynomial-time algorithms to solve these problems. These algorithms combined with the retiming technique allow nested loop fusion in the existence of outmost loop-carried dependence as in the presence of fusion-preventing dependence. Furthermore, the technique is proved to achieve fully parallel execution of the fused loops.

Original languageEnglish
Title of host publicationSoftware
EditorsK. Pingali
PublisherInstitute of Electrical and Electronics Engineers Inc.
Pages9-16
Number of pages8
ISBN (Electronic)081867623X
DOIs
StatePublished - 1996
Externally publishedYes
Event25th International Conference on Parallel Processing, ICPP 1996 - Ithaca, United States
Duration: 12 Aug 199616 Aug 1996

Publication series

NameProceedings of the International Conference on Parallel Processing
Volume3
ISSN (Print)0190-3918

Conference

Conference25th International Conference on Parallel Processing, ICPP 1996
Country/TerritoryUnited States
CityIthaca
Period12/08/9616/08/96

Fingerprint

Dive into the research topics of 'Polynomial-time nested loop fusion with full parallelism'. Together they form a unique fingerprint.

Cite this