Achieving full parallelism using multidimensional retiming

Research output: Contribution to journalArticlepeer-review

47 Scopus citations

Abstract

Most scientific and Digital Signal Processing (DSP) applications are recursive or iterative. Transformation techniques are usually applied to get optimal execution rates in parallel and/or pipeline systems. The retiming technique is a common and valuable transformation tool in one-dimensional problems, when loops are represented by data flow graphs (DFGs). In this paper, uniform nested loops are modeled as multidimensional data flow graphs (MDFGs). Full parallelism of the loop body, i.e., all nodes in the MDFG executed in parallel, substantially decreases the overall computation time. It is well known that, for one-dimensional DFGs, retiming can not always achieve full parallelism. Other existing optimization techniques for nested loops also can not always achieve full parallelism. This paper shows an important and counter-intuitive result, which proves that we can always obtain full-parallelism for MDFGs with more than one dimension. This result is obtained by transforming the MDFG into a new structure. The restructuring process is based on a multidimensional retiming technique. The theory and two algorithms to obtain full parallelism are presented in this paper. Examples of optimization of nested loops and digital signal processing designs are shown to demonstrate the effectiveness of the algorithms.

Original languageEnglish
Pages (from-to)1150-1163
Number of pages14
JournalIEEE Transactions on Parallel and Distributed Systems
Volume7
Issue number11
DOIs
StatePublished - 1996
Externally publishedYes

Keywords

  • Instruction level parallelism
  • Loop transformation
  • Multidimensional data-flow graphs
  • Nested loops
  • Retiming
  • Superscalar
  • VLIW

Fingerprint

Dive into the research topics of 'Achieving full parallelism using multidimensional retiming'. Together they form a unique fingerprint.

Cite this