Optimizing overall loop schedules using prefetching and partitioning

Fei Chen, Timothy W. O'Neil, Edwin H.M. Sha

Research output: Contribution to journalArticlepeer-review

22 Scopus citations

Abstract

In this paper, a method combining the loop pipelining technique with data prefetching, called Partition Scheduling with Prefetching (PSP), is proposed. In PSP, the iteration space is first divided into regular partitions. Then a two-part schedule, consisting of the ALU and memory parts, is produced and balanced to produce high throughput. These two parts are executed simultaneously, and hence, the remote memory latencies are overlapped. We study the optimal partition shape and size so that a well-balanced overall schedule can be obtained. Experiments on DSP benchmarks show that the proposed methodology consistently produces optimal or near optimal solutions.

Original languageEnglish
Pages (from-to)604-614
Number of pages11
JournalIEEE Transactions on Parallel and Distributed Systems
Volume11
Issue number6
DOIs
StatePublished - Jun 2000
Externally publishedYes

Fingerprint

Dive into the research topics of 'Optimizing overall loop schedules using prefetching and partitioning'. Together they form a unique fingerprint.

Cite this