Optimal two level partitioning and loop scheduling for hiding memory latency for DSP applications

Zhong Wang, Michael Kirkpatrick, Edwin Hsing Mean Sha

Research output: Contribution to journal › Conference article › peer-review

7 Scopus citations

Abstract

The large latency of memory accesses in modern computers is a key obstacle to achieving high processor utilization. To hide this latency, this paper proposes a new memory management technique that can be applied to computer architectures with three levels of memory. The technique takes advantage of access-pattern information that is available at compile time by prefetching certain data elements from the higher-level memory. It also keeps certain data resident for a period of time to prevent unnecessary data swapping. Partitioning the iteration space and reducing execution in each partition greatly improves data locality compared with the usual pattern. Together, these approaches improve average execution times by approximately 35% over the one-level partition algorithm and by more than 80% over list scheduling and hardware prefetching.
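As a rough illustration of the general idea in the abstract (partition the iteration space, then prefetch the next partition's data while the current one executes), the sketch below may help. It is not the paper's algorithm: the function names `partition` and `schedule_time`, the rectangular tile shape, and the per-iteration cost model are all assumptions made for this example, and it models only a single level of partitioning.

```python
from typing import Iterator, List, Tuple

# (i_start, i_end, j_start, j_end), end indices exclusive
Tile = Tuple[int, int, int, int]


def partition(n: int, m: int, ti: int, tj: int) -> Iterator[Tile]:
    """Yield rectangular tiles that cover an n x m iteration space."""
    for i0 in range(0, n, ti):
        for j0 in range(0, m, tj):
            yield (i0, min(i0 + ti, n), j0, min(j0 + tj, m))


def schedule_time(tiles: List[Tile], compute_per_iter: int,
                  fetch_per_iter: int, overlap: bool) -> int:
    """Estimate total time under a hypothetical, simplified cost model.

    Without overlap, every iteration pays compute plus fetch cost.
    With overlap, the first tile is fetched up front, and the fetch
    for tile k+1 runs concurrently with the computation of tile k.
    """
    sizes = [(i1 - i0) * (j1 - j0) for i0, i1, j0, j1 in tiles]
    if not sizes:
        return 0
    if not overlap:
        return sum(s * (compute_per_iter + fetch_per_iter) for s in sizes)

    total = sizes[0] * fetch_per_iter  # initial fetch cannot be hidden
    for k, size in enumerate(sizes):
        compute = size * compute_per_iter
        has_next = k + 1 < len(sizes)
        next_fetch = sizes[k + 1] * fetch_per_iter if has_next else 0
        # The slower of the two concurrent activities sets the step time.
        total += max(compute, next_fetch)
    return total
```

In this toy model, memory latency is fully hidden whenever a tile's computation takes at least as long as fetching the next tile's data, which is why the choice of partition size matters.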

Original language: English
Pages (from-to): 540-545
Number of pages: 6
Journal: Proceedings - Design Automation Conference
State: Published - 2000
Externally published: Yes
Event: DAC 2000: 37th Design Automation Conference - Los Angeles, CA, USA
Duration: 5 Jun 2000 - 9 Jun 2000
