Publication View

Efficient Pipelining of Nested Loops: Unroll-and-Squash (2002)

Abstract
The size and complexity of current custom VLSI have forced the use of high-level programming languages to describe hardware, and compiler and synthesis technology to map abstract designs into silicon. Many applications operating on large streaming data usually require a custom VLSI because of high performance or low power restrictions. Since the data processing is typically described by loop constructs in a high-level language, loops are the most critical portions of the hardware description and special techniques are developed to optimally synthesize them. In this thesis, we introduce a new method for mapping nested loops into hardware and pipelining them efficiently. The technique achieves fine-grain parallelism even on strong intra- and inter-iteration datadependent inner loops and, by economically sharing resources, improves performance at the expense of a small amount of additional area. We implemented the transformation within the Nimble Compiler environment and evaluated its performance on several signal-processing benchmarks. The method achieves up to 2x increase in the area efficiency compared to the best known optimization techniques.

Publication details
Download http://citeseer.ist.psu.edu/503315.html
Source http://cag.lcs.mit.edu/commit/papers/01/petkov-MEthesis.pdf
Publisher unknown
Contributors The Pennsylvania State University CiteSeer Archives
Repository CiteSeer (United States)
Keywords Darin S. Petkov,Randolph E. Harr,Saman P. Amarasinghe,Arthur C. Smith Efficient Pipelining of Nested Loops: Unroll-and-Squash
Language Englisch
Relation oai:CiteSeerPSU:521861, oai:CiteSeerPSU:249441, oai:CiteSeerPSU:258306, oai:CiteSeerPSU:13885, oai:CiteSeerPSU:243525, oai:CiteSeerPSU:262903, oai:CiteSeerPSU:137255, oai:CiteSeerPSU:281804, oai:CiteSeerPSU:722776, oai:CiteSeerPSU:369289, oai:CiteSeerPSU:324818, oai:CiteSeerPSU:221670, oai:CiteSeerPSU:33421