Static and Dynamic Instruction Mapping for Spatial Architectures

Liu, Feng

Please use this identifier to cite or link to this item: http://arks.princeton.edu/ark:/88435/dsp01kk91fp258

Full metadata record

DC Field	Value	Language
dc.contributor.advisor	August, David	-
dc.contributor.author	Liu, Feng	-
dc.contributor.other	Electrical Engineering Department	-
dc.date.accessioned	2018-06-12T17:43:02Z	-
dc.date.available	2018-06-12T17:43:02Z	-
dc.date.issued	2018	-
dc.identifier.uri	http://arks.princeton.edu/ark:/88435/dsp01kk91fp258	-
dc.description.abstract	In response to technology scaling trends, spatial architectures have emerged as a new style of processor for executing programs more efficiently. Unlike traditional Out-of-Order (OoO) processors, which time-share a small set of functional units, a spatial computer is composed of hundreds or even thousands of simple, replicated functional units. Spatial architectures avoid the overheads of time-sharing and of repeatedly generating schedules by mapping instruction sequences onto the functional units explicitly and reusing the schedules across multiple invocations. Currently, spatial architectures mainly use static methods to map instructions onto the arrays of functional units. The existing methods have several limitations. First, for programs with irregular memory accesses and control flow, they yield poor performance because the functional units must be invoked sequentially to respect data and control dependences. Second, static methods cannot fully exploit speculation techniques, which are the dominant source of performance in OoO processors. Finally, static methods cannot adapt to changing workloads and are not compatible across hardware generations. To address these issues and improve the applicability of spatial architectures, this dissertation proposes two techniques. The first, Coarse-Grained Pipelined Accelerators (CGPA), is a static compilation framework that exploits the hidden parallelism within irregular C/C++ loops and translates them into spatial hardware modules. The technique has been implemented as a compiler pass, and experiments show a 3.3x speedup over an open-source tool baseline. The second technique, Dynamic Spatial Architecture Mapping (DYNASPAM), reuses the speculation system of OoO processors to dynamically produce high-performance scheduling and execution on a dedicated spatial fabric. The technique is modeled in a cycle-accurate simulator, and experiments show a 1.4x geomean performance improvement and a 23.9% reduction in energy consumption compared to an OoO processor baseline.	-
dc.language.iso	en	-
dc.publisher	Princeton, NJ : Princeton University	-
dc.relation.isformatof	The Mudd Manuscript Library retains one bound copy of each dissertation. Search for these copies in the library's main catalog: http://catalog.princeton.edu	-
dc.subject	compiling	-
dc.subject	computer architecture	-
dc.subject	high level synthesis	-
dc.subject	reconfigurable	-
dc.subject.classification	Computer engineering	-
dc.subject.classification	Computer science	-
dc.title	Static and Dynamic Instruction Mapping for Spatial Architectures	-
dc.type	Academic dissertations (Ph.D.)	-
pu.projectgrantnumber	690-2143	-
Appears in Collections:	Electrical Engineering

Files in This Item:

File	Description	Size	Format
Liu_princeton_0181D_12487.pdf		2.21 MB	Adobe PDF
