Many-Accelerator Computing

This NSF CAREER project, “Combating Dark Silicon with Tiled Many-Accelerator Architectures”, aims to address the challenge of increasing transistor power density in modern microprocessors, which limits performance and threatens the future of computing. We are investigating the use of specialized accelerator cores, each designed to compute one particular application 10-100 times more efficiently than a general-purpose core, to combat the increase of Dark Silicon. The project has two main aspects: (1) accelerator selection algorithms, designed to improve the coverage of the application by accelerators. These algorithms and methodologies will be tested through the design of canonical tiles of accelerators, where each tile and the accelerators on it are designed to accelerate a particular application domain; (2) the design of memory systems and interconnect that can provide high bandwidth and storage to accelerators that are active only occasionally.

The goal of this effort is to make many-accelerator architectures more general and available to a wide range of devices and applications. We are currently studying Internet-of-Things (IoT) workloads, imaging workloads, and high-performance computing applications.

Because the performance of modern systems is limited by the cost of communication between accelerator cores, the project builds on the PI’s prior work on communication classification, the Sigil workload characterization tool, and the Accelerator Store.

This project broadly encompasses the research efforts on Many-Accelerator Computing over the last few years. Two new distinct projects have been created as part of these investigations:


“CAREER: Combating Dark Silicon through Specialization: Communication-Aware Tiled Many-Accelerator Architectures” 2/1/2014 – 1/31/2021. $470,000. National Science Foundation (NSF)


  • Parnian Mokri, Mark Hempstead, Improving HLS with Shared Accelerators: A Retrospective, Latte 2021, April 16, [Video] [Camera_Ready_Paper] [Latte21_FinalSlides]
  • Parnian Mokri, Mark Hempstead, Early-stage Automated Accelerator Identification Tool for Embedded Systems with Limited Area, ICCAD 2020, Nov 2 [video] [slides]
  • Parnian Mokri and Mark Hempstead. “Early-stage Automated Identification Tool for Shared Accelerators” Poster Presentation. Field-Programmable Custom Computing Machines (FCCM2020), April 2020. [PDF] [Video]
  • Parnian Mokri, Maziar Amiraski, Yuelin Liu, and Mark Hempstead. Building Reconfigurable Shared Accelerators through Early-stage Automated Identification of Similar Hardware Implementations with Abstract Syntax Trees. Poster Presentation. In Proceedings of the 28th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, (FPGA), February 2020. [PDF]
  • David Werner, Kyle Juretus, Ioannis Savidis and Mark Hempstead. Machine Learning on the Thermal Side-Channel: Analysis of Accelerator-rich Architectures. The 36th IEEE International Conference on Computer Design (ICCD), Oct 2018.
  • P. Mokri and M. Hempstead, “Fingerprinting Coarse-Grained Reconfigurable Accelerators Using Data Movement and Structural Similarities in Applications”, Boston Area Architecture Workshop (BARC), Jan. 2018.
  • P. Mokri and M. Hempstead, “ReconfASTs: Early-stage Identification of Reconfigurable Accelerators with Annotated Abstract Syntax Tree”, Boston Area Architecture (BARC) Workshop, January 2017.
  • D. Werner, M. Hempstead, K. Juretus, and I. Daulagala, “The Vulnerability of Specialized Architectures to Temperature Side-Channel Information Leakage,” Boston Area Architecture Workshop (BARC), Jan. 2016.
  • P. Mokri and M. Hempstead, “Stockpile Of Accelerators: A Methodology To Increase Accelerators’ Coverage”, Boston Area Architecture (BARC) Workshop, January 2016.
  • Steven Battle and Mark Hempstead, Characterizing the Costs and Benefits of Hardware Parallelism in Accelerator Cores, Proceedings of the International Conference on Computer Design (ICCD), Asheville, NC, Oct 2013.
  • Siddharth Nilakantan, Steven Battle, Mark Hempstead. Metrics for Early-Stage Modeling of Many-Accelerator Architectures. Computer Architecture Letters (CAL), July-Dec 2012.
  • M. Lyons, M. Hempstead, D. Brooks, G.-Y. Wei. The Accelerator Store Framework for High-Performance, Low-Power Accelerator-based Systems. ACM Transactions on Architecture and Code Optimization (TACO). Joint Presentation at HiPEAC, Paris, France, January 2012.