Home
  • English
  • ÄŒeÅ¡tina
  • Deutsch
  • Español
  • Français
  • Gàidhlig
  • LatvieÅ¡u
  • Magyar
  • Nederlands
  • Português
  • Português do Brasil
  • Suomi
  • Log In
    New user? Click here to register. Have you forgotten your password?
Home
  • Browse Our Collections
  • Publications
  • Researchers
  • Research Data
  • Institutions
  • Statistics
    • English
    • ÄŒeÅ¡tina
    • Deutsch
    • Español
    • Français
    • Gàidhlig
    • LatvieÅ¡u
    • Magyar
    • Nederlands
    • Português
    • Português do Brasil
    • Suomi
    • Log In
      New user? Click here to register. Have you forgotten your password?
  1. Home
  2. Resources
  3. UniMAP Index Publications
  4. Publications 2024
  5. Optimization and analysis of FPGA-based systolic array for matrix multiplication
 
Options

Optimization and analysis of FPGA-based systolic array for matrix multiplication

Journal
AIP Conference Proceedings
ISSN
0094243X
Date Issued
2024-02-08
Author(s)
Shapri A.H.M.
Rahman N.A.
Zakaria S.M.M.S.
Chieh K.K.
Saat S.
DOI
10.1063/5.0192098
Handle (URI)
https://hdl.handle.net/20.500.14170/7783
Abstract
Over the years, field-programmable gate array (FPGA)-based accelerators have attracted interest and attention due to their performance and energy efficiency factors. This paper presents an optimized FPGA-based accelerator using a systolic array for matrix multiplication. In a systolic array, many identical processing elements (PEs) are arranged in a well-organized structure, and each PE is connected with the other PEs. The data will flow between neighboring elements in different directions synchronously. PE is an arithmetic logic unit (ALU) with attached working registers and local memory. This paper coded the accelerator in Verilog and simulated using the Quartus Prime with PowerPlay Power Analyzer tool for power optimization. A 3-bit ALU has been implemented using the Synopsys electronic design automation (EDA) tool. The schematic diagram, layout and verification for a complete ALU design have been accomplished. This paper provides detailed results and analyses of the systolic array and ALU in power dissipation and area density based on method and circuit implementation. The results showed that the power consumption efficiency of the accelerator improved after optimization. Also, power dissipation and the area of 3-bit ALU are reduced by 0.2 mW and 4.55 % at the back-end, respectively.
Thumbnail Image
Views
3
Acquisition Date
Jan 12, 2026
View Details
google-scholar
Downloads
  • About Us
  • Contact Us
  • Policies