Skip to main navigation Skip to search Skip to main content

Resource-aware Montgomery modular multiplication optimization for digital signal processing

  • Qiqi Tao
  • , Liying Li
  • , Junlong Zhou
  • , Guitao Cao*
  • , Dan Meng
  • *Corresponding author for this work

Research output: Contribution to journalArticlepeer-review

Abstract

Homomorphic encryption is an important technology for protecting data privacy, and the performance of modular multiplication directly affects the efficiency of homomorphic encryption. Currently, there are numerous FPGA-based acceleration techniques targeting modular multiplication. However, many of these implementations require substantial hardware resources or suffer from resource imbalance. This leads to a lower throughput. Therefore, we present a novel FPGA-based implementation of Montgomery Modular Multiplication aimed at addressing these challenges. Our design employs a suitable radix bit width and word size based on the digital signal processing (DSP) bit width rather than the conventional binary powers of two. We aim to instantiate more modular multipliers using limited resources while minimizing latency. We also introduce a novel DSP cascade structure, called parallel grouping cascade DSP, which reduces the number of clock cycles of internal multipliers. To balance the ratio of lookup table (LUT) and DSP usage, we also use multipliers implemented in the LUT to replace some DSPs. Our results, implemented on Xilinx Virtex-7 field-programmable gate array (FPGA), demonstrate more than 27% improvement in throughput on 1024-bit modular multiplication and more than 70% improvement on 2048-bit compared to the best previous state-of-the-art references.

Original languageEnglish
Article number103142
JournalJournal of Systems Architecture
Volume151
DOIs
StatePublished - Jun 2024

Keywords

  • Cryptosystem
  • Digital signal processing
  • Field Programmable Gate Array
  • Large integer arithmetic
  • Montgomery modular multiplication

Fingerprint

Dive into the research topics of 'Resource-aware Montgomery modular multiplication optimization for digital signal processing'. Together they form a unique fingerprint.

Cite this