By: Mengwei Hu, Zhengxiong Li, Xianyang Jiang
In modern digital filter chip design, efficient resource utilization is a hot topic. Because FIR filters have linear-phase characteristics, a pulsed, fully parallel structure can be applied to address this problem. To further reduce hardware resource consumption, especially that of the multiplication functions, an improved RAG algorithm is proposed. Filters of different orders, implemented with different algorithms, were compared, and the experimental results show that the improved RAG algorithm excels in logic resource utilization, resource allocation, running speed, and power consumption across various application scenarios. The proposed algorithm yields a better circuit structure for FIR filters, fully leveraging resource-allocation strategies to reduce logic resource consumption. The resulting circuit is faster and more stable, making it suitable for a variety of complex application scenarios.
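For context, RAG-style methods (RAG = reduced adder graph) replace constant-coefficient multipliers with shared shift-and-add networks. The Python sketch below only illustrates that general idea, not the paper's improved algorithm; the coefficients 3, 5, and 11 are arbitrary examples chosen to show how an intermediate term can be shared across coefficients.

```python
# Illustrative sketch (not the paper's algorithm): realizing fixed FIR
# coefficient multiplications with shifts and adds, sharing intermediate
# terms the way adder-graph methods do in hardware.

def shift_add_products(x):
    """Multiply x by the example coefficients 3, 5, and 11 using only
    shifts and adds; 3*x is computed once and reused for 11*x, mimicking
    resource sharing in an adder graph."""
    x3 = x + (x << 1)      # 3*x  = x + 2*x
    x5 = x + (x << 2)      # 5*x  = x + 4*x
    x11 = x3 + (x << 3)    # 11*x = 3*x + 8*x, reuses the 3*x term
    return {3: x3, 5: x5, 11: x11}

if __name__ == "__main__":
    sample = 7
    products = shift_add_products(sample)
    assert all(c * sample == p for c, p in products.items())
    print(products)  # {3: 21, 5: 35, 11: 77}
```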
By: Victor Gao, Issam Hammad, Kamal El-Sankary, Jason Gu
This paper presents a novel method to boost the performance of CNN inference accelerators by utilizing subtractors. The proposed CNN preprocessing accelerator relies on sorting, grouping, and rounding the weights to create combinations that allow one multiplication and one addition to be replaced by a single subtraction when applying convolution during inference. Because multiplication is costly in terms of power and area, replacing it with subtraction reduces both. The proposed method allows the trade-off between performance gains and accuracy loss to be controlled by increasing or decreasing the usage of subtractors. With a rounding size of 0.05, and using LeNet-5 on the MNIST dataset, the proposed design achieves 32.03% power savings and a 24.59% reduction in area at the cost of only 0.1% accuracy loss.
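The abstract does not spell out the exact grouping rule, so the Python sketch below is only a guess at the general arithmetic: assuming rounding (with a 0.05 step) makes two weights equal in magnitude and opposite in sign, w*x1 + (-w)*x2 can be computed as w*(x1 - x2), trading one multiplication and one addition for a single subtraction. The function name and the pairing heuristic are mine, not the authors'.

```python
# Illustrative sketch (the grouping rule is my assumption): pair weights
# that quantize to +q and -q and fold each pair into one multiply of a
# subtraction, i.e. w*x1 + (-w)*x2 -> w*(x1 - x2).

def dot_with_subtraction(weights, inputs, step=0.05):
    q = [round(w / step) for w in weights]   # integer quantization levels
    used = [False] * len(weights)
    acc = 0.0
    for i, qi in enumerate(q):
        if used[i]:
            continue
        used[i] = True
        if qi == 0:
            continue
        # look for an unused partner weight that quantizes to -qi
        partner = next((j for j in range(i + 1, len(q))
                        if not used[j] and q[j] == -qi), None)
        if partner is not None:
            used[partner] = True
            acc += (qi * step) * (inputs[i] - inputs[partner])  # 1 mult, 1 sub
        else:
            acc += (qi * step) * inputs[i]                      # plain MAC
    return acc

if __name__ == "__main__":
    w = [0.24, -0.26, 0.10, 0.51]
    x = [1.0, 2.0, 3.0, 4.0]
    print(dot_with_subtraction(w, x))            # approximate (rounded weights)
    print(sum(wi * xi for wi, xi in zip(w, x)))  # exact reference
```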
By: Ahmad Houraniah, H. Fatih Ugurdag, Furkan Aydin
Summing a set of numbers, namely, "Accumulation," is a subtask within many computational tasks. If the numbers to sum arrive non-stop in back-to-back clock cycles at high clock frequencies, summing them without allowing them to pile up can be quite a challenge, that is, when the latency of addition (i.e., summing two numbers) is longer than one clock cycle, which is always the case for floating-point numbers. This could also be the case for... more
Summing a set of numbers, namely, "Accumulation," is a subtask within many computational tasks. If the numbers to sum arrive non-stop in back-to-back clock cycles at high clock frequencies, summing them without allowing them to pile up can be quite a challenge, that is, when the latency of addition (i.e., summing two numbers) is longer than one clock cycle, which is always the case for floating-point numbers. This could also be the case for integer summations with high clock frequencies. In the case of floating-point numbers, this is handled by pipelining the adder, but that does not solve all problems. The challenges include optimization of speed, area, and latency. As well as the adaptability of the design to different application requirements, such as the ability to handle variable-size subsequent data sets with no time gap in between and with results produced in the input-order. All these factors make designing an efficient floating-point accumulator a non-trivial problem. Integer accumulation is a relatively simpler problem, where high frequencies can be achieved by using carry-save tree adders. This can then be further improved by efficient resource-sharing. In this paper, we present two fast and area-efficient accumulation circuits, JugglePAC and INTAC. JugglePAC is tailored for floating-point reduction operations (such as accumulation) and offers significant advantages with respect to the literature in terms of speed, area, and adaptability to various application requirements. INTAC is designed for fast integer accumulation. Using carry-save adders and resource-sharing, it can achieve very high clock frequencies while maintaining a low area complexity. less