Kernel Shape Control for Row-Efficient Convolution on Processing-In-Memory Arrays
  • Rhe, Johnny
  • Jeon, Kang Eun
  • Lee, Joo Chan
  • Jeong, Seongmoon
  • Ko, Jong Hwan
Citations

WEB OF SCIENCE

2
Citations

SCOPUS

3

초록

Processing-in-memory (PIM) architectures have been highlighted as one of the viable solutions for faster and more power-efficient convolutional neural networks (CNNs) inference. Recently, shift and duplicate kernel (SDK) convolutional weight mapping scheme was proposed, achieving up to 50% throughput improvement over the prior arts. However, the traditional pattern-based pruning methods, which were adopted for row-skipping and computing cycle reduction, are not optimal for the latest SDK mapping due to structural irregularity caused by the shifted and duplicated kernels. To address this issue, we propose a method called kernel shape control (KERNTROL) that aims to promote structural regularity for achieving a high row-skipping ratio and model accuracy. Instead of pruning certain weight elements permanently, KERNTROL controls the kernel shapes through the omission of certain weights based on their mapped columns. In comparison to the latest pattern-based pruning approaches, KERNTROL achieves up to 36.4% improvement in the compression rate, and 38.6% in array utilization with maintaining the original model accuracy.

키워드

processing-in-memoryshift and duplicate (SDK) weight mappingweight pruningneural compressionARCHITECTUREPRECISION
제목
Kernel Shape Control for Row-Efficient Convolution on Processing-In-Memory Arrays
저자
Rhe, JohnnyJeon, Kang EunLee, Joo ChanJeong, SeongmoonKo, Jong Hwan
DOI
10.1109/ICCAD57390.2023.10323749
발행일
2024-11
유형
Proceedings Paper
저널명
IEEE/ACM International Conference on Computer-Aided Design, Digest of Technical Papers, ICCAD