Publications

2024

  1. palu_concept.png
    Palu: Compressing KV-Cache with Low-Rank Projection
    Chi-Chih Chang, Wei-Cheng Lin, Chien-Yu Lin, Chong-Yan Chen, Yu-Fang Hu, and 4 more authors
    arXiv preprint arXiv:2407.21118, 2024
  2. quamba.jpg
    Quamba: A Post-Training Quantization Recipe for Selective State Space Models
    Hung-Yueh Chiang, Chi-Chih Chang, Natalia Frumkin, Kai-Chiang Wu, and Diana Marculescu
    arXiv preprint arXiv:2410.13229, 2024
  3. elsa.png
    ELSA: Exploiting Layer-wise N:M Sparsity for Vision Transformer Acceleration
    Ning-Chi Huang, Chi-Chih Chang, Wei-Cheng Lin, Endri Taka, Diana Marculescu, and 1 more author
    In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops, Jun 2024
  4. flora.png
    FLORA: Fine-Grained Low-Rank Architecture Search for Vision Transformer
    Chi-Chih Chang, Yuan-Yao Sung, Shixing Yu, Ning-Chi Huang, Diana Marculescu, and 1 more author
    In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV), Jan 2024
  5. gdbn.png
    Transformer and Its Variants for Identifying Good Dice in Bad Neighborhoods
    Cheng-Che Lu, Chi-Chih Chang, Chia-Heng Yen, Shuo-Wen Chang, Ying-Hua Chu, and 2 more authors
    In 2024 IEEE 42nd VLSI Test Symposium (VTS), Jan 2024

2023

  1. q_yolop.png
    Q-YOLOP: Quantization-Aware You Only Look Once for Panoptic Driving Perception
    Chi-Chih Chang, Wei-Cheng Lin, Pei-Shuo Wang, Sheng-Feng Yu, Yu-Chen Lu, and 2 more authors
    In 2023 IEEE International Conference on Multimedia and Expo Workshops (ICMEW) , Jul 2023