publications

(*) denotes equal contribution.

2024

  1. LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
    Yuzhang Shang*, Mu Cai*, Bingxin Xu, Yong Jae Lee^, and Yan Yan^
    arXiv Mar. 2024
  2. LLM Inference Unveiled: Survey and Roofline Model Insights
    Zhihang Yuan*, Yuzhang Shang*, Yang Zhou*, Zhen Dong, Chenhao Xue, Bingzhe Wu, Zhikai Li, and 6 more authors
    arXiv Feb. (Survey Paper) 2024
  3. Enhancing Post-training Quantization Calibration through Contrastive Learning
    Yuzhang Shang, Gaowen Liu, Ramana Kompella, and Yan Yan
    Computer Vision and Pattern Recognition (CVPR) 2024
  4. Efficient Multitask Dense Predictor via Binarization
    Yuzhang Shang, Dan Xu, Gaowen Liu, Ramana Kompella, and Yan Yan
    Computer Vision and Pattern Recognition (CVPR) 2024
  5. FBPT: A Fully Binary Point Transformer
    Zhixing Hou, Yuzhang Shang, and Yan Yan
    International Conference on Robotics and Automation (ICRA) 2024
  6. QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning
    Haoxuan Wang, Yuzhang Shang, Zhihang Yuan, Junyi Wu, and Yan Yan
    arXiv Feb. 2024

2023

  1. ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models
    Zhihang Yuan*, Yuzhang Shang*, Yue Song, Qiang Wu, Yan Yan, and Guangyu Sun
    arXiv Dec. 2023
  2. MIM4DD: Mutual Information Maximization for Dataset Distillation
    Yuzhang Shang, Zhihang Yuan, and Yan Yan
    Conference on Neural Information Processing Systems (NeurIPS) 2023
  3. Causal-DFQ: Causality Guided Data-free Network Quantization
    Yuzhang Shang, Bingxin Xu, Gaowen Liu, Ramana Kompella, and Yan Yan
    International Conference on Computer Vision (ICCV) 2023
  4. arXiv 2023
    RPTQ: Reorder-based Post-training Quantization for Large Language Models
    Zhihang Yuan, Lin Niu, Jiawei Liu, Wenyu Liu, Xinggang Wang, Yuzhang Shang, Guangyu Sun, and 3 more authors
    arXiv 2023
  5. Post-training Quantization on Diffusion Models
    Yuzhang Shang*, Zhihang Yuan*, Bin Xie, Bingzhe Wu, and Yan Yan
    Computer Vision and Pattern Recognition (CVPR) 2023

2022

  1. Network Binarization via Contrastive Learning
    Yuzhang Shang, Dan Xu, Ziliang Zong, Liqiang Nie, and Yan Yan
    European Conference on Computer Vision (ECCV) 2022
  2. Lipschitz Continuity Retained Binary Neural Network
    Yuzhang Shang, Dan Xu, Bin Duan, Ziliang Zong, Liqiang Nie, and Yan Yan
    European Conference on Computer Vision (ECCV) 2022
  3. ICASSP 2022
    Win The Lottery Ticket Via Fourier Analysis: Frequencies Guided Network Pruning
    Yuzhang Shang, Bin Duan, Ziliang Zong, Liqiang Nie, and Yan Yan
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022

2021

  1. Lipschitz Continuity Guided Knowledge Distillation
    Yuzhang Shang, Bin Duan, Ziliang Zong, Liqiang Nie, and Yan Yan
    International Conference on Computer Vision (ICCV) 2021