Publications

(*) denotes equal contribution.

2024

  1. NeurIPS 2024
    PTQ4DiT: Post-training Quantization for Diffusion Transformers
    Junyi Wu, Haoxuan Wang, Yuzhang Shang, Mubarak Shah, and Yan Yan
    Conference on Neural Information Processing Systems (NeurIPS) 2024
  2. NeurIPS 2024
    HEPrune: Fast Private Training of Deep Neural Networks With Encrypted Data Pruning
    Yancheng Zhang, Mengxin Zheng, Yuzhang Shang, Xun Chen, and Qian Lou
    Conference on Neural Information Processing Systems (NeurIPS) 2024
  3. ECCV 2024
    Dataset Quantization with Active Learning-based Adaptive Sampling
    Zhenghao Zhao, Yuzhang Shang, Junyi Wu, and Yan Yan
    European Conference on Computer Vision (ECCV) 2024
  4. LLaVA-PruMerge: Adaptive Token Reduction for Efficient Large Multimodal Models
    Yuzhang Shang*, Mu Cai*, Bingxin Xu, Yong Jae Lee^, and Yan Yan^
    arXiv Mar. 2024
  5. LLM Inference Unveiled: Survey and Roofline Model Insights
    Zhihang Yuan*, Yuzhang Shang*, Yang Zhou*, Zhen Dong, Chenhao Xue, Bingzhe Wu, Zhikai Li, and 6 more authors
    arXiv Feb. 2024 (Survey Paper)
  6. ICLR 2024
    PB-LLM: Partially Binarized Large Language Models
    Yuzhang Shang, Zhihang Yuan, and Zhen Dong
    International Conference on Learning Representations (ICLR) 2024
  7. CVPR 2024
    Enhancing Post-training Quantization Calibration through Contrastive Learning
    Yuzhang Shang, Gaowen Liu, Ramana Kompella, and Yan Yan
    Computer Vision and Pattern Recognition (CVPR) 2024
  8. CVPR 2024
    Efficient Multitask Dense Predictor via Binarization
    Yuzhang Shang, Dan Xu, Gaowen Liu, Ramana Kompella, and Yan Yan
    Computer Vision and Pattern Recognition (CVPR) 2024
  9. ICRA 2024
    FBPT: A Fully Binary Point Transformer
    Zhixing Hou, Yuzhang Shang, and Yan Yan
    International Conference on Robotics and Automation (ICRA) 2024
  10. QuEST: Low-bit Diffusion Model Quantization via Efficient Selective Finetuning
    Haoxuan Wang, Yuzhang Shang, Zhihang Yuan, Junyi Wu, and Yan Yan
    arXiv Feb. 2024

2023

  1. ASVD: Activation-aware Singular Value Decomposition for Compressing Large Language Models
    Zhihang Yuan*, Yuzhang Shang*, Yue Song, Qiang Wu, Yan Yan, and Guangyu Sun
    arXiv Dec. 2023
  2. NeurIPS 2023
    MIM4DD: Mutual Information Maximization for Dataset Distillation
    Yuzhang Shang, Zhihang Yuan, and Yan Yan
    Conference on Neural Information Processing Systems (NeurIPS) 2023
  3. ICCV 2023
    Causal-DFQ: Causality Guided Data-free Network Quantization
    Yuzhang Shang, Bingxin Xu, Gaowen Liu, Ramana Kompella, and Yan Yan
    International Conference on Computer Vision (ICCV) 2023
  4. arXiv 2023
    RPTQ: Reorder-based Post-training Quantization for Large Language Models
    Zhihang Yuan, Lin Niu, Jiawei Liu, Wenyu Liu, Xinggang Wang, Yuzhang Shang, Guangyu Sun, and 3 more authors
    arXiv 2023
  5. CVPR 2023
    Post-training Quantization on Diffusion Models
    Yuzhang Shang*, Zhihang Yuan*, Bin Xie, Bingzhe Wu, and Yan Yan
    Computer Vision and Pattern Recognition (CVPR) 2023

2022

  1. ECCV 2022
    Network Binarization via Contrastive Learning
    Yuzhang Shang, Dan Xu, Ziliang Zong, Liqiang Nie, and Yan Yan
    European Conference on Computer Vision (ECCV) 2022
  2. ECCV 2022
    Lipschitz Continuity Retained Binary Neural Network
    Yuzhang Shang, Dan Xu, Bin Duan, Ziliang Zong, Liqiang Nie, and Yan Yan
    European Conference on Computer Vision (ECCV) 2022
  3. ICASSP 2022
    Win The Lottery Ticket Via Fourier Analysis: Frequencies Guided Network Pruning
    Yuzhang Shang, Bin Duan, Ziliang Zong, Liqiang Nie, and Yan Yan
    IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP) 2022

2021

  1. ICCV 2021
    Lipschitz Continuity Guided Knowledge Distillation
    Yuzhang Shang, Bin Duan, Ziliang Zong, Liqiang Nie, and Yan Yan
    International Conference on Computer Vision (ICCV) 2021