
Dr. Boyu Diao is currently an Associate Professor at the Institute of Computing Technology, Chinese Academy of Sciences (ICT CAS). In 2012, he graduated from Beijing Institute of Technology, earning a Bachelor of Engineering degree in Computer Science. Subsequently, in 2018, he obtained his Ph.D. in Computer Architecture from ICT CAS.
He leads a research team dedicated to Efficient Machine Learning Systems. His primary research interests include the co-design of algorithms and hardware for efficient deep learning, deep learning compilers, and on-device learning systems.
Research Areas
Efficient Machine Learning Systems, On-Device Learning Systems, Edge Intelligence
Publications
Selected Papers
[IPDPS'25] Hangda Liu, Boyu Diao*, Yu Yang, Wenxin Chen, Xiaohui Peng, Yongjun Xu, Gensor: A Graph-based Construction Tensor Compilation Method for Deep Learning, 39th IEEE International Parallel & Distributed Processing Symposium, 2025 (CCF-B)
[ICASSP'25] Zijia An, Boyu Diao*, Libo Huang, Ruiqi Liu, Zhulin An, Yongjun Xu, IOR: Inversed Objects Replay for Incremental Object Detection, 50th IEEE International Conference on Acoustics, Speech, and Signal Processing, 2025 (CCF-B)
[CCGRID'25] Chen Yan, Boyu Diao*, Hangda Liu, Zhulin An, Yongjun Xu, A Nonlinear Hash-based Optimization Method for SpMV on GPUs, 25th IEEE International Symposium on Cluster, Cloud and Internet Computing, 2025 (CCF-C)
[CCGRID'25] Junzhou Xu, Boyu Diao*, Shaobo Zhao, Chengxiang Qi, Ruisheng Wang, Yongjun Xu, A Sensitivity-Driven Expert Allocation Method in LoRA-MoE for Efficient Fine-Tuning, 25th IEEE International Symposium on Cluster, Cloud and Internet Computing, 2025 (CCF-C)
[AAAI'25] Weilun Feng, Haotong Qin, Chuanguang Yang, Zhulin An, Libo Huang, Boyu Diao, Fei Wang, Renshuai Tao, Yongjun Xu, Michele Magno, MPQ-DM: Mixed Precision Quantization for Extremely Low Bit Diffusion Models, 39th Annual AAAI Conference on Artificial Intelligence, 2025 (CCF-A)
[NeurIPS'24] Ruiqi Liu, Boyu Diao*, Libo Huang, Zijia An, Zhulin An, Yongjun Xu, Continual Learning in the Frequency Domain, 38th Annual Conference on Neural Information Processing Systems, 2024 (CCF-A)
[AAAI'24] Libo Huang, Yan Zeng, Chuanguang Yang, Zhulin An, Boyu Diao, Yongjun Xu, eTag: Class-Incremental Learning via Embedding Distillation and Task-Oriented Generation, 38th AAAI Conference on Artificial Intelligence, 2024 (CCF-A)
[CVPR'24] Chuanguang Yang, Zhulin An, Libo Huang, Junyu Bi, Han Yang, Boyu Diao, Yongjun Xu, CLIP-KD: A Comprehensive Study of Distilling CLIP Models, The Thirty-Fourth IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2024 (CCF-A)
[ACM MM'24] Weilun Feng, Chuanguang Yang, Zhulin An, Libo Huang, Boyu Diao, Fei Wang, Yongjun Xu, Relational Diffusion Distillation for Efficient Image Generation, 32nd ACM International Conference on Multimedia, 2024 (CCF-A)
[JPDC] Lingfei Dai, Luqi Gong, Zhulin An, Yongjun Xu, Boyu Diao*, Sketch-fusion: A gradient compression method with multi-layer fusion for communication-efficient distributed training, Journal of Parallel and Distributed Computing, 2024 (CCF-B)
[CompJ] Hangda Liu, Boyu Diao*, Wenxin Chen, Yongjun Xu, A resource-aware workload scheduling method for unbalanced GEMMs on GPUs, The Computer Journal, 2024 (CCF-B)
[NPC'24] Wenxin Chen, Boyu Diao*, Hangda Liu, Ruisheng Wang, Yongjun Xu, DTuner: a Construction-based Optimization Method for Dynamic Tensor Operators Accelerating, The 20th IFIP International Conference on Network and Parallel Computing, 2024 (CCF-C)
[ICPR'24] Yu Yang, Boyu Diao*, Hangda Liu, Qiyun Chen, Qi Wang, Yongjun Xu, An Evolutionary Search-Based Operator Fusion Method with Binary Representation for Deep Learning Inference Acceleration, 27th International Conference on Pattern Recognition, 2024 (CCF-C)
[ICPR'24] Qiyun Chen, Boyu Diao*, Yu Yang, Yongjun Xu, SCP: A Structure Combination Pruning method via Structured Sparse for Deep Convolutional Neural Networks, 27th International Conference on Pattern Recognition, 2024 (CCF-C)
[CompJ] Dong Dong, Hongxu Jiang*, Boyu Diao*, AKGF: Automatic Kernel Generation for DNN on CPU-FPGA, The Computer Journal, 2023 (CCF-B)
[Innovation] Qi Wang, Tingting Li, Fei Wang, Boyu Diao, Lei Zheng, Jincai Huang, How to prevent malicious use of intelligent unmanned swarms?, The Innovation, 2023 (IF: 33.2)
[ICCV'23] Kelu Yao, Jin Wang, Boyu Diao, Chao Li, Towards Understanding the Generalization of Deepfake Detectors from a Game-Theoretical View, IEEE/CVF International Conference on Computer Vision, 2023 (CCF-A)
[IJCNN'22] Jianrong Xu, Boyu Diao*, Bifeng Cui, Kang Yang, Chao Li, Yongjun Xu, PFGDF: Pruning Filter via Gaussian Distribution Feature for Deep Neural Networks Acceleration, International Joint Conference on Neural Networks, 2022 (CCF-C)
[AAAI'22] Chao Li, Kelu Yao, Jin Wang, Boyu Diao, Yongjun Xu, Quanshi Zhang, Interpretable Generative Adversarial Networks, The 36th AAAI Conference on Artificial Intelligence, 2022 (CCF-A)