基本信息

黄怀波  副研究员 中科院自动化研究所

北京市科技新星创新新星

北京市科协青年人才托举工程

中国科学院青年创新促进会会员

电子邮件: huaibo.huang@cripac.ia.ac.cn
通信地址: 北京市海淀区中关村东路95号
邮政编码: 100190

研究领域

计算机视觉、多模态理解与生成、视觉合成与安全、图像恢复和增强

招生信息

每年招收硕士研究生一名,欢迎具有自驱力、致力于发表高水平文章、解决实际科研问题的同学联系。

招生专业
081104-模式识别与智能系统
081203-计算机应用技术
招生方向
模式识别,计算机视觉

工作经历

2019-06   中国科学院大学   博士
2016-01   北京航空航天大学   硕士
2012-07   西安交通大学   学士

工作简历
2021-04~现在, 中国科学院自动化研究所, 副研究员
2019-07~2021-04,中国科学院自动化研究所, 助理研究员
学术兼职

1. 北京图象图形学学会理事

2. 中国图象图形学学会视觉大数据专委会委员

3. IEEE TIFS、IEEE  TBIOM期刊编委

4. ICLR、ACM MM、PRCV等学术会议领域主席

5. TPAMI、IJCV、TIP、NeurIPS、ICML、ICLR、CVPR、ICCV等期刊和会议审稿人

奖励信息

(1) 北京市科技新星, 省级, 2023

(2) 吴文俊人工智能科学技术奖技术发明奖, 一等奖, 其他, 2023

(3) 中国科学院青年创新促进会, , 院级, 2022

(4) 北京市科协青年人才托举工程, , 其他, 2020

(5) 北京市优秀毕业生, , 省级, 2019

(6) 中国科学院院长优秀奖, 院级, 2019

(7) ICME研讨会最佳学生论文奖, 其他, 2019

出版信息

近几年在人工智能领域国际权威期刊和会议发表/录用论文共计90余篇,其中CCF-A类论文54篇,包含TPAMI 3篇、IJCV 8篇、NeurIPS 9篇、CVPR 14篇、ICCV 5篇;出版Springer专著1部。

全部论文列表参考:个人主页 谷歌学术 

期刊论文
  1. Qihang Fan, Huaibo Huang, Mingrui Chen, Hongmin Liu, Ran He. Advancing Vision Transformer with Enhanced Spatial Priors. Trans. Pattern Analysis and Machine Intelligence (TPAMI), 2026. (IF: 23.6,CCF-A,人工智能领域顶级期刊)
  2. Yuang Ai, Jie Cao, Ran He, Huaibo Huang. Uncertainty-Aware Source-Free Adaptive Image Restoration with State Space Augmentation. International Journal of Computer Vision (IJCV), 2026. (IF: 19.5,CCF-A,人工智能领域顶级期刊)
  3. Jiayang Sun, Hongbo Wang, Jie Cao, Huaibo Huang. Marmot: Object-Level Self-Correction via Multi-Agent Reasoning. Machine Intelligence Research (MIR). 2026. (IF: 8.7,机器智能领域权威期刊)
  4. Junxian Duan, Hao Sun, Fan Ji, Kai Zhou, Zhiyong Wang, Huaibo Huang, Lianwen Jin. RealDTT: Towards A Comprehensive Real-World Dataset for Tampered Text Detection. International Journal of Computer Vision (IJCV),  2025. (IF: 19.5,CCF-A,人工智能领域顶级期刊)
  5. Nan Gao, Jia Li, Huaibo Huang, Zhi Zeng, Ran He. InfoBFR: Real-World Blind Face Restoration via Information Bottleneck. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2025. (IF: 8.4, CCF-B, 视频处理领域权威期刊)
  6. Junxian Duan, Siyu Liu, Yiming Hao, Huaibo Huang, Ran He. Dual Frequency-Guided Spatiotemporal Feature Learning for Face Forgery Detection. IEEE Transactions on Biometrics, Behavior, and Identity Science (TBIOM), 2025. (IF: 5.0,生物特征识别领域权威期刊)
  7. Junxian Duan, Yuang Ai, Jipeng Liu, Shenyuan Huang, Huaibo Huang, Jie Cao, Ran He. Test-time Forgery Detection with Spatial-Frequency Prompt Learning. International Journal of Computer Vision (IJCV), 2024. (IF: 19.5,CCF-A,人工智能领域顶级期刊)
  8. Xiaoqiang Zhou, Chaoyou Fu, Huaibo Huang, Ran He. Dynamic Graph Memory Bank for Video Inpainting. IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 2024. (IF: 8.4, CCF-B, 视频处理领域权威期刊)
  9. Xiaoqiang Zhou, Huaibo Huang, Zilei Wang, Ran He. RISTRA: Recursive Image Super-resolution Transformer with Relativistic Assessment. IEEE Transactions on Multimedia (TMM), 2024. (IF: 7.3, CCF-A, 多媒体领域权威期刊)
  10. Huaibo Huang, Mandi Luo, Ran He. Memory Uncertainty Learning for Real-World Single Image Deraining. Trans. Pattern Analysis and Machine Intelligence (TPAMI), 2023. (IF: 23.6,CCF-A,人工智能领域顶级期刊)
  11. Chaoyou Fu, Xiang Wu, Yibo Hu, Huaibo Huang, Ran He. DVG-Face: Dual Variational Generation for Heterogeneous Face Recognition. Trans. Pattern Analysis and Machine Intelligence (TPAMI), 2022. (IF: 23.6,CCF-A,人工智能领域顶级期刊)
  12. Jianze Wei, Huaibo Huang, Yunlong Wang, Ran He, Zhenan Sun. Towards More Discriminative and Robust Iris Recognition by Learning Uncertain Factors. IEEE Transactions on Information Forensics & Security (TIFS), 2022. (IF: 6.8,CCF-A,信息安全领域顶级期刊)
  13. Jianze Wei, Yunlong Wang, Huaibo Huang, Ran He, Zhenan Sun, Xingyu Gao. Contextual Measures for Iris Recognition. IEEE Transactions on Information Forensics & Security (TIFS), 2022. (IF: 6.8,CCF-A,信息安全领域顶级期刊)
  14. Mandi Luo, Haoxue Wu, Huaibo Huang, Weizan He, Ran He. Memory-Modulated Transformer Network for Heterogeneous Face Recognition. IEEE Transactions on Information Forensics & Security (TIFS), 2022. (IF: 6.8,CCF-A,信息安全领域顶级期刊)
  15. Aijing Yu, Haoxue Wu, Huaibo Huang, Zhen Lei, Ran He. LAMP-HQ: A Large-Scale Multi-Pose High-Quality Database for NIR-VIS Face Recognition. International Journal of Computer Vision (IJCV), 2021. (IF: 19.5,CCF-A,人工智能领域顶级期刊)
  16. Huaibo Huang, Aijing Yu, Zhenhua Chai, Ran He, Tieniu Tan. Selective Wavelet Attention Learning for Single Image Deraining. International Journal of Computer Vision (IJCV), 2021. (IF: 19.5,CCF-A,人工智能领域顶级期刊)
  17. Xin Ma, Xiaoqiang Zhou, Huaibo Huang, Gengyun Jia, Zhenhua Chai, Xiaolin Wei.  Contrastive Attention Network with the Dense Field Estimation for Face Completion. Pattern Recognition (PR), 2021.  (IF: 8,CCF-B,模式识别领域权威期刊)
  18. Yi Li#, Huaibo Huang#, Jie Cao, Ran He, Tieniu Tan. Disentangled Representation Learning of Makeup Portraits in the Wild. International Journal of Computer Vision (IJCV), 2020, 128: 2166–2184. (Co-first author)(IF: 19.5,CCF-A,人工智能领域顶级期刊)
  19. Xin Zheng, Yanqing Guo, Huaibo Huang, Yi Li, Ran He. A Survey to Deep Facial Attribute Analysis. International Journal of Computer Vision (IJCV), 2020. (IF: 19.5,CCF-A,计算机视觉领域顶级期刊)
  20. Xin Zheng, Huaibo Huang, Yanqing Guo, Ran He.  BLAN: Bi-directional Ladder Attentive Network for Facial Attribute Prediction. Pattern Recognition (PR), 2020. (IF: 8,CCF-B,模式识别领域权威期刊)
  21. Huaibo Huang, Ran He, Zhenan Sun, Tieniu Tan. Wavelet Domain Generative Adversarial Network for Multi-scale Face Hallucination. International Journal of Computer Vision (IJCV), 127(6-7): 763-784, 2019. (IF: 19.5,CCF-A,人工智能领域顶级期刊)
会议论文
  1. Hongbo Wang, Huaibo Huang, Pin Wang, Jinhua Hao, Chao Zhou, Ran He. Coloring the Noise: Adversarial Sobolev Alignment for Faithful Image Super Resolution. International Conference on Machine Learning (ICML), 2026. (CCF-A,人工智能领域顶级会议)
  2. Qihang Fan, Yuang Ai, Huaibo Huang, Ran He. Random Wins All: Rethinking Grouping Strategies for Vision Tokens. Computer Vision and Pattern Recognition (CVPR), 2026. (CCF-A,计算机视觉领域顶级会议)
  3. Mingrui Chen, Hexiong Yang, Haogeng Liu, Huaibo Huang, Ran He. Think 360°: Evaluating the Width-centric Reasoning Capability of MLLMs Beyond Depth. Computer Vision and Pattern Recognition (CVPR), 2026. (CCF-A,计算机视觉领域顶级会议)
  4. Jiayang Sun, Pin Wang, Hongbo Wang, Xinyue Liu, Huaibo Huang, Ran He. Towards Fine-Grained Attribution: Instance-Aware Preference Optimization for Aligning Diffusion Models. Computer Vision and Pattern Recognition (CVPR), 2026. (CCF-A,计算机视觉领域顶级会议)
  5. Xinyue Liu, Jin Liu, Hongbo Wang, Ran He, Huaibo Huang. Think-Then-Generate: Structural Chain-of-Thought Reasoning for Consistent 3D Generation. Computer Vision and Pattern Recognition (CVPR), 2026. (Highlight, CCF-A,计算机视觉领域顶级会议)
  6. Shiran Ge, Chenyi Huang, Yuang Ai, Qihang Fan, Huaibo Huang, Ran He. Expand and Prune: Maximizing Trajectory Diversity for Effective GRPO in Generative Models. Computer Vision and Pattern Recognition (CVPR), 2026. (CCF-A,计算机视觉领域顶级会议)
  7. Xing Cui, Yueying Zou, Zekun Li, Peipei Li, Xinyuan Xu, Xuannan Liu, Huaibo Huang, Ran He. T2Agent: A Tool-augmented Multimodal Misinformation Detection Agent with Monte Carlo Tree Search. AAAI Conference on Artificial Intelligence (AAAI), 2026. (Oral, CCF-A,人工智能领域顶级会议)
  8. Yuang Ai, Qihang Fan, Xuefeng Hu, Zhenheng Yang, Ran He, Huaibo Huang. DiCo: Revitalizing ConvNets for Scalable and Efficient Diffusion Modeling. Neural Information Processing Systems (NeurIPS), 2025.(Spotlight, CCF-A,人工智能领域顶级会议)
  9. ​Xuannan Liu, Zekun Li, Zheqi He, Pei Pei Li, Shuhan Xia, Xing Cui, Huaibo Huang, Xi Yang, Ran He. Video-SafetyBench: A Benchmark for Safety Evaluation of Video LVLMs. Neural Information Processing Systems (NeurIPS), 2025. (CCF-A,人工智能领域顶级会议)
  10. Yuguang Zhang, Qihang Fan, Huaibo Huang*. Vision Transformer with Sparse Scan Prior. ACM International Conference on Multimedia (ACM MM), 2025. (CCF-A,多媒体计算领域顶级会议)
  11. Qihang Fan, Huaibo Huang*, Yuang Ai, Ran He. Rectifying Magnitude Neglect in Linear Attention.  International Conference on Computer Vision (ICCV), 2025. (Highlight, CCF-A,计算机视觉领域顶级会议)
  12. Qihang Fan, Huaibo Huang*, Mingrui Chen, Ran He. Semantic Equitable Clustering: A Simple and Effective Strategy for Clustering Vision Tokens. International Conference on Computer Vision (ICCV), 2025. (CCF-A,计算机视觉领域顶级会议)
  13. Qihang Fan, Huaibo Huang*, Ran He. Breaking the Low-Rank Dilemma of Linear Attention. Computer Vision and Pattern Recognition (CVPR), 2025.  (CCF-A,计算机视觉领域顶级会议)
  14. Xuannan Liu, Zekun Li, Pei Pei Li, Huaibo Huang, Shuhan Xia, etc. MMFakeBench: A Mixed-Source Multimodal Misinformation Detection Benchmark for LVLMs. International Conference on Learning Representations (ICLR), 2025. (CCF-A,人工智能领域顶级会议)
  15. Yuang Ai, Xiaoqiang Zhou, Huaibo Huang*, Xiaotian Han, Zhengyu Chen, Quanzeng You, Hongxia Yang. DreamClear: High-Capacity Real-World Image Restoration with Privacy-Safe Dataset Curation. Neural Information Processing Systems (NeurIPS), 2024. (CCF-A,机器学习领域顶级会议)
  16. Haogeng Liu, Quanzeng You, Xiaotian Han, Yongfei Liu, Huaibo Huang*, Ran He, Hongxia Yang. Visual Anchors Are Strong Information Aggregators For Multimodal Large Language Model.  Neural Information Processing Systems (NeurIPS), 2024. (CCF-A,机器学习领域顶级会议)
  17. Hongbo Wang, Jin Liu, Xiaoqiang Zhou, Jie Cao, Huaibo Huang*, Ran He. Hallo3D: Multi-Modal Hallucination Detection and Mitigation for Consistent 3D Content Generation. Neural Information Processing Systems (NeurIPS), 2024. (CCF-A,机器学习领域顶级会议)
  18. Jin Liu, Huaibo Huang*, Jie Cao, Ran He. ZePo: Zero-Shot Portrait Stylization with Faster Sampling. ACM International Conference on Multimedia (ACM MM). 2024. (CCF-A, 多媒体计算领域顶级会议)
  19. Xuannan Liu, Pei Pei Li, Huaibo Huang, Zekun Li, Xing Cui, et al. FKA-Owl: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs. ACM International Conference on Multimedia (ACM MM). 2024. (CCF-A, 多媒体计算领域顶级会议)
  20. Xing Cui, Zekun Li, Peipei Li, Huaibo Huang, Xuannan Liu, Zhaofeng He. InstaStyle: Inversion Noise of a Stylized Image is Secretly a Style Adviser. European Conference on Computer Vision (ECCV), 2024. (CCF-B, 计算机视觉领域顶级会议)
  21. Tingkai Liu, Yunzhe Tao, Haogeng Liu, Qihang Fan, Ding Zhou, Huaibo Huang, Ran He, Hongxia Yang. DeVAn: Dense Video Annotation for Video-Language Models. Association for Computational Linguistics (ACL), 2024. (CCF-A,自然语言处理领域顶级会议)
  22. Yuang Ai, Huaibo Huang*, Xiaoqiang Zhou, Jiexiang Wang, Ran He. Multimodal Prompt Perceiver: Empower Adaptiveness, Generalizability and Fidelity for All-in-One Image Restoration. Computer Vision and Pattern Recognition (CVPR), 2024. (Corresponding author) (CCF-A,计算机视觉领域顶级会议)
  23. Yuang Ai, Xiaoqiang Zhou, Huaibo Huang*, Lei Zhang, Ran He. Uncertainty-Aware Source-Free Adaptive Image Super-Resolution with Wavelet Augmentation Transformer. Computer Vision and Pattern Recognition (CVPR), 2024.(Corresponding author) (CCF-A,计算机视觉领域顶级会议)
  24. Qihang Fan, Huaibo Huang, Mingrui Chen, Hongmin Liu, Ran He. RMT: Retentive Networks Meet Vision Transformers. Computer Vision and Pattern Recognition (CVPR), 2024. (CCF-A,计算机视觉领域顶级会议)
  25. Zi Wang, Huaibo Huang, Aihua Zheng, Ran He. Heterogeneous Test-time Training for Multi-modal Person Re-identification. AAAI Conference on Artificial Intelligence (AAAI), 2024.(CCF-A,人工智能领域顶级会议)
  26. Qihang Fan, Huaibo Huang, Xiaoqiang Zhou, Ran He. Lightweight Vision Transformer with Bidirectional Interaction. Neural Information Processing Systems (NeurIPS), 2023.(CCF-A,机器学习领域顶级会议)
  27. Rui Wang, Pei Pei Li, Huaibo Huang, Chunshui Cao, Ran He, Zhaofeng He. Learning-to-Rank Meets Language: Boosting Language-Driven Ordering Alignment for Ordinal Classification. Neural Information Processing Systems (NeurIPS), 2023.(CCF-A,机器学习领域顶级会议)
  28. Xiaoqiang Zhou, Huaibo Huang, Zilei Wang, Jie Hu, Ran He, Tieniu Tan. MSRA-SR: Image Super-resolution Transformer with Multi-scale Shared Representation Acquisition. International Conference on Computer Vision (ICCV), 2023. (CCF-A,计算机视觉领域顶级会议)
  29. Peipei Li, Rui Wang, Huaibo Huang, Ran He, Zhaofeng He. Pluralistic Aging Diffusion Autoencoder. International Conference on Computer Vision (ICCV), 2023. (CCF-A,计算机视觉领域顶级会议)
  30. Huaibo Huang, Xiaoqiang Zhou, Jie Cao, Ran He, Tieniu Tan. Vision Transformer with Super Token Sampling. Computer Vision and Pattern Recognition (CVPR), 2023. (CCF-A,计算机视觉领域顶级会议)
  31. Huaibo Huang, Xiaoqiang Zhou, Ran He. Orthogonal Transformer: An Efficient Vision Transformer Backbone with Token Orthogonalization. Neural Information Processing Systems (NeurIPS), 2022. (CCF-A,机器学习领域顶级会议)
  32. Gengyun Jia, Huaibo Huang, Chaoyou Fu, Ran He. Rethinking Image Cropping: Exploring Diverse Compositions from Global Views. Computer Vision and Pattern Recognition (CVPR), 2022. (CCF-A,计算机视觉领域顶级会议)
  33. Xin Xie, Yi Li, Huaibo Huang, Haiyan Fu, Wanwan Wang, Yanqing Guo. Artistic Style Discovery With Independent Components. Computer Vision and Pattern Recognition (CVPR), 2022. (CCF-A,计算机视觉领域顶级会议)
  34. Huaibo Huang, Aijing Yu, Ran He. Memory Oriented Transfer Learning for Semi-Supervised Image Deraining. Computer Vision and Pattern Recognition (CVPR), 2021. (CCF-A,计算机视觉领域顶级会议)
  35. Gege Gao, Huaibo Huang, Chaoyou Fu, Zhaoyang Li, Ran He. Information Bottleneck Disentanglement for Identity Swapping. Computer Vision and Pattern Recognition (CVPR), 2021. (CCF-A,计算机视觉领域顶级会议)
  36. Peipei Li#, Huaibo Huang#, Yibo Hu, Xiang Wu, Ran He, Zhenan Sun. Hierarchical Face Aging through Disentangled Latent Characteristics. European Conference on Computer Vision (ECCV), 2020. (Co-first author)(CCF-B, 计算机视觉领域顶级会议)
  37. Jie Cao, Huaibo Huang, Yi Li, Jingtuo Liu, Ran He, Zhenan Sun. Informative Sample Mining Network for Multi-Domain Image-to-Image Translation. European Conference on Computer Vision (ECCV), 2020. (CCF-B, 计算机视觉领域顶级会议)
  38. Hao Zhu, Huaibo Huang, Yi Li, Aihua Zheng, Ran He. Arbitrary Talking Face Generation via Attentional Audio-Visual Coherence Learning. International Joint Conference on Artificial Intelligence (IJCAI), 2020. (CCF-B,人工智能领域顶级会议)
  39. Chaoyou Fu, Xiang Wu, Yibo Hu, Huaibo Huang, Ran He. Dual Variational Generation for Low-Shot Heterogeneous Face Recognition. Neural Information Processing Systems (NeurIPS), 2019.(CCF-A,机器学习领域顶级会议)
  40. Weikuo Guo, Huaibo Huang, Xiangwei Kong, Ran He. Learning Disentangled Representation for Cross-Modal Retrieval with Deep Mutual Information Estimation. ACM International Conference on Multimedia (ACMMM), 2019.(CCF-A,人工智能领域顶级会议)
  41. Xiang Wu, Huaibo Huang, Vishal Patel, Ran He, Zhenan Sun. Disentangled Variational Representation for Heterogeneous Face Recognition. AAAI Conference on Artificial Intelligence (AAAI), 2019.(CCF-A,人工智能领域顶级会议)
  42. Rui Wang, Huaibo Huang, Xufeng Zhang, Jixin Ma, Aihua Zheng. A Novel Distance Learning for Elastic Cross Modal Audio-Visual Matching. Workshops: 2019 IEEE International Conference on Multimedia and Expo (ICMEW), 2019. (Best Student Paper) (CCF-B, 计算机图形学与多媒体领域权威会议)
  43. Huaibo Huang, Zhihang Li, Ran He, Zhenan Sun, Tieniu Tan. IntroVAE: Introspective Variational Autoencoders for Photographic Image Synthesis. Neural Information Processing Systems (NeurIPS), 2018: 52-63. (CCF-A,机器学习领域顶级会议)
  44. Huaibo Huang, Ran He, Zhenan Sun, Tieniu Tan. Wavelet-SRNet: A Wavelet-based CNN for Multi-scale Face Super Resolution. International Conference on Computer Vision (ICCV), 2017: 1698-1706. (CCF-A,计算机视觉领域顶级会议)
著作
Yi Li, Huaibo Huang, Ran He, Tieniu Tan. Heterogeneous Facial Analysis and Synthesis. Springer, 2020.

科研项目

( 1 ) 隐私-效用协同优化的多模态视觉内容生成研究, 负责人, 国家任务, 2026-01--2029-12

( 2 ) 开放环境下图像增强基础模型研究, 负责人, 地方任务, 2025-01--2027-12

( 3 ) 基于视频⽣成先验的视频处理算法研究, 负责人, 境内委托项目, 2025-01--2025-12

( 4 ) 多模态融合的白内障智能诊断方法和应用研究, 负责人, 地方任务, 2024-09--2026-09

( 5 ) 北京市科技新星创新新星计划, 负责人, 地方任务, 2023-10--2026-10

( 6 ) 基于MindSpore的视觉内容智能合成与鉴别技术研究, 负责人, 境内委托项目, 2022-11--2023-11

( 7 ) 中国科学院青年创新促进会, 负责人, 中国科学院计划, 2022-03--2025-12

( 8 ) 面向复杂场景的小样本视频自动生成研究, 负责人, 境内委托项目, 2021-09--2022-08

( 9 ) 基于解耦表达的高保真人脸图像合成理论和方法研究, 负责人, 国家任务, 2021-01--2023-12

( 10 ) 多媒体混合伪造生成模型的稳定性研究, 负责人, 国家任务, 2020-07--2023-06

( 11 ) 人脸增强、旋转与Sketch转换技术, 负责人, 境内委托项目, 2018-12--2024-06