黄岩 男 博导 中国科学院自动化研究所
国家优青
中国科学院院长特别奖
中国图象图形学学会青年科学家奖
中国人工智能学会优秀博士学位论文奖
电子邮件: yhuang@nlpr.ia.ac.cn
通信地址: 北京市海淀区中关村东路95号
邮政编码: 100190
个人简介
黄岩,国家自然科学基金委优秀青年科学基金获得者,中科院自动化所副研究员。研究方向为视觉-语言理解、多模态机器人、视频分析等,在相关领域的国内外期刊和会议上发表论文共计100余篇,曾获国内外学术会议最佳论文奖3项、国内外主流竞赛冠军4项,担任CVPR领域主席、CVPR和ICCV上3次多模态主题研讨会的共同组织主席。曾获得中国图象图形学学会青年科学家奖、中国科学院院长特别奖、NVIDIA创新研究奖、中国人工智能学会优秀博士论文奖、百度奖学金等。入选中国科协青年人才托举工程、北京市科技新星计划和微软铸星计划。
招生信息
每年招收博士1-2名,建议具有较强自主性和编程能力的同学邮件(yhuang@nlpr.ia.ac.cn)联系我。
指导或者协助指导硕士博士20余人,相关学生曾获得:中科院院长奖、北京市优秀毕业生、自动化所一等奖学金、ICDAR最佳论文提名奖、ICCV2019-VOT国际竞赛冠军、ICCV2019-WIDER国际竞赛冠军、CVPR2022-Habitat国际竞赛冠军等。
招生专业
教育背景
工作经历
工作简历
社会兼职
2020-01-01-2020-07-01,CVPR2020 Workshop on Language & Vision with Applications to Video Understanding, 组织主席
2020-01-01-2020-07-01,CVPR2020 Workshop on Multimodal Learning, 组织主席
2019-05-01-2019-11-30,ICCV2019 Workshop Cross-Modal Learning in Real World, 副秘书长
教授课程
专利与奖励
奖励信息
专利成果
出版信息
在相关领域的国际期刊和会议上发表(含录用)论文共计80余篇,其中领域权威期刊和会议论文共计40余篇。以第一作者身份发表领域顶级期刊TPAMI 4篇、领域顶级会议CVPR 2篇、ICCV 2篇、NeurIPS 2篇、AAAI 1篇。更全的论文列表请参考:https://scholar.google.com/citations?user=6nUJrQ0AAAAJ&hl=zh-CN。
发表著作
部分期刊论文
Dong An, Hanqing Wang, Wenguan Wang, Zun Wang, Yan Huang, Keji He, and Liang Wang. Etpnav: Evolving topological planning for vision-language navigation in continuous environments, IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), accepted, 2024.
Yan Huang, Yuming Wang, and Liang Wang, Efficient Image and Sentence Matching, IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 45(3): 2970-2983, 2023.
Chong Liu, Yuqi Zhang, Hongsong Wang, Weihua Chen, Fan Wang, Yan Huang, Yi-Dong Shen, and Liang Wang, Efficient Token-Guided Image-Text Retrieval with Consistent Multimodal Contrastive Training, IEEE Transactions on Image Processing (IEEE TIP), accepted, 2023.
Zhengxiong Luo, Yan Huang, Shang Li, Liang Wang, and Tieniu Tan, End-to-End Alternating Optimization for Real-World Blind Super Resolution, International Journal of Computer Vision (IJCV), accepted, 2023.
Yan Huang, Jingdong Wang, and Liang Wang, Few-Shot Image and Sentence Matching via Aligned Cross-Modal Memory, IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 44(6): 2968-2983, 2022.
Jianhua Yang, Yan Huang, Kai Niu, Linjiang Huang, Zhanyu Ma, and Liang Wang, Actor and Action Modular Network for Text-based Video Segmentation, IEEE Transactions on Image Processing (IEEE TIP), 31: 4474-4489, 2022.
Hongyuan Yu, Houwen Peng, Yan Huang, Hao Du, Jianlong Fu, Liang Wang, and Haibin Ling, Cyclic Differentiable Architecture Search, IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 45(1): 211-228, 2022.
Zerui Chen, Yan Huang, Hongyuan Yu, and Liang Wang, Learning a Robust Part-Aware Monocular 3D Human Pose Estimator via Neural Architecture Search, International Journal of Computer Vision (IJCV), 130: 56–75, 2022.
Yuchun Fang, Zhengye Xiao, Wei Zhang, Yan Huang, Liang Wang, Nozha Boujemaa, and Donald Geman, Attribute Prototype Learning for Interactive Face Retrieval, IEEE Transactions on Information Forensics and Security (IEEE TIFS), 16: 2593-2607, 2021.
Linjiang Huang, Yan Huang, Wanli Ouyang, and Liang Wang, Two-Branch Relational Prototypical Network for Weakly Supervised Temporal Action Localization, IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 44(9): 5729-5746, 2022.
Linjiang Huang, Yan Huang, Wanli Ouyang, and Liang Wang, Modeling Sub-Actions for Weakly Supervised Temporal Action Localization, IEEE Transactions on Image Processing (IEEE TIP), 30: 5154-5167, 2021.
Yan Huang, Qi Wu, Wei Wang, and Liang Wang, Image and Sentence Matching via Semantic Concepts and Order Learning, IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 42(3): 636-650, 2020.
Kai Niu, Yan Huang, Wanli Ouyang, and Liang Wang, Improving Description-based Person Re-identification by Multi-granularity Image-text Alignments, IEEE Transactions on Image Processing (IEEE TIP), 29: 5542-5556, 2020.
Yan Huang, Wei Wang, and Liang Wang, Video Super-resolution via Bidirectional Recurrent Convolutional Networks, IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), 40(4), 1015-1028, 2018.
Yan Huang, Wei Wang, Liang Wang, and Tieniu Tan, Conditional High-order Boltzmann Machines for Supervised Relation Learning, IEEE Transactions on Image Processing (IEEE TIP), 26(9):4297-4310, 2017.
Yan Huang, Wei Wang, and Liang Wang, Unconstrained Multimodal Multi-Label Learning, IEEE Transactions on Multimedia (IEEE TMM), 17(11):1923-1935, 2015.
部分会议论文
Yunan Zeng, Yan Huang, Jinjin Zhang, Zequn Jie, Zhenhua Chai, Liang Wang. Investigating Compositional Challenges in Vision-Language Models for Visual Grounding. IEEE Computer Vision and Pattern Recognition Conference (CVPR), accepted, 2024. (Highlight)
Keji He, Chenyang Si, Zhihe Lu, Yan Huang, Liang Wang, and Xinchao Wang, Frequency-Enhanced Data Augmentation for Vision-and-Language Navigation, Neural Information Processing Systems (NeurIPS), 2023.
Dong An, Yuankai Qi, Yangguang Li, Yan Huang, Liang Wang, Tieniu Tan, and Jing Shao, BEVBert: Multimodal Map Pre-training for Language-guided Navigation, IEEE International Conference on Computer Vision (ICCV), pp. 2737-2748, 2023.
Jilong Wang, Saihui Hou, Yan Huang, Chunshui Cao, Xu Liu, Yongzhen Huang, and Liang Wang, Causal Intervention for Sparse-View Gait Recognition, ACM Conference on Multimedia (MM), accepted, 2023.
Zhengxiong Luo, Dayou Chen, Yingya Zhang, Yan Huang, Liang Wang, Yujun Shen, Deli Zhao, Jingren Zhou, and Tieniu Tan, VideoFusion: Decomposed Diffusion Models for High-Quality Video Generation, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 10209-10218, 2023.
Ke Han, Shaogang Gong, Yan Huang, Liang Wang, Tieniu Tan, Clothing-Change Feature Augmentation for Person Re-Identification, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 22066-22075, 2023.
Weichen Yu, Tianyu Pang, Qian Liu, Chao Du, Bingyi Kang, Yan Huang, Min Lin, Shuicheng Yan, Bag of tricks for training data extraction from language models, International Conference on Machine Learning (ICML), 2023.
Yan Huang, Yuming Wang, Yunan Zeng, and Liang Wang, MACK: Multimodal Aligned Conceptual Knowledge for Unpaired Image-text Matching, Neural Information Processing Systems (NeurIPS), 2022.
Kai Niu, Linjiang Huang, Yan Huang, Peng Wang, Liang Wang, and Yanning Zhang, Cross-modal Co-occurrence Attributes Alignments for Person Search by Language, ACM Conference on Multimedia (MM), pp. 4426–4434, 2022.
Weichen Yu, Hongyuan Yu, Yan Huang, and Liang Wang, Generalized Inter-class Loss for Gait Recognition, ACM Conference on Multimedia (MM), pp. 141–150, 2022.
Hongyuan Yu, Tian Li, Weichen Yu, Jianguo Li, Yan Huang, Liang Wang, and Alex Liu, Regularized Graph Structure Learning with Semantic Knowledge for Multi-variates Time-Series Forecasting, International Joint Conference on Artificial Intelligence (IJCAI), 2362-2368, 2022.
Zhengxiong Luo, Yan Huang*, Shang Li, Liang Wang, and Tieniu Tan, Learning the Degradation Distribution for Blind Image Super-Resolution, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), accepted, 2022.
Ke Han, Chenyang Si, Yan Huang*, Liang Wang, and Tieniu Tan, Generalizable Person Re-Identification via Self-Supervised Batch Norm Test-Time Adaption, AAAI Conference on Artificial Intelligence (AAAI), accepted, 2022.
Keji He, Yan Huang, Qi Wu, Jianhua Yang, Dong An, Shuanglin Sima, and Liang Wang, Landmark-RxR: Solving Vision-and-Language Navigation with Fine-Grained Alignment Supervision, Neural Information Processing Systems (NeurIPS), 2021.
Dong An, Yuankai Qi, Yan Huang*, Qi Wu, Liang Wang, and Tieniu Tan, Neighbor-view Enhanced Model for Vision and Language Navigation, ACM Conference on Multimedia (MM), accepted, 2021. (Oral)
Zhengxiong Luo, Zhicheng Wang, Yan Huang, Shang Li, Liang Wang, Tieniu Tan, and Erjin Zhou, Rethinking the Heatmap Regression for Bottom-Up Human Pose Estimation, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 13264-13273, 2021.
Zhengxiong Luo, Yan Huang*, Shang Li, Liang Wang, and Tieniu Tan, Unfolding the Alternating Optimization for Blind Super Resolution, Neural Information Processing Systems (NeurIPS), 2020.
Kai Niu, Yan Huang, and Liang Wang, Textual Dependency Embedding for Person Search by Language, ACM Conference on Multimedia (MM), pp. 4032–4040, 2020.
Zerui Chen, Yan Huang, Hongyuan Yu, Bin Xue, Ke Han, Yiru Guo, and Liang Wang, Towards Part-aware Monocular 3D Human Pose Estimation: An Architecture Search Approach, European Conference on Computer Vision (ECCV), accepted, 2020. (Spotlight)
Ke Han, Yan Huang, Zerui Chen, Liang Wang, Tieniu Tan, Prediction, Recovery and Identification: Adaptive Low-Resolution Person Re-Identification, European Conference on Computer Vision (ECCV), accepted, 2020.
Linjiang Huang, Yan Huang, Wanli Ouyang, and Liang Wang, Relational Prototypical Network for Weakly Supervised Temporal Action Localization, AAAI Conference on Artificial Intelligence (AAAI), accepted, 2020. (Oral)
Linjiang Huang, Yan Huang, Wanli Ouyang, and Liang Wang, Part-Level Graph Convolutional Network for Skeleton-Based Action Recognition, AAAI Conference on Artificial Intelligence (AAAI), accepted, 2020. (Oral)
Yan Huang and Liang Wang, ACMM: Aligned Cross-Modal Memory For Few-Shot Image and Sentence Matching, IEEE International Conference on Computer Vision (ICCV), pp. 5774-5783, 2019.
Yan Huang, Yang Long, and Liang Wang, Few-Shot Image and Sentence Matching via Gated Visual-Semantic Embedding, AAAI Conference on Artificial Intelligence (AAAI), pp. 8489-8496, 2019. (Spotlight)
Weining Wang, Yan Huang, and Liang Wang, Language-driven Temporal Activity Localization: A Semantic Matching Reinforcement Learning Model, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 334-343, 2019. (Oral)
Chunfeng Song, Yan Huang, Wanli Ouyang, and Liang Wang, Box-driven Class-wise Region Masking and Filling Rate Guided Loss for Weakly Supervised Semantic Segmentation, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3136-3145, 2019.
Yan Huang, Qi Wu, Chunfeng Song, and Liang Wang, Learning Semantic Concepts and Order for Image and Sentence Matching, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6163-6171, 2018. (Spotlight)
Chunfeng Song, Yan Huang, Wanli Ouyang, and LiangWang, Mask-Guided Contrastive Attention Model for Person Re-Identification, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1179-1188, 2018.
Junbo Wang, Wei Wang, Yan Huang, Liang Wang, and Tieniu Tan, Multimodal Memory Modelling for Video Captioning, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 7512-7520, 2018. (Spotlight)
Junbo Wang, Wei Wang, Yan Huang, Liang Wang, and Tieniu Tan, Hierarchical Memory Modelling for Video Captioning, ACM Conference on Multimedia (MM), pp. 63-71, 2018.
Chenglong Li, Chengli Zhu, Yan Huang, Jin Tang, and Liang Wang, Cross-Modal Ranking with Soft Consistency and Noisy Labels for Robust RGB-T Tracking, European Conference on Computer Vision (ECCV), pp. 831-847, 2018.
Yan Huang, Wei Wang, and Liang Wang, Instance-aware Image and Sentence Matching with Selective Multimodal LSTM, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2310-2318, 2017.
Zhen Zhou, Yan Huang, Wei Wang, Liang Wang, and Tieniu Tan, See the forest for the trees: Joint spatial and temporal recurrent neural networks for video-based person re-identification, IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6776-6785, 2017.
Yan Huang, Wei Wang, and Liang Wang, Bidirectional Recurrent Convolutional Networks for Multi-Frame Super-Resolution, Neural Information Processing Systems (NeurIPS), pp. 235-243, 2015.
Yan Huang, Wei Wang, and Liang Wang, Conditional High-order Boltzmann Machine: A Supervised Learning Model for Relation Learning, IEEE International Conference on Computer Vision (ICCV), pp. 4265-4273, 2015.
科研活动
科研项目
(协助)指导学生及去向
罗正雄,博士,2023年毕业,北京智源人工智能研究院
韩苛,博士,2023年毕业,University of Trento
余玮辰,硕士,2023年毕业,Carnegie Mellon University
俞宏远,博士,2022年毕业,小米集团
陈泽睿,硕士,2021年毕业,INRIA
牛凯,博士,2020年毕业,西北工业大学
王卫宁,博士,2020年毕业,中科院自动化所
黄林江,博士,2020年毕业,北京航空航天大学
宋纯锋,博士,2020年毕业,上海人工智能实验室