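Basic Information

Hailong Shi (石海龙)  Male  Doctoral Supervisor  Institute of Microelectronics, Chinese Academy of Sciences

Email: shihailong@ime.ac.cn

Address: 3 Beitucheng West Road, Chaoyang District, Beijing

Postal code: 100029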

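Research Areas

Research directions include multimodal understanding and generation, inference acceleration for large models, and embodied intelligence:

1. Multimodal understanding and generation: image and video understanding and generation in complex open-world scenes, multimodal content understanding and generation based on multi-agent collaboration, and fine-tuning and post-training of domain-specific large models;

2. Inference acceleration for large models: inference acceleration techniques for multimodal large models, including model architecture design, model compression, pruning and quantization, and hardware-software co-optimization;

3. Embodied intelligence: key technologies for embodied intelligence, including robot motion control, vision-language-action (VLA) models, vision-and-language navigation (VLN), and world models.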

Admissions

Each year the group admits 1-2 doctoral students and 1-2 master's students, plus several interns.


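Requirements for recommended-admission (推免) students:

1. Eligible for recommended admission, majoring in a computer-related field, with solid mathematics, English, programming, and logical reasoning skills;

2. Applicants who published papers in artificial intelligence or related fields as undergraduates are preferred;

3. Recommended-admission students are encouraged to join the lab as visiting students in their senior year, or to complete their graduation project remotely, to build a solid foundation and ease the transition into research;

4. Students with science and technology innovation experience as undergraduates are especially welcome to get in touch!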


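The group has well-established collaborations with universities in China and abroad and with internet companies (including Microsoft, Tencent, Alibaba, Ant Group, Meituan, and JD.com). Outstanding students can be recommended for research internships at these companies during their studies, and outstanding doctoral students can be recommended for exchange study abroad.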


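Applications are welcome!

Education

2010-09--2015-01   Institute of Computing Technology, Chinese Academy of Sciences   Ph.D.
2008-09--2010-06   Wuhan University   Master's degree
2004-09--2008-06   Wuhan University   Bachelor's degree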

Publications

Journal Papers

  • [ACM TIST] Xingyu Gao, Bocheng Pan, Zhenyu Chen, Baobin Zhang, Fei Sun, Hailong Shi*. Multi-Modal Collaboration Evaluation of Large Language Models via Image Captioning, ACM Transactions on Intelligent Systems and Technology, Accepted. Impact factor: 6.6
  • [ACM TOMM] Jiefan Qiu, Dongfu Zhu, Xingyu Gao, Mengqi Jiang, Jiahan Song, Hailong Shi*. mmWave Radar-based Personalized Multi-object Vital Signs Monitoring, ACM Transactions on Multimedia Computing, Communications, and Applications, 2026. Impact factor: 6.0
  • [IEEE TIFS] Jinsheng Xiao, Hao Ma, Ruidi Chen, Xingyu Gao, Hailong Shi*, Zhongyuan Wang. STKPS-Net: Spatio-Temporal Key Patch Selection Network for Anomalous Action Recognition, IEEE Transactions on Information Forensics and Security, 2026. CAS Q1 TOP journal; impact factor: 8.0; CCF-A
  • [ACM TIOT] Bocheng Pan, Hailong Shi*, Xingyu Gao. MPVCD: Multi-Perspective Visual Contrastive Decoding for Reliable Assistance, ACM Transactions on Internet of Things, 2026. Impact factor: 3.7
  • [IEEE TCSVT] Xingyu Gao, Zuolei Li, Hailong Shi*, Zhenyu Chen, Peilin Zhao. Scribble-Supervised Video Object Segmentation via Scribble Enhancement, IEEE Transactions on Circuits and Systems for Video Technology, 2025. CAS Q1 TOP journal; impact factor: 11.1
  • [ACM TOMM] Shaojun Zhu, Bincheng Zhu, Kaikai Chi, Jiefan Qiu, Hailong Shi, Xingyu Gao. Maximizing Long-Term Task Completion Ratio of UAV-Enabled Wirelessly-Powered MEC Systems, ACM Transactions on Multimedia Computing, Communications, and Applications, 2025. Impact factor: 6.0
  • [Life Sciences] Zizheng Suo#, Bocheng Pan#, Hailong Shi, Linhui Ma, Yuxiang Zheng, Wenjie Xu, Lina Lin, Enze Zhang, Lijuan Wang, Mingzhu Zhang, Yinyin Qu, Hui Zheng, Xingyu Gao, Cheng Ni. HL-BscPF: Hybrid learning facilitates brain cell auto-identification in multiple pathologies, Life Sciences, 2025. Impact factor: 5.1
  • [IEEE TNNLS] Hang Ran, Xingyu Gao, Lusi Li, Weijun Li, Songsong Tian, Gang Wang, Hailong Shi, Xin Ning. Brain-Inspired Fast- and Slow-Update Prompt Tuning for Few-Shot Class-Incremental Learning, IEEE Transactions on Neural Networks and Learning Systems, 2024. CAS Q1 TOP journal; impact factor: 8.9


Conference Papers

  • [CVPR 2026] Tong Xu, Hailong Shi*, Xingyu Gao. SCoRe: Salience-Coverage Reduction for Vision Token Pruning in Vision-Language Models, The IEEE/CVF Conference on Computer Vision and Pattern Recognition, Denver, USA, 3rd-7th June, 2026. CCF-A
  • [ACL 2026 Findings] Zhuoning Zhu, Xingyu Gao, Hailong Shi*. MENTOR: Mitigating Identity Drift in Dynamic Role-Playing via Dual-Chain Structured Memory, The 64th Annual Meeting of the Association for Computational Linguistics, San Diego, California, July 2-7, 2026.
  • [ACM MM 2025] Bocheng Pan, Hailong Shi*, Xingyu Gao. DR-VQA: Decompose-then-Reconstruct for Visual Question Answering in BLV Assistance, ACM International Conference on Multimedia, Dublin, Ireland, Oct 27th-31st, 2025. CCF-A
  • [ICML 2025] Wanjin Feng, Xingyu Gao, Wenqian Du, Hailong Shi, Peilin Zhao, Pengcheng Wu, Chunyan Miao. Efficient Parallel Training Methods for Spiking Neural Networks with Constant Time Complexity, Forty-second International Conference on Machine Learning, Vancouver, Canada, 13th-19th July, 2025. CCF-A
  • [IJCAI 2024] Huan Li, Hailong Shi*, Xingyu Gao. A Coarse-To-Fine Fusion Network for Event-Based Image Deblurring, The 33rd International Joint Conference on Artificial Intelligence, Jeju, South Korea, 3rd-9th August, 2024. CCF-A
  • [ECCV 2024] Qi Guo, Hailong Shi*, Huan Li, Jinsheng Xiao, Xingyu Gao. REDIR: Refocus-free Event-based De-occlusion Image Reconstruction, The 18th European Conference on Computer Vision, MiCo Milano, Italy, Sep 29th-Oct 4th, 2024. CCF-B; one of the three top conferences in computer vision
  • [ECIR 2024] Luo Ji#, Jiayu Mao#, Hailong Shi*, Qian Li, Hongxia Yang. An Adaptive Framework of Geographical Group-Specific Network on O2O Recommendation, The 46th European Conference on Information Retrieval, Glasgow, Scotland, 24th-28th March, 2024. CCF-C
  • [ICDM 2023] Wanjin Feng, Hailong Shi, Peilin Zhao, Xingyu Gao. Mixtron: Bandit Online Multiclass Prediction with Implicit Feedback, International Conference on Data Mining, Shanghai, China, 1st-4th December, 2023. CCF-B


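Patents

  • Visual question answering method, apparatus, and computing device cluster. 202511528406.1
  • A meta-prompt-based, knowledge-driven few-shot tympanic membrane image recognition method. 202510811129.9
  • An artificial-intelligence-based few-shot tympanic membrane image recognition and diagnosis system. 202411932026.X
  • An image classification method and system based on a hierarchical competitive spiking neural network. 202411330049.3
  • An event image deblurring method and system based on multimodal information fusion. 202410680714.5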
  • A global-vision-guided image captioning method based on a cross-modal large model. 202410293678.7
  • Electric power defect recognition method, apparatus, device, medium, and product. 202311558165.6

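Research Activities

  • Distributed computing architecture and core chips supporting swarm intelligence, Chinese Academy of Sciences High-Level Talent Recruitment Program, principal investigator
  • Research on methods for building multi-user concurrent applications for large-scale, dynamic, heterogeneous Internet of Things, National Natural Science Foundation of China Young Scientists Fund, principal investigator
  • Research on adaptive fusion methods for multi-source, heterogeneous, asynchronous information, Open Fund of the National Key Laboratory of Inertial Measurement, principal investigator
  • Brain-inspired chip architecture supporting online learning, National Science and Technology Innovation 2030 Major Project, sub-project technical lead
  • Development of core chips for medium- and high-speed sensor networks, National Science and Technology Major Project, sub-project technical lead
  • Research on Sea-Cloud computing system architecture and key supporting technologies, Chinese Academy of Sciences Strategic Priority Research Program, core team member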