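Basic Information
赵冬斌 (Dongbin Zhao)  Male  Doctoral Supervisor  Institute of Automation, Chinese Academy of Sciences
Email: dongbin.zhao@ia.ac.cn
Address: Room 1005, Intelligence Building (智能化大厦), 95 Zhongguancun East Road, Haidian District
Postal code: 100190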

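Research Areas

Deep reinforcement learning, multi-agent reinforcement learning, foundations of artificial intelligence

Intelligent driving, embodied intelligence, game intelligence, foundation model training, AI4S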

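Latest Results


Updated monthly; results from the past month are highlighted with a yellow background. For detailed introductions to these results, follow the WeChat official account: 深度强化学习@CASIA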

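Personnel Awards

  • 2025, 赵冬斌, BAAI Scholar (Class of 2025), Beijing Academy of Artificial Intelligence.
  • 2025, 赵冬斌, Outstanding Supervisor of the Chinese Academy of Sciences, 2025.
  • 2025, 陆润宇, National Scholarship for Doctoral Students.
  • 2025, 刘鑫, National Scholarship for Doctoral Students.
  • 2025, 柴嘉骏, President's Special Award of the Chinese Academy of Sciences (the highest level; the only recipient in the institute that year; spoke as graduate representative at the UCAS and Institute of Automation graduation ceremonies).
  • 2025, 柴嘉骏, Outstanding Graduate of the Institute of Automation, Chinese Academy of Sciences; Outstanding Graduate of Beijing.
  • 2025, 陆润宇, IEEE CIS Student Research Grant (6 to 9 recipients worldwide each year).
  • 2025, Merit Student / Outstanding Student Leader, Institute of Automation, Chinese Academy of Sciences: 方兴, 陈文章, 凃崧峻 / 刘学义, 田帅.
  • 2025, Merit Student / Outstanding Student Leader, School of Artificial Intelligence, Chinese Academy of Sciences: 陆润宇, 赵子杰 / 徐凯旋.
  • 2025, Merit Student / Outstanding Student Leader (enrolled students), University of Chinese Academy of Sciences: 江震南, 邢泽斌, 陈庆 / 秦宇星.
  • 2025, 陈霆鸿, Beijing Natural Science Foundation Undergraduate Research Initiation Program.
  • 2025, 赵冬斌, Li Pei Outstanding Teacher Award of the Chinese Academy of Sciences.
  • 2025, 赵冬斌, named in the 2024 Stanford list of the world's top 2% scientists, in both the career-long and the single-year scientific impact rankings.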


Competition Awards

  • 2025, ICCV NAVSIM v2 End-to-End Driving Challenge, 3rd place, 张启超, 郑宇鹏, 邢泽斌, 杨鹏轩.
  • 2025, CVPR NAVSIM v2 End-to-End Driving Challenge, 4th place (1st among academic teams), 张启超, 郑宇鹏, 邢泽斌, 杨鹏轩. https://opendrivelab.com/challenge2025/
  • 2025, ICRA ManiSkill Vitac Challenge, champion; 秦宇星 participated.


International Journals: Accepted/Published

  1. Zebin Xing, Yupeng Zheng, Qichao Zhang*, Zhixing Ding, Pengxuan Yang, Songen Gu, Zhongpu Xia, Dongbin Zhao, “Mimir: Asynchronous Goal-Driven Diffusion with Uncertainty Propagation for End-to-End Autonomous Driving,” IEEE Robotics and Automation Letters (RA-L), accepted on November 24, 2025. https://github.com/ZebinX/Mimir-Uncertainty-Driving
  2. Yuhui Chen, Haoran Li*, Zhennan Jiang, Haowei Wen, Dongbin Zhao, “TeViR: text-to-video reward with diffusion models for efficient reinforcement learning,” IEEE Transactions on Systems, Man, and Cybernetics: Systems, accepted on November 22, 2025.
  3. Binbin Zuo, Yuanheng Zhu*, Jiankuo Zhao, et al., “CMIP: Combining Constructive Model with Improvement Policy for Large Scale Min-Max Multiple Traveling Salesman Problem,” IEEE Transactions on Intelligent Transportation Systems (TITS), accepted on October 21, 2025.
  4. Yaran Chen, Chenguang Yang, Chaomin Luo, and Dongbin Zhao, “Guest Editorial: Special Issue on Embodied AI in Indoor Robotics: Bridging Perception, Interaction, and Autonomy,” IEEE Transactions on Cognitive and Developmental Systems, Vol. 17, No. 5, pp. 1047-1149, Oct. 2025. DOI: 10.1109/TCDS.2025.3595370. (SCI Q1, IF 5.0)
  5. Boyu Li, Haobin Jiang, Ziluo Ding, Xinrun Xu, Haoran Li, Dongbin Zhao, Zongqing Lu*, “SELU: self-learning embodied multimodal large language models in unknown environments,” Transactions on Machine Learning Research (TMLR), 2025
  6. Yuqian Fu, Yuanheng Zhu, Haoran Li, Zijie Zhao, Jiajun Chai, Dongbin Zhao*, “CPIG: leveraging consistency policy with intention guidance for multi-agent exploration,” IEEE Transactions on Cognitive and Developmental Systems (TCDS), DOI: 10.1109/TCDS.2025.3578001. (SCI Q1, IF 5.0). https://github.com/fyqqyf/CPIG 
  7. Ding Li, Qichao Zhang*, Dongfang Yang, Zhi Wang, Ren Fan, Dongbin Zhao, “IP3: Integrated path-guided prediction and planning for safe autonomous driving,” IEEE Transactions on Vehicular Technology (TVT), Vol. 74, No. 11, pp. 16729-16742, Nov. 2025, DOI: 10.1109/TVT.2025.3576204. (SCI Q1, IF 6.1). https://github.com/ld-av/IP3/.
  8. Jiajun Chai, Zijie Zhao, Yuanheng Zhu, Dongbin Zhao*, “A Survey of Cooperative Multi-Agent Reinforcement Learning for Multi-Task Scenarios,” Artificial Intelligence Science and Engineering (AISE), DOI: 10.23919/AISE.2025.000008.
  9. Xin Liu, Yaran Chen*, Dongbin Zhao*, “Learning future representation with synthetic observations for sample-efficient reinforcement learning,” SCIENCE CHINA Information Sciences (SCIS), https://doi.org/10.1007/s11432-024-4380-4. (SCI Q1, IF 7.3)
  10. Xin Liu, Yaran Chen, Haoran Li, Dongbin Zhao*, “Balancing state exploration and skill diversity in unsupervised skill discovery,” IEEE Transactions on Cybernetics (TCyb), Vol. 55, No. 5, pp. 2234-2247, May 2025. DOI: 10.1109/TCYB.2025.3548821. (SCI Q1, IF 9.4). https://github.com/liuxin0824/ComSD
  11. Yaran Chen, Wenbo Cui, Yuanwen Chen, Mining Tan, Xinyao Zhang, Jinrui Liu, Haoran Li, Dongbin Zhao*, and He Wang, “RoboGPT: an LLM-based long-term decision-making embodied agent for instruction following tasks,” IEEE Transactions on Cognitive and Developmental Systems (TCDS). Vol. 17, No. 5, pp. 1163-1174, Oct. 2025. DOI: 10.1109/TCDS.2025.3543364. (SCI Q1, IF 5.0) https://github.com/Cwb0106/RoboGPT.
  12. Zijie Zhao, Yuanheng Zhu*, Dongbin Zhao*, “Meta learning task representation in multi-agent reinforcement learning: from global inference to local inference,” IEEE Transactions on Neural Networks and Learning Systems (TNNLS), DOI: 10.1109/TNNLS.2025.3540758. (SCI Q1, IF 10.4) https://github.com/zhaozijie2022/mg2l.
  13. Xin Liu, Yaran Chen*, Haoran Li, Boyu Li, Dongbin Zhao*, “Cross-domain random pretraining with prototypes for reinforcement learning,” IEEE Transactions on Systems, Man, and Cybernetics: Systems (TSMCS), Vol. 55, No. 5, pp. 3601 – 3613, May 2025. DOI: 10.1109/TSMC.2025.3541926. (SCI Q1, IF 8.6) https://github.com/liuxin0824/CRPTpro

International Conferences: Accepted/Published

  1. Pengxuan Yang, Yupeng Zheng, Qichao Zhang*, Zhongpu Xia, “WorldRFT: Latent World Model Planning with Reinforcement Fine-Tuning for Autonomous Driving,” AAAI 2026. (CCF A)
  2. Xin Liu, Haoran Li*, Dongbin Zhao, “Videos are Sample-Efficient Supervisions: Behavior Cloning from Videos via Latent Representations,” NeurIPS 2025. (CCF A) https://github.com/liuxin0824/BCV-LR
  3. Songjun Tu, Jiahao Lin, Qichao Zhang*, Xiangyu Tian, Linjing Li, Xiangyuan Lan, Dongbin Zhao, “Learning When to Think: Shaping Adaptive Reasoning in R1-Style Models via Multi-Stage RL,” NeurIPS 2025. (CCF A), https://github.com/ScienceOne-AI/AutoThink
  4. Runyu Lu, Peng Zhang, Ruochuan Shi, Yuanheng Zhu*, Dongbin Zhao, Yang Liu, Dong Wang, Cesare Alippi, “Equilibrium Policy Generalization: A Reinforcement Learning Framework for Cross-Graph Zero-Shot Generalization in Pursuit-Evasion Games,” NeurIPS 2025. (CCF A) https://github.com/Cahemgco/EPG_code
  5. Zijie Zhao, Zhongyue Zhao, Kaixuan Xu, Yuqian Fu, Jiajun Chai, Yuanheng Zhu*, Dongbin Zhao, “Learning and Planning Multi-Agent Tasks via a MoE-based World Model,” NeurIPS 2025. (CCF A) https://github.com/zhaozijie2022/m3w-marl
  6. Ruochuan Shi, Runyu Lu, Yuanheng Zhu*, Dongbin Zhao*, “ARAC: Adaptive Regularized Multi-Agent Soft Actor-Critic in Graph-Structured Adversarial Games,” DAI 2025 oral.
  7. Yuqian Fu, Yuanheng Zhu*, Jiajun Chai, Guojun Yin, Wei Lin, Qichao Zhang, Dongbin Zhao, “RLAE: Reinforcement Learning-Assisted Ensemble for LLMs,” EMNLP 2025 main (CCF B). https://github.com/fyqqyf/RLAE
  8. Weiheng Liu, Yuxuan Wan, Jilong Wang, Yuxuan Kuang, Haoran Li, Dongbin Zhao, Zhizheng Zhang, He Wang, “FetchBot: Object Fetching in Cluttered Shelves via Zero-Shot Sim2Real,” CoRL 2025 oral.
  9. Xueyi Liu, Zuodong Zhong, Qichao Zhang*, Yuxin Guo, Yupeng Zheng, Junli Wang, Dongbin Zhao, “ReasonPlan: Unified Scene Prediction and Decision Reasoning for Closed-loop Autonomous Driving,” CoRL 2025. https://github.com/Liuxueyi/ReasonPlan  
  10. Shuai Tian, Haoran Li*, Dongbin Zhao, “Fast and Accurate Visuomotor Imitation Learning via 2D Consistency Flow Matching Policy,” ICONIP 2025. (CCF C)
  11. Songjun Tu, Qichao Zhang*, Linjing Li, Yuqian Fu, Nan Xu, Wei He, Xiangyuan Lan, Dongmei Jiang, Dongbin Zhao, “Enhancing LLM reasoning with iterative DPO: a comprehensive empirical investigation,” COLM 2025. https://github.com/TU2021/DPO-VP
  12. Shugao Liu, Qichao Zhang, Haoran Li*, Dongbin Zhao, “FusionNav: Enhancing Zero-Shot Object-Goal Navigation via 3D Semantic Fusion and Farsight Value Reasoning,” IEEE SMCC 2025. (CCF C)
  13. Yupeng Zheng, Pengxuan Yang, Zebin Xing, Yuhang Zheng, Pengfei Li, Yinfeng Gao, Qichao Zhang*, Teng Zhang, Zhongpu Xia, Peng Jia, XianPeng Lang, Dongbin Zhao, “World4Drive: Hierarchical Latent World Models for Perception-Free End-to-End Autonomous Driving,” ICCV 2025. (CCF A)
  14. Mengying Lin#, Shugao Liu#, Dingxi Zhang, Yaran Chen, Zhaoran Wang, Haoran Li*, Dongbin Zhao, “Advancing Object-Goal Navigation through LLM-enhanced Object Affinities Transfer,” 2025 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). (CCF C)
  15. Runyu Lu, Yuanheng Zhu*, Dongbin Zhao, “Constrained exploitability descent: finding mixed-strategy Nash equilibrium by offline reinforcement learning,” ICML 2025. (CCF A)
  16. Kaixuan Xu, Jiajun Chai, Sicheng Li, Yuqian Fu, Yuanheng Zhu*, Dongbin Zhao, “DipLLM: Fine-Tuning LLM for Strategic Decision-making in Diplomacy,” ICML 2025. (CCF A) https://github.com/KaiXIIM/dipllm
  17. Yuhui Chen, Shuai Tian, Shugao Liu, Yingting Zhou, Haoran Li*, Dongbin Zhao, “Fine-tuning VLA models via Human-in-the-Loop Consistency Policy,” RSS 2025. https://github.com/cccedric/conrft.
  18. Yuanwen Chen, Haoran Li,  Yaran Chen, Dongbin Zhao, “LeAffordNav: Enhancing open-vocabulary mobile manipulation with LLM-guided exploration and affordance-aware navigation,” ICME 2025. (CCF B)  https://github.com/Cyuanwen/LeAffordNav.
  19. Pengxuan Yang, Yupeng Zheng, Kefei Zhu, Zebin Xing, Pengfei Li, Qichao Zhang*, Zhongpu Xia, Dongbin Zhao, “UncAD: Towards Safe End-to-end Autonomous Driving via Online Map Uncertainty,” ICRA 2025. (CCF B) 
  20. Wenbo Cui, Chengyang Zhao*, Songlin Wei*, Jiazhao Zhang, Haoran Geng, Yaran Chen, Haoran Li, He Wang, “GAPartManip: a large-scale dataset for generalizable and actionable part manipulation with material-agnostic articulated objects,” ICRA 2025. (CCF B)
  21. Xin Liu, Yaran Chen, Haoran Li*, “Sample-efficient unsupervised policy cloning from ensemble self-supervised labeled videos,” ICRA 2025. (CCF B) 
  22. Jiajun Chai, Yuqian Fu, Sicheng Li, Yuanheng Zhu*, Dongbin Zhao, “Empowering LLM Agents with zero-shot optimal decision-making through Q-learning,” ICLR 2025. (CCF A) https://github.com/laq2024/MLAQ.
  23. Jingbo Sun, Songjun Tu, Qichao Zhang*, Haoran Li, Xin Liu, Yaran Chen, Ke Chen, Dongbin Zhao, “Unsupervised zero-shot reinforcement learning via dual-value forward-backward representation,” ICLR 2025.  https://github.com/bofusun/DVFB.
  24. Runyu Lu, Yuanheng Zhu*, Dongbin Zhao, “Divergence-regularized discounted aggregation: equilibrium finding in multiplayer partially observable stochastic games,” ICLR 2025. (CCF A)
  25. Yuqian Fu, Yuanheng Zhu*, Jian Zhao, Jiajun Chai, Dongbin Zhao, “INS: Interaction-aware synthesis to enhance offline multi-agent reinforcement learning,” ICLR 2025. (CCF A). https://github.com/fyqqyf/INS.
  26. Songjun Tu, Jingbo Sun, Qichao Zhang*, Yaocheng Zhang, Jia Liu, Ke Chen, Dongbin Zhao, “In-dataset trajectory return regularization for offline preference-based reinforcement learning,” AAAI 2025. (CCF A). https://github.com/TU2021/DTR.
  27. Jingbo Sun, Songjun Tu, Qichao Zhang*, Ke Chen, Dongbin Zhao*, “Salience-invariant consistent policy learning for generalization in visual reinforcement learning,” AAMAS 2025. (CCF-B, oral) 
  28. Xing Fang, Qichao Zhang*, Haoran Li, Dongbin Zhao, “Consistency policy with categorical critic for autonomous driving,” AAMAS 2025. (CCF-B, oral)
  29. Yaocheng Zhang, Yuanheng Zhu*, Yuqian Fu, Songjun Tu, Dongbin Zhao, “Offline goal-conditioned reinforcement learning with elastic-subgoal diffused policy learning,” AAMAS 2025. (CCF-B, oral)  https://github.com/zhyaoch/ESD.
  30. Songjun Tu, Qichao Zhang*, Dongbin Zhao, “Online preference-based reinforcement learning with self-augmented feedback from large language model,” AAMAS 2025. (CCF-B, oral) https://github.com/TU2021/RL-SaLLM-F.

Book Chapters

  1.  陈亚冉, 李楠楠, 丁子祥, 赵冬斌, 神经网络架构搜索 (Neural Architecture Search), Tsinghua University Press, published in 2025.
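Team Member Talks
  1. Jan. 6, 2025, Exploration and Practice of Artificial Intelligence Methods for High-Level Autonomous Driving, Zhongguancun Intelligent Connected Vehicle Innovation and Development Forum, Beijing, Dongbin Zhao.
  2. Jan. 11, 2025, From Reinforcement Learning to Large Models and Embodied Intelligence, Inaugural Meeting of the IEEE Computational Intelligence Society Zhengzhou Chapter & Computational Intelligence Frontier Forum, Zhengzhou, Dongbin Zhao.
  3. Jan. 14, 2025, Progress and Challenges of Supervised-Learning-Based End-to-End Autonomous Driving, 4th Global Autonomous Driving Summit, Beijing, Qichao Zhang.
  4. Feb. 7, 2025, Reinforcement Learning Assisted Large Models and Embodied Intelligence, 13th International Conference on Intelligent Control and Information Processing (ICICIP 2025), Abu Dhabi, UAE & Muscat, Oman, Dongbin Zhao.
  5. Mar. 22, 2025, Theory and Applications of Multi-Agent Reinforcement Learning for Multi-Task Scenarios, 4th Frontier Forum on Intelligent Optimization and Decision-Making, Beijing, Dongbin Zhao.
  6. Mar. 29, 2025, Reinforcement-Learning-Based Post-Training of Vision-Language-Action Models, China Embodied AI Conference, Beijing, Haoran Li.
  7. Apr. 26, 2025, High-Level Autonomous Driving Based on Artificial Intelligence Methods, 2025 Chongqing Jiaotong University Frontier Forum on Neural Networks and Intelligent Control, Chongqing, Dongbin Zhao.
  8. Apr. 27, 2025, Reinforcement Learning Based on Generative Models, 2025 Southwest University Frontier Forum on Intelligent System Perception and Control, Chongqing, Dongbin Zhao.
  9. May 9, 2025, Advances in Reinforcement Learning Algorithms and Their Applications to Autonomous Driving, Pre-conference workshop on Reinforcement Learning and Adaptive Dynamic Programming, IEEE 14th Data Driven Control and Learning System Conference (DDCLS’25), Wuxi, China, Qichao Zhang.
  10. May 14, 2025, Deep Reinforcement Learning Empowering Intelligent Industry Applications, Launch Forum of the Proof-of-Concept Laboratory for the Aggregated Intelligence Industry (聚合智能产业概念验证实验室), Beijing, Dongbin Zhao.
  11. May 24, 2025, Reinforcement-Learning-Based Embodied Intelligence for Robots, 3rd Shandong Provincial Conference on Computational Intelligence, Xuzhou, Dongbin Zhao.
  12. June 14, 2025, Multi-Agent Decision Intelligence in Open Environments, 4th Intelligent Decision-Making Forum, Intelligent Learning and Games Forum, Nanjing, Yuanheng Zhu.
  13. June 14, 2025, Reinforcement-Learning-Based Post-Training of Vision-Language-Action Models, 4th Intelligent Decision-Making Forum, Frontier Technologies of Embodied Intelligence Forum, Nanjing, Haoran Li.
  14. July 8, 2025, Deep Reinforcement Learning and Embodied Intelligence, Symposium on Artificial Intelligence and Learning Systems, Fenghua, Ningbo, Dongbin Zhao.
  15. Aug. 2, 2025, Reinforcement Learning in Embodied Intelligence, 3rd Summit Forum on Large AI Model Technology, Hefei, Dongbin Zhao.
  16. Sept. 20, 2025, Status and Outlook of VLA for Embodied Manipulation, 6th China Intelligent Robotics Academic Annual Conference, Nantong, Dongbin Zhao.
  17. Sept. 20, 2025, Exploring the Deep Thinking Capabilities of Large Language Models, RL China 2025, Scientific Agents Forum, Beijing, Qichao Zhang.
  18. Sept. 21, 2025, Applications of Reinforcement Learning in Multimodal Embodied Large Models, RL China 2025, Multimodal Agents Forum, Beijing, Haoran Li.
  19. Sept. 26, 2025, Multi-Agent Decision Intelligence in Open Environments, 13th China (Mianyang) Science and Technology City International High-Tech Expo and New Quality Productive Forces Artificial Intelligence Conference and Matchmaking Meeting, China Association of Productivity Promotion Centers, Mianyang, Yuanheng Zhu.
  20. Sept. 28, 2025, Exploration and Practice of End-to-End Autonomous Driving, 2025 Innovation and Development Forum on Vehicles, Machines, and People (车机人创新发展论坛), Beijing, Dongbin Zhao.
  21. Oct. 23, 2025, Practice and Exploration of End-to-End Autonomous Driving, 32nd Annual Congress of the China Society of Automotive Engineers, Chongqing, Dongbin Zhao.
  22. Oct. 24, 2025, End-to-End Autonomous Driving: From Imitation Learning to Reinforcement Learning, 2025 China Conference on Vehicle Control and Intelligence, Pre-conference Workshop on Trustworthy Autonomous Vehicles, Qingdao, Qichao Zhang.
  23. Oct. 29, 2025, Reinforcement Learning Empowering Embodied Intelligence, UCAS Graduate Lecture Series on Scientific Frontiers, Fall Semester of the 2025-2026 Academic Year, Beijing, Dongbin Zhao.
  24. Nov. 6, 2025, Practice and Exploration of Embodied Intelligence, Artificial Intelligence Application Lecture Hall of the Beijing Software and Information Services Association, Beijing, Dongbin Zhao.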


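Admissions

Program 1: Control Theory and Control Engineering -- Swarm Intelligence and Adversarial Games

Program 2: Pattern Recognition -- Theory and Methods of Artificial Intelligence


Research Directions for Admission
Deep reinforcement learning, intelligent driving, game AI, robotics
Artificial intelligence, deep reinforcement learning, multi-agent games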

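Education

1996-09--2000-04   Harbin Institute of Technology   Ph.D.
1994-09--1996-07   Harbin Institute of Technology   M.S.
1990-09--1994-07   Harbin Institute of Technology   B.S.

Overseas Study and Work
Aug. 2007 - Aug. 2008, University of Arizona, Visiting Scholar, state-sponsored study abroad program of the China Scholarship Council.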

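Work Experience

Employment History
2014-01~2014-02, Agency for Science, Technology and Research (A*STAR), Singapore, Visiting Scholar
2012-11~present, Institute of Automation, Chinese Academy of Sciences, Professor and Doctoral Supervisor
2002-04~2012-10, Institute of Automation, Chinese Academy of Sciences, Associate Professor; Master's Supervisor, later Doctoral Supervisor
2000-05~2002-01, Tsinghua University, Postdoctoral Researcher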
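
Professional Service
2025-02-28-present, Chinese Association for Artificial Intelligence, Council Member
2022-09-01-present, Technical Committee on Intelligent Adaptive Cooperative Optimization Control, Chinese Association for Artificial Intelligence, Secretary-General
2022-01-01-present, Technical Committee on “Data-Driven, Learning and Optimization”, Chinese Association of Automation, Vice Chair
2022-01-01-present, IEEE Computational Intelligence Magazine, Associate Editor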
2021-09-01-2022-08-31,IEEE Conference on Games, General Chair
2021-01-01-2021-07-22,The International Joint Conference on Neural Networks (IJCNN), Shenzhen, China, July 18-22, 2021, Competition Chair
2020-07-19-2020-07-24,IEEE World Congress on Computational Intelligence (WCCI 2020), Glasgow, UK, July 19 -24, 2020, Awards Chair
2020-03-01-present, IEEE Transactions on Artificial Intelligence, Associate Editor
2020-01-01-2020-12-31,IEEE CIS Distinguished Lectures Program, Chair
2019-12-11-2019-12-16,The 10th International Conference on Intelligent Control and Information Processing (ICICIP 2019), Marrakesh, Morocco, Program Chair
2019-12-06-2019-12-09,IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2019), Xiamen, China, Program Chair
2019-07-13-2019-07-18,IEEE International Joint Conference on Neural Networks (IJCNN 2019), Budapest, Hungary, Program Co-Chair
2019-05-04-2019-05-06,IEEE International Conference on Computational Intelligence for Financial Engineering and Economics (CIFEr 2019), Shenzhen, China, General Co-Chair
2019-01-01-2019-12-31,IEEE CIS Technical Activities Strategy Planning Sub-Committee, Chair
2018-12-01-2018-12-04,The 25th International Conference on Neural Information Processing (ICONIP 2018), Siem Reap, Cambodia, Dec 1-4, 2018, Tutorial Chair
2018-11-18-2018-11-21,IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2018), Bangalore, India, Nov. 18 -21, 2018, Program Chair
2018-09-01-2019-08-31, IEEE Computational Intelligence Magazine special issue on “Deep Reinforcement Learning and Games”, Lead Guest Chair
2018-06-29-2018-07-06,2018 Eighth International Conference on Information Science and Technology (ICIST 2018), Cordoba, Granada, and Seville, Spain during June 30-July 6, 2018, Program Chair
2018-05-31-2018-12-31,IEEE Transactions on Neural Networks and Learning Systems special issue on “Deep Reinforcement Learning and Adaptive Dynamic Programming”, Lead Guest Editor
2018-03-01-present, IEEE Transactions on Cybernetics, Associate Editor
2017-11-26-2017-11-30,IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2017), Honolulu, Hawaii, USA, Program Chair
2017-11-13-2017-11-17,The 24th International Conference on Neural Information Processing (ICONIP 2017), Guangzhou, China, Program Chair
2017-07-05-2017-07-27,2017 IEEE CIS Summer School on Computational and Artificial Intelligence, Chair
2016-12-31-2017-12-31, IEEE Computational Intelligence Society Beijing Chapter, Chair
2016-12-05-2016-12-08,IEEE Symposium on Adaptive Dynamic Programming and Reinforcement Learning (ADPRL 2016), Athens, Greece, Program Chair
2016-07-25-2016-07-29, IEEE World Congress on Computational Intelligence (WCCI 2016), Vancouver, Canada, Publicity Co-chair
2016-06-11-2016-06-14,The 13th World Congress on Intelligent Control and Automation (WCICA 2016), Guilin, China, Program Co-Chair
2015-10-15-2015-10-18,12th International Symposium on Neural Networks (ISNN 2015), Jeju, Korea, Program Co-Chair
2015-04-24-2015-04-26,The 5th International Conference on Information Science and Technology (ICIST 2015), Changsha, China, Program Chair
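2015-01-01-present, Artificial Intelligence Review, Associate Editor
2014-12-31-2016-12-31, IEEE Computational Intelligence Society Adaptive Dynamic Programming and Reinforcement Learning Technical Committee, Chair
2014-12-31-2015-12-31, IEEE Computational Intelligence Society Travel Grant Committee, Chair
2014-12-31-2016-12-31, IEEE Computational Intelligence Society Multimedia Committee, Chair
2014-12-31-2016-12-31, IEEE Computational Intelligence Society Beijing Chapter, Vice Chair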
2014-12-09-2014-12-12,IEEE Symposiums Series on Computational Intelligence (SSCI 2014), Atlanta, USA, Poster Chair
2014-07-06-2014-07-11,IEEE World Congress on Computational Intelligence (WCCI 2014), Beijing, China, Finance Co-Chair
2014-07-06-2014-07-11,IEEE CIS Summer School on Automated Computational Intelligence, Beijing, China, Chair
2013-12-31-2020-12-31, IEEE Computational Intelligence Magazine, Associate Editor
2013-06-09-2013-06-11,The 4th International Conference on Intelligent Control and Information Processing (ICICIP 2013), Beijing, China, Program Chair
2012-12-31-2014-12-30, IEEE CIS Newsletter, Editor
2012-07-11-2012-07-14,International Symposium on Neural Networks (ISNN 2012), Shenyang, China, Registration Chair
2012-07-11-2012-07-14,Brain Inspired Cognitive Systems (BICS 2012), Shenyang, China, Finance Chair
2011-12-31-2021-12-31,IEEE Transactions on Neural Networks and Learning Systems, Associate Editor
2011-11-01-present, Cognitive Computation, Associate Editor
2010-09-30-2019-12-31, IEEE Senior Member

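Courses Taught

Evolutionary Computation
Reinforcement Learning
Computational Intelligence
Undergraduate Graduation Project (Computer Science and Technology)
Intelligent Control
Fundamentals and Applications of Intelligent Control Theory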

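Students Supervised

Former Students

田艺  Master's student  081101 - Control Theory and Control Engineering

胡朝辉  Master's student  081101 - Control Theory and Control Engineering

戴钰桀  Ph.D. student  081101 - Control Theory and Control Engineering

苏永生  Master's student  081101 - Control Theory and Control Engineering

张震  Ph.D. student  081101 - Control Theory and Control Engineering

王滨  Ph.D. student  081101 - Control Theory and Control Engineering

朱圆恒  Ph.D. student  081101 - Control Theory and Control Engineering

王海涛  Master's student  081101 - Control Theory and Control Engineering

夏中谱  Ph.D. student  081101 - Control Theory and Control Engineering

张启超  Ph.D. student  081101 - Control Theory and Control Engineering

吕乐  Ph.D. student  081101 - Control Theory and Control Engineering

卜丽  Ph.D. student  081101 - Control Theory and Control Engineering

陈亚冉  Ph.D. student  081101 - Control Theory and Control Engineering

唐振韬  Ph.D. student  081101 - Control Theory and Control Engineering

邵坤  Ph.D. student  081101 - Control Theory and Control Engineering

李栋  Ph.D. student  081101 - Control Theory and Control Engineering

卢毅  Ph.D. student  081101 - Control Theory and Control Engineering

李浩然  Ph.D. student  081101 - Control Theory and Control Engineering

丁子祥  Ph.D. student  081203 - Computer Application Technology

刘育琦  Ph.D. student  081101 - Control Theory and Control Engineering

李伟凡  Ph.D. student  081104 - Pattern Recognition and Intelligent Systems

胡光政  Ph.D. student  081203 - Computer Application Technology

李楠楠  Ph.D. student  081101 - Control Theory and Control Engineering

王俊杰  Ph.D. student  081101 - Control Theory and Control Engineering

李丁  Ph.D. student  081203 - Computer Application Technology

刘民颂  Ph.D. student  081101 - Control Theory and Control Engineering

刘莎莎  Master's student  085410 - Artificial Intelligence

马名骏  Master's student  085410 - Artificial Intelligence

郭又天  Master's student  085211 - Computer Technology

柴嘉骏  Ph.D. student  081101 - Control Theory and Control Engineering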

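Current Students

陆润宇  Ph.D. student  081203 - Computer Application Technology

范昌易  Master's student  085410 - Artificial Intelligence

赵子杰  Ph.D. student  081203 - Computer Application Technology

傅宇千  Ph.D. student  081104 - Pattern Recognition and Intelligent Systems

徐凯旋  Ph.D. student  081203 - Computer Application Technology

田帅  Master's student  081104 - Pattern Recognition and Intelligent Systems

刘鑫  Ph.D. student  081104 - Pattern Recognition and Intelligent Systems

江震南  Ph.D. student  081203 - Computer Application Technology

孙敬博  Ph.D. student  081101 - Control Theory and Control Engineering

陈宇辉  Ph.D. student  081101 - Control Theory and Control Engineering

陈庆  Master's student  085410 - Artificial Intelligence

凃崧峻  Ph.D. student  081101 - Control Theory and Control Engineering

刘学义  Ph.D. student  081101 - Control Theory and Control Engineering

崔文博  Ph.D. student  081101 - Control Theory and Control Engineering

李博宇  Ph.D. student  081101 - Control Theory and Control Engineering

刘卫恒  Ph.D. student  081104 - Pattern Recognition and Intelligent Systems

李思成  Ph.D. student  081101 - Control Theory and Control Engineering

郑宇鹏  Ph.D. student  081101 - Control Theory and Control Engineering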