Published Papers
(1) Deep Reinforcement Learning for Two-Player Fighting Games Based on an Opponent Pool, Control Theory & Applications (控制理论与应用), 2024, corresponding author
(2) FM3Q: Factorized Multi-Agent MiniMax Q-Learning for Two-Team Zero-Sum Markov Game, IEEE Transactions on Emerging Topics in Computational Intelligence, 2024, corresponding author
(3) Enhancing Reinforcement Learning via Transformer-based State Predictive Representations, IEEE Transactions on Artificial Intelligence, 2024, corresponding author
(4) Stabilizing Diffusion Model for Robotic Control with Dynamic Programming and Transition Feasibility, IEEE Transactions on Artificial Intelligence, 2024, corresponding author
(5) MAT: Morphological Adaptive Transformer for Universal Morphology Policy Learning, IEEE Transactions on Cognitive and Developmental Systems, 2024, third author
(6) Boosting On-Policy Actor–Critic With Shallow Updates in Critic, IEEE Transactions on Neural Networks and Learning Systems, 2024, corresponding author
(7) Research Progress in Deep Reinforcement Learning for Adversarial Games, Communications of the CCF (中国计算机学会通讯), 2023, corresponding author
(8) NVIF: Neighboring Variational Information Flow for Cooperative Large-Scale Multiagent Reinforcement Learning, IEEE Transactions on Neural Networks and Learning Systems, 2023, corresponding author
(9) Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning, IEEE Transactions on Neural Networks and Learning Systems, 2023, corresponding author
(10) A Hierarchical Deep Reinforcement Learning Framework for 6-DOF UCAV Air-to-Air Combat, IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2023, corresponding author
(11) UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios, IEEE Transactions on Neural Networks and Learning Systems, 2023, third author
(12) Enhanced Rolling Horizon Evolution Algorithm With Opponent Model Learning: Results for the Fighting Game AI Competition, IEEE Transactions on Games, 2023, second author
(13) NeuronsMAE: A Novel Multi-Agent Reinforcement Learning Environment for Cooperative and Competitive Multi-Robot Tasks, 2023 International Joint Conference on Neural Networks (IJCNN), 2023, fourth author
(14) Online Minimax Q Network Learning for Two-Player Zero-Sum Markov Games, IEEE Transactions on Neural Networks and Learning Systems, 2022, first author
(15) Intelligent decision making approaches for real time fighting game, Control Theory & Applications (控制理论与应用), 2022, third author
(16) Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target, Complex & Intelligent Systems, 2022, corresponding author
(17) Soft Contrastive Learning with Q-irrelevance Abstraction for Reinforcement Learning, IEEE Transactions on Cognitive and Developmental Systems, 2022, fourth author
(18) Empirical Policy Optimization for n-Player Markov Games, IEEE Transactions on Cybernetics, 2022, first author
(19) UNMAS: Multiagent Reinforcement Learning for Unshaped Cooperative Scenarios, IEEE Transactions on Neural Networks and Learning Systems, 2021, third author
(20) Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors, IEEE Transactions on Automation Science and Engineering, 2021, corresponding author
(21) Missile guidance with assisted deep reinforcement learning for head-on interception of maneuvering target, Complex & Intelligent Systems, 2021, second author
(22) Decentralized Event-Driven Constrained Control Using Adaptive Critic Designs, IEEE Transactions on Neural Networks and Learning Systems, 2021, second author
(23) Event-Triggered Communication Network With Limited-Bandwidth Constraint for Multi-Agent Reinforcement Learning, IEEE Transactions on Neural Networks and Learning Systems, 2021, corresponding author
(24) Enhanced Rolling Horizon Evolution Algorithm with Opponent Model Learning: Results for the Fighting Game AI Competition, 2020, second author
(25) LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control, IEEE Transactions on Intelligent Transportation Systems, 2020, first author
(26) Synthesis of Cooperative Adaptive Cruise Control With Feedforward Strategies, IEEE Transactions on Vehicular Technology, 2020, corresponding author
(27) Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward, 2020 International Joint Conference on Neural Networks (IJCNN), 2020, second author
(28) Online Minimax Q Network Learning Algorithm for Solving Two-Player Zero-Sum Markov Games, IEEE Transactions on Neural Networks and Learning Systems, 2020, first author
(29) An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game, 2020 International Joint Conference on Neural Networks (IJCNN), 2020, second author
(30) Design of Cooperative Adaptive Cruise Control Based on Feedforward Strategies, IEEE Transactions on Vehicular Technology, 2020, first author
(31) Enhanced Rolling Horizon Evolution Algorithm and Opponent Modeling, IEEE Transactions on Games, 2020, first author
(32) Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control, IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2020, first author
(33) Control-Limited Adaptive Dynamic Programming for Multi-Battery Energy Storage Systems, IEEE Transactions on Smart Grid, 2019, first author
(34) StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning, IEEE Transactions on Emerging Topics in Computational Intelligence, 2019, second author
(35) StarCraft Micromanagement Control by Combining Reinforcement Learning and Curriculum Transfer Learning, IEEE Transactions on Emerging Topics in Computational Intelligence, 2019, first author
(36) LMI-Based Design of a String-Stable Controller for Cooperative Adaptive Cruise Control Systems, IEEE Transactions on Intelligent Transportation Systems, 2019, first author
(37) Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming, 2019 International Joint Conference on Neural Networks (IJCNN), 2019, corresponding author
(38) Vision-Based Driving in an Open-Source Racing Car Simulator Using Deep and Reinforcement Learning, Journal of Ambient Intelligence and Humanized Computing, 2019, first author
(39) Design of Multi-Battery Storage Systems Using Control-Limited Adaptive Dynamic Programming, IEEE Transactions on Smart Grid, 2019, first author
(40) Invariant Adaptive Dynamic Programming for Solving Optimal Control of Discrete-Time Systems, IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2019, first author
(41) Adaptive Optimal Control of Heterogeneous CACC System With Uncertain Dynamics, IEEE Transactions on Control Systems Technology, 2019, first author
(42) Adaptive Optimal Control of Heterogeneous Cooperative Adaptive Cruise Control Systems With Uncertain Dynamics, IEEE Transactions on Control Systems Technology, 2019, first author
(43) A Review of Computational Intelligence for StarCraft AI, 2018 IEEE Symposium Series on Computational Intelligence (IEEE SSCI), 2018, third author
(44) Comprehensive comparison of online ADP algorithms for continuous-time optimal control, Artificial Intelligence Review, 2018, first author
(45) An Autonomous Driving Experience Platform with Learning-Based Functions, 2018 IEEE Symposium Series on Computational Intelligence (IEEE SSCI), 2018, fourth author
(46) Learning battles in ViZDoom via deep reinforcement learning, 2018, first author
(47) Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming, IEEE Transactions on Cybernetics, 2018, corresponding author
(48) Visual Navigation with Actor-Critic Deep Reinforcement Learning, 2018 International Joint Conference on Neural Networks (IJCNN), 2018, third author
(49) Comprehensive Comparison of Online Adaptive Dynamic Programming Methods for Continuous-Time Optimal Control, Artificial Intelligence Review, 2018, first author
(50) An Autonomous Driving Experience Platform with Learning-Based Functions, 2018 IEEE Symposium Series on Computational Intelligence (IEEE SSCI), 2018, first author
(51) Visual navigation with Actor-Critic deep reinforcement learning, 2018, first author
(52) Event-Triggered Optimal Control for Partially Unknown Constrained-Input Systems via Adaptive Dynamic Programming, IEEE Transactions on Industrial Electronics, 2017, corresponding author
(53) Adaptive dynamic programming for robust neural control of unknown continuous-time non-linear systems, IET Control Theory & Applications, 2017, fourth author
(54) Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs, Neurocomputing, 2017, third author
(55) Event-Driven Optimal Control of Partially Unknown, Control-Constrained Systems Using Adaptive Dynamic Programming, IEEE Transactions on Industrial Electronics, 2017, first author
(56) Adaptive dynamic programming for robust neural control of unknown continuous-time non-linear systems, IET Control Theory & Applications, 2017, fourth author
(57) Policy Iteration Solution for H-infinity Optimal Control of Polynomial Nonlinear Systems Using Sum of Squares Programming, IEEE Transactions on Cybernetics, 2017, first author
(58) Solving Unknown Nonlinear Zero-Sum Games Using Iterative Adaptive Dynamic Programming Based on Online Data, IEEE Transactions on Neural Networks and Learning Systems, 2017, first author
(59) Data-Driven Adaptive Dynamic Programming for Solving Continuous-Time Fully Cooperative Games With Partially Constrained Inputs, Neurocomputing, 2017, first author
(60) Cooperative Reinforcement Learning for Multiple Units Combat in StarCraft, 2017, first author
(61) Recent progress of deep reinforcement learning: from AlphaGo to AlphaGo Zero, Control Theory & Applications (控制理论与应用), 2017, fourth author
(62) Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data, IEEE Transactions on Neural Networks and Learning Systems, 2017, corresponding author
(63) Robust Network Control of Unknown Continuous-Time Nonlinear Systems via Adaptive Dynamic Programming, IET Control Theory & Applications, 2017, first author
(64) Event-Triggered H-infinity Control for Continuous-Time Nonlinear System via Concurrent Learning, IEEE Transactions on Systems, Man, and Cybernetics: Systems, 2017, third author
(65) Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics, IET Control Theory & Applications, 2016, first author
(66) Model-free reinforcement learning for nonlinear zero-sum games with simultaneous explorations, 2016, first author
(67) Move Prediction in Gomoku Using Deep Learning, 2016, fourth author
(68) Deep Reinforcement Learning with Experience Replay Based on SARSA, Proceedings of 2016 IEEE Symposium Series on Computational Intelligence (SSCI), 2016, fourth author
(69) Using Reinforcement Learning Techniques to Solve the Continuous-Time Nonlinear Optimal Tracking Problem With Unknown System Dynamics, IET Control Theory & Applications, 2016, first author
(70) Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics, IEEE Transactions on Cybernetics, 2016, corresponding author
(71) Probably approximately correct reinforcement learning solving continuous-state control problem, Control Theory & Applications (控制理论与应用), 2016, second author
(72) Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems, Cognitive Computation, 2015, first author
(73) MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems, IEEE Transactions on Neural Networks and Learning Systems, 2015, second author
(74) A data-based online reinforcement learning algorithm satisfying probably approximately correct principle, Neural Computing and Applications, 2015, first author
(75) Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems, Cognitive Computation, 2015, first author
(76) MEC: A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems, IEEE Transactions on Neural Networks and Learning Systems, 2015, first author
(77) Model-Free Adaptive Algorithm for Optimal Control of Continuous-Time Nonlinear System, 2015, second author
(78) Thermal Comfort Control Based on MEC Algorithm for HVAC Systems, 2015 International Joint Conference on Neural Networks (IJCNN), 2015, third author
(79) Convergence analysis and application of fuzzy-HDP for nonlinear discrete-time HJB systems, Neurocomputing, 2015, first author
(80) A Data-Based Online Reinforcement Learning Algorithm Satisfying the Probably Approximately Correct Principle, Neural Computing and Applications, 2015, first author
(81) Thermal Comfort Control Based on MEC Algorithm for HVAC System, 2015, third author
(82) Convergence Analysis and Application of the Fuzzy-HDP Method for Nonlinear Discrete-Time HJB Systems, Neurocomputing, 2015, first author
(83) A data-based online reinforcement learning algorithm with high-efficient exploration, 2014, first author
(84) Full-range adaptive cruise control based on supervised adaptive dynamic programming, Neurocomputing, 2014, fifth author
(85) An high-efficient online reinforcement learning algorithm for continuous-state systems, IEEE World Congress on Intelligent Control and Automation (WCICA), 2014, first author
(86) Online reinforcement learning for continuous-state systems, Frontiers of Intelligent Control and Information Processing, 2014, first author
(87) Online Model-Free RLSPI Algorithm for Nonlinear Discrete-Time Non-affine Systems, 2013, second author
(88) Integration of fuzzy controller with adaptive dynamic programming, IEEE World Congress on Intelligent Control and Automation (WCICA), 2012, first author
(89) Neural and Fuzzy Dynamic Programming for Under-actuated Systems, International Joint Conference on Neural Networks (IJCNN), 2012, second author