发表论文
[1] Lu, Yi, Chen, Yaran, Zhao, Dongbin, Liu, Bao, Lai, Zhichao, Chen, Jianxin. CNN-G: Convolutional Neural Network Combined With Graph for Image Segmentation With Theoretical Analysis. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS[J]. 2021, 13(3): 631-644, http://dx.doi.org/10.1109/TCDS.2020.2998497.[2] Li, Nannan, Pan, Yu, Chen, Yaran, Ding, Zixiang, Zhao, Dongbin, Xu, Zenglin. Heuristic rank selection with progressively searching tensor ring network. COMPLEX & INTELLIGENT SYSTEMS. 2021, http://dx.doi.org/10.1007/s40747-021-00308-x.[3] Zhu, Yuanheng, Zhao, Dongbin, He, Haibo. Optimal Feedback Control of Pedestrian Flow in Heterogeneous Corridors. IEEE TRANSACTIONS ON AUTOMATION SCIENCE AND ENGINEERING[J]. 2021, 18(3): 1097-1108, http://dx.doi.org/10.1109/TASE.2020.2996018.[4] Lu, Yi, Chen, Yaran, Zhao, Dongbin, Li, Dong. MGRL: Graph neural network based inference in a Markov network with reinforcement learning for visual navigation. NEUROCOMPUTING[J]. 2021, 421: 140-150, http://dx.doi.org/10.1016/j.neucom.2020.07.091.[5] Zhu, Yuanheng, He, Haibo, Zhao, Dongbin. LMI-Based Synthesis of String-Stable Controller for Cooperative Adaptive Cruise Control. IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS[J]. 2020, 21(11): 4516-4525, https://www.webofscience.com/wos/woscc/full-record/WOS:000587709700003.[6] Zhu, Yuanheng, Zhao, Dongbin, He, Haibo. Synthesis of Cooperative Adaptive Cruise Control With Feedforward Strategies. IEEE TRANSACTIONS ON VEHICULAR TECHNOLOGY[J]. 2020, 69(4): 3615-3627, https://www.webofscience.com/wos/woscc/full-record/WOS:000530284400009.[7] Zhao Dongbin. A spatial-temporal LSTM model for human trajectory prediction. IEEE/CAA Journal of Automation Sinica. 2020, [8] Zhao, Xiaodong, Chen, Yaran, Guo, Jin, Zhao, Dongbin. A spatial-temporal attention model for human trajectory prediction. IEEE-CAA JOURNAL OF AUTOMATICA SINICA[J]. 2020, 7(4): 965-974, http://dx.doi.org/10.1109/JAS.2020.1003228.[9] Wang, Xu, Liu, Jingwei, Wu, Chaoyong, Liu, Junhong, Li, Qianqian, Chen, Yufeng, Wang, Xinrong, Chen, Xinli, Pang, Xiaohan, Chang, Binglong, Lin, Jiaying, Zhao, Shifeng, Li, Zhihong, Deng, Qingqiong, Lu, Yi, Zhao, Dongbin, Chen, Jianxin. Artificial intelligence in tongue diagnosis: Using deep convolutional neural network for recognizing unhealthy tongue with tooth-mark. COMPUTATIONAL AND STRUCTURAL BIOTECHNOLOGY JOURNAL[J]. 2020, 18: 973-980, http://dx.doi.org/10.1016/j.csbj.2020.04.002.[10] Mu, Chaoxu, Wang, Ke, Zhang, Qichao, Zhao, Dongbin. Hierarchical optimal control for input-affine nonlinear systems through the formulation of Stackelberg game. INFORMATION SCIENCES[J]. 2020, 517: 1-17, http://dx.doi.org/10.1016/j.ins.2019.12.078.[11] Li, Haoran, Zhang, Qichao, Zhao, Dongbin. Deep Reinforcement Learning-Based Automatic Exploration for Navigation in Unknown Environment. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2020, 31(6): 2064-2076, http://dx.doi.org/10.1109/TNNLS.2019.2927869.[12] Shao, Kun, Zhu, Yuanheng, Tang, Zhentao, Zhao, Dongbin, IEEE. Cooperative Multi-Agent Deep Reinforcement Learning with Counterfactual Reward. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2020, [13] Liu, Minsong, Zhu, Yuanheng, Zhao, Dongbin, IEEE. An Improved Minimax-Q Algorithm Based on Generalized Policy Iteration to Solve a Chaser-Invader Game. 2020 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2020, [14] Zhu, Yuanheng, Zhao, Dongbin, He, Haibo. Invariant Adaptive Dynamic Programming for Discrete-Time Optimal Control. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2020, 50(11): 3959-3971, https://www.webofscience.com/wos/woscc/full-record/WOS:000578826300003.[15] Zhao Dongbin. Advances in Deep Neural Information Processing - Editorial. Neurocomputing. 2020, [16] Zhao, Dongbin, Duan, Shukai, Yan, Zheng, Alippi, Cesare. Advances in deep neural information processing. NEUROCOMPUTINGnull. 2020, 408: 80-81, http://dx.doi.org/10.1016/j.neucom.2020.01.001.[17] Zhao Dongbin. Adaptive optimal control of cooperative adaptive cruise control with uncertain heterogeneous vehicles. IEEE Control System Technology. 2019, [18] Lu, Yi, Chen, Yaran, Zhao, Dongbin, Chen, Jianxin, Lu, H, Tang, H, Wang, Z. Graph-FCN for Image Semantic Segmentation. ADVANCES IN NEURAL NETWORKS - ISNN 2019, PT Inull. 2019, 11554: 97-105, [19] Zhu, Yuanheng, Zhao, Dongbin, Li, Xiangjun, Wang, Ding. Control-Limited Adaptive Dynamic Programming for Multi-Battery Energy Storage Systems. IEEE TRANSACTIONS ON SMART GRID[J]. 2019, 10(4): 4235-4244, https://www.webofscience.com/wos/woscc/full-record/WOS:000472577500065.[20] Gao, Yinfeng, Liu, Yuqi, Zhang, Qichao, Wang, Yu, Zhao, Dongbin, Ding, Dawei, Pang, Zhonghua, Zhang, Yueming, IEEE. Comparison of Control Methods Based on Imitation Learning for Autonomous Driving. 2019 TENTH INTERNATIONAL CONFERENCE ON INTELLIGENT CONTROL AND INFORMATION PROCESSING (ICICIP)null. 2019, 274-281, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000613247000046.[21] Shao, Kun, Zhu, Yuanheng, Zha, Dongbin. StarCraft Micromanagement With Reinforcement Learning and Curriculum Transfer Learning. IEEE TRANSACTIONS ON EMERGING TOPICS IN COMPUTATIONAL INTELLIGENCE[J]. 2019, 3(1): 73-84, http://dx.doi.org/10.1109/TETCI.2018.2823329.[22] Zhu, Yuanheng, He, Haibo, Zhao, Dongbin, Hou, Zhongsheng, IEEE. Optimal Pedestrian Evacuation in Building with Consecutive Differential Dynamic Programming. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2019, [23] Su, Hao, Chen, Yaran, Tong, Shiwen, Zhao, Dongbin, IEEE. Real-time multiple object tracking based on optical flow. 2019 9TH INTERNATIONAL CONFERENCE ON INFORMATION SCIENCE AND TECHNOLOGY (ICIST2019)null. 2019, 350-356, [24] Lv, Le, Zhao, Dongbin, Shao, Kun. Deep sparse representation-based mid-level visual elements discovery in fine-grained classification. SOFT COMPUTING[J]. 2019, 23(18): 8711-8722, http://dx.doi.org/10.1007/s00500-018-3468-3.[25] Chen, Yaran, Zhao, Dongbin, Li, Haoran, IEEE. Deep Kalman Filter with Optical Flow for Multiple Object Tracking. 2019 IEEE INTERNATIONAL CONFERENCE ON SYSTEMS, MAN AND CYBERNETICS (SMC)null. 2019, 3036-3041, [26] Li, Dong, Zhao, Dongbin, Zhang, Qichao, Chen, Yaran. Reinforcement Learning and Deep Learning Based Lateral Control for Autonomous Driving. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE[J]. 2019, 14(2): 83-98, http://ir.ia.ac.cn/handle/173211/23517.[27] Wang, Bin, Zhao, Dongbin, Cheng, Jin. Adaptive cruise control via adaptive dynamic programming with experience replay. SOFT COMPUTING[J]. 2019, 23(12): 4131-4144, http://ir.ia.ac.cn/handle/173211/24396.[28] Zhang, Qichao, Zhao, Dongbin. Data-Based Reinforcement Learning for Nonzero-Sum Games With Unknown Drift Dynamics. IEEE TRANSACTIONS ON CYBERNETICS[J]. 2019, 49(8): 2874-2885, http://ir.ia.ac.cn/handle/173211/24567.[29] Wang, Junjie, Zhang, Qichao, Zhao, Dongbin, Chen, Yaran, IEEE. Lane Change Decision-making through Deep Reinforcement Learning with Rule-based Constraints. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2019, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000530893803042.[30] Zhu, Yuanheng, Zhao, Dongbin, Zhong, Zhiguang. Adaptive Optimal Control of Heterogeneous CACC System With Uncertain Dynamics. IEEE TRANSACTIONS ON CONTROL SYSTEMS TECHNOLOGY[J]. 2019, 27(4): 1772-1779, [31] Zhang, Qichao, Luo, Rui, Zhao, Dongbin, Luo, Chaomin, Qian, Dianwei, IEEE. Model-Free Reinforcement Learning based Lateral Control for Lane Keeping. 2019 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2019, [32] Tang Zhentao, Shao Kun, Zhu Yuanheng, Li Dong, Zhao Dongbin, Huang Tingwen, Sundaram S. A Review of Computational Intelligence for StarCraft AI. 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI)null. 2018, 1167-1173, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000459238800159.[33] Zhao, Xiaodong, Zhang, Qichao, Zhao, Dongbin, Pang, Zhonghua, Sun, MX, Zhang, HG. Overview of Image Segmentation and Its Application on Free Space Detection. PROCEEDINGS OF 2018 IEEE 7TH DATA DRIVEN CONTROL AND LEARNING SYSTEMS CONFERENCE (DDCLS)null. 2018, 1164-1169, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000450645900210.[34] Chen, Yaran, Zhao, Dongbin, Lv, Le, Zhang, Qichao. Multi-task learning for dangerous object detection in autonomous driving. INFORMATION SCIENCES[J]. 2018, 432(*): 559-571, http://dx.doi.org/10.1016/j.ins.2017.08.035.[35] Zhang, Zhen, Wang, Dongqing, Zhao, Dongbin, Han, Qiaoni, Song, Tingting. A Gradient-Based Reinforcement Learning Algorithm for Multiple Cooperative Agents. IEEE ACCESS[J]. 2018, 6: 70223-70235, http://ir.ia.ac.cn/handle/173211/25665.[36] Zhao Dongbin, Li Haoran, Li Dong, Guo Ping, Chen Yaran. A Temporal-based Deep Learning Method for Multiple Objects Detection in Autonomous Driving. 2018, http://ir.ia.ac.cn/handle/173211/23521.[37] Zhu, Yuanheng, Zhao, Dongbin. Comprehensive comparison of online ADP algorithms for continuous-time optimal control. ARTIFICIAL INTELLIGENCE REVIEW[J]. 2018, 49(4): 531-547, https://www.webofscience.com/wos/woscc/full-record/WOS:000426912500004.[38] Zhao, Dongbin, Liu, Derong, Lewis, F L, Principe, Jose C, Squartini, Stefano. Special Issue on Deep Reinforcement Learning and Adaptive Dynamic Programming. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMSnull. 2018, 29(6): 2038-2041, https://www.webofscience.com/wos/woscc/full-record/WOS:000432398300001.[39] Li Dong, Zhao Dongbin, Zhang Qichao, Zhu Yuanheng, Sundaram S. An Autonomous Driving Experience Platform with Learning-Based Functions. 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI)null. 2018, 1174-1179, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000459238800160.[40] Zhu, Yuanheng, Zhao, Dongbin, Yang, Xiong, Zhang, Qichao. Policy Iteration for H infinity Optimal Control of Polynomial Nonlinear Systems via Sum of Squares Programming. IEEE TRANSACTIONS ON CYBERNETICS[J]. 2018, 48(2): 500-509, https://www.webofscience.com/wos/woscc/full-record/WOS:000422925700005.[41] Yuanheng Zhu, Nannan Li, Kun Shao, Dongbin Zhao. Learning battles in ViZDoom via deep reinforcement learning. 2018, http://ir.ia.ac.cn/handle/173211/23364.[42] Zhang, Qichao, Zhao, Dongbin, Lewis, Frank L, IEEE. Model-Free Reinforcement Learning for Fully Cooperative Multi-Agent Graphical Games. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2018, [43] Zhang, Qichao, Zhao, Dongbin, Wang, Ding. Event-Based Robust Control for Uncertain Nonlinear Systems Using Adaptive Dynamic Programming. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2018, 29(1): 37-50, https://www.webofscience.com/wos/woscc/full-record/WOS:000419558900004.[44] Chen, Yaran, Zhao, Dongbin, Li, Haoran, Li, Dong, Guo, Ping, IEEE. A temporal-based deep learning method for multiple objects detection in autonomous driving. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2018, [45] Shao, Kun, Zhao, Dongbin, Zhu, Yuanheng, Zhang, Qichao, IEEE. Visual Navigation with Actor-Critic Deep Reinforcement Learning. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2018, [46] Bu, Li, Alippi, Cesare, Zhao, Dongbin. A pdf-Free Change Detection Test Based on Density Difference Estimation. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2018, 29(2): 324-334, https://www.webofscience.com/wos/woscc/full-record/WOS:000422952400007.[47] Wu, I C, Lee, C S, Tian, Y, Mueller, M. Guest Editorial Special Issue on Deep/Reinforcement Learning and Games. IEEE TRANSACTIONS ON GAMESnull. 2018, 10(4): 333-335, https://www.webofscience.com/wos/woscc/full-record/WOS:000453577300001.[48] Zhao Dongbin. Comprehesive comparison of online ADP algorithms for continuous-time optimal control. Artificial Intelligence Review. 2018, [49] Lu, Yi, Chen, Yaran, Zhao, Dongbin, Li, Haoran, IEEE. Hybrid Deep Learning Based Moving Object Detection via Motion prediction. 2018 CHINESE AUTOMATION CONGRESS (CAC)null. 2018, 1442-1447, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000459239501089.[50] Li, Dong, Zhao, Dongbin, Chen, Yaran, Zhang, Qichao, IEEE. DeepSign: Deep Learning based Traffic Sign Recognition. 2018 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2018, [51] Zhu Yuanheng, Zhang Qichao, Zhao Dongbin, Li Dong. An Autonomous Driving Experience Platform with Learning-Based Functions. 2018 IEEE SYMPOSIUM SERIES ON COMPUTATIONAL INTELLIGENCE (IEEE SSCI)null. 2018, 1174-1179, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000459238800160.[52] Yuanheng Zhu, Qichao Zhang, Dongbin Zhao, Kun Shao. Visual navigation with Actor-Critic deep reinforcement learning. 2018, http://ir.ia.ac.cn/handle/173211/23365.[53] Zhu, Yuanheng, Zhao, Dongbin, He, Haibo, Ji, Junhong. Event-Triggered Optimal Control for Partially Unknown Constrained-Input Systems via Adaptive Dynamic Programming. IEEE TRANSACTIONS ON INDUSTRIAL ELECTRONICS[J]. 2017, 64(5): 4101-4109, https://www.webofscience.com/wos/woscc/full-record/WOS:000399674000064.[54] Deng QingQiong, Zhao Dongbin, Lv Le. Image Clustering based on Deep Sparse Representations. 2016 IEEE Symposium Series on Computational Intelligence: SSCI 2016, Athens, Greece, 6-9 December 2016, pages 2037-2712, v.4null. 2017, 2108-2113, http://ir.ia.ac.cn/handle/173211/14471.[55] Bu Li, Zhao Dongbin, Alippi Cesare. An Incremental Change Detection Test Based on Density Difference Estimation. IEEE Transactions on Systems, Man, and Cybernetics: Systems[J]. 2017, [56] Li, Chengdong, Ding, Zixiang, Zhao, Dongbin, Yi, Jianqiang, Zhang, Guiqing. Building Energy Consumption Prediction: An Extreme Deep Learning Approach. ENERGIES[J]. 2017, 10(10): https://doaj.org/article/97e10cd1f86645f384b67cc9b9f33881.[57] Zhang, Qichao, Zhao, Dongbin, Zhu, Yuanheng. Data-driven adaptive dynamic programming for continuous-time fully cooperative games with partially constrained inputs. NEUROCOMPUTING[J]. 2017, 238(*): 377-386, http://dx.doi.org/10.1016/j.neucom.2017.01.076.[58] Zhao, Dongbin, Chen, Yaran, Lv, Le. Deep Reinforcement Learning With Visual Attention for Vehicle Classification. IEEE TRANSACTIONS ON COGNITIVE AND DEVELOPMENTAL SYSTEMS[J]. 2017, 9(4): 356-367, http://dx.doi.org/10.1109/TCDS.2016.2614675.[59] Li Dong, Zhao Dongbin, Zhang Qichao, Luo Chaomin, IEEE. Policy Gradient Methods with Gaussian Process Modelling Acceleration. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2017, 1774-1779, [60] Qichao Zhang, Haoran Li, Dongbin Zhao. Comparison of methods to efficient graph SLAM under general optimization framework. YAC 2017null. 2017, *-, http://ir.ia.ac.cn/handle/173211/19422.[61] Zhao Dongbin. Editorial: new developments in neural network structures for signal processing, autonomous decision, and adaptive controll. IEEE Transactions on Neural Networks and Learning Systems. 2017, [62] 唐振韬, 邵坤, 赵冬斌, 朱圆恒. 深度强化学习进展:从AlphaGo到AlphaGo Zero. 控制理论与应用[J]. 2017, 34(12): 1529-1546, http://lib.cqvip.com/Qikan/Article/Detail?id=7000480876.[63] Tang Zhentao, Lv Le, Shao Kun, Zhao Dongbin. ADP with MCTS algorithm for Gomoku. 2017, http://ir.ia.ac.cn/handle/173211/14475.[64] Zhao Dongbin, Wei Qinglai, Alippi Cesare, Bu Li. A Kolmogorov-Smirnov Test to Detect Changes in Stationarity in Big Data. IFAC PAPERSONLINEnull. 2017, 50(1): 14260-14265, http://dx.doi.org/10.1016/j.ifacol.2017.08.1821.[65] 朱圆恒, 赵冬斌, 邵坤. Cooperative Reinforcement Learning for Multiple Units Combat in StarCraft. 2017, http://ir.ia.ac.cn/handle/173211/15399.[66] Zhu, Yuanheng, Zhao, Dongbin, Li, Xiangjun. Iterative Adaptive Dynamic Programming for Solving Unknown Nonlinear Zero-Sum Game Based on Online Data. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2017, 28(3): 714-725, https://www.webofscience.com/wos/woscc/full-record/WOS:000395980500020.[67] Zhang Qichao, Zhao Dongbin, Zhu Yuanheng. Event-Triggered $H_\\infty $ Control for Continuous-Time Nonlinear System via Concurrent Learning. IEEE Transactions on Systems, Man, and Cybernetics: Systems[J]. 2017, [68] Zhao, Dongbin, Xia, Zhongpu, Zhang, Qichao. Model-free Optimal Control based Intelligent Cruise Control with Hardware-in-the-loop Demonstration. IEEE COMPUTATIONAL INTELLIGENCE MAGAZINE[J]. 2017, 12(2): 56-69, https://www.webofscience.com/wos/woscc/full-record/WOS:000399714900005.[69] Lv, Le, Zhao, Dongbin, Deng, Qingqiong. A Semi-Supervised Predictive Sparse Decomposition Based on Task-Driven Dictionary Learning. COGNITIVE COMPUTATION[J]. 2017, 9(1): 115-124, https://www.webofscience.com/wos/woscc/full-record/WOS:000394418100008.[70] Zhao Dongbin. Event-triggered optimal control for nonlinear constrained-input systems with partially unknown dynamics via adaptive dynamic programming. IEEE Transactions on Industrial Electronics. 2017, [71] Shengli Xie, Derong Liu, Dongbin Zhao, ElSayed M ElAlfy, Yuanqing Li. Neural Information Processing. Neural Information Processing, Lecture Notes in Computer Sciencenull. 2017, 10636, 10637, 10638, 10639,-, http://ir.ia.ac.cn/handle/173211/19892.[72] Zhao Dongbin, Zhang Qichao. Data-driven adaptive dynamic programming for two-player nonzero-sum game. 2017, http://ir.ia.ac.cn/handle/173211/14342.[73] Chen, Yaran, Zhao, Dongbin, Cong, F, Leung, A, Wei, Q. Multi-task Learning with Cartesian Product-Based Multi-objective Combination for Dangerous Object Detection. ADVANCES IN NEURAL NETWORKS, PT Inull. 2017, 10261: 28-35, [74] Bu, Li, Zhao, Dongbin, Alippi, Cesare. An Incremental Change Detection Test Based on Density Difference Estimation. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2017, 47(10): 2714-2726, https://www.webofscience.com/wos/woscc/full-record/WOS:000411098200009.[75] Zhang, Qichao, Zhao, Dongbin, Zhu, Yuanheng. Event-Triggered H-infinity Control for Continuous-Time Nonlinear System via Concurrent Learning. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2017, 47(7): 1071-1081, https://www.webofscience.com/wos/woscc/full-record/WOS:000404354600004.[76] Zhang, Zhen, Zhao, Dongbin, Gao, Junwei, Wang, Dongqing, Dai, Yujie. FMRQ-A Multiagent Reinforcement Learning Algorithm for Fully Cooperative Tasks. IEEE TRANSACTIONS ON CYBERNETICS[J]. 2017, 47(6): 1367-1379, https://www.webofscience.com/wos/woscc/full-record/WOS:000401950400002.[77] Li Dong, Zhao Dongbin, Zhang Qichao, Luo Chaomin, IEEE. Policy Gradient Methods with Gaussian Process Modelling Acceleration. 2017 INTERNATIONAL JOINT CONFERENCE ON NEURAL NETWORKS (IJCNN)null. 2017, 1774-1779, [78] Wang, Ding, Liu, Derong, Zhang, Qichao, Zhao, Dongbin. Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2016, 46(11): 1544-1555, https://www.webofscience.com/wos/woscc/full-record/WOS:000386225800006.[79] Tang, Yufei, He, Haibo, Ni, Zhen, Zhong, Xiangnan, Zhao, Dongbin, Xu, Xin. Fuzzy-Based Goal Representation Adaptive Dynamic Programming. IEEE TRANSACTIONS ON FUZZY SYSTEMS[J]. 2016, 24(5): 1159-1175, https://www.webofscience.com/wos/woscc/full-record/WOS:000386076600013.[80] Zhu, Yuanheng, Zhao, Dongbin, Li, Xiangjun. Using reinforcement learning techniques to solve continuous-time non-linear optimal tracking problem without system dynamics. IET CONTROL THEORY AND APPLICATIONS[J]. 2016, 10(12): 1339-1347, [81] Zhu Yuanheng, Chen Xi, Zhao Dongbin, Zhang Qichao. Model-free reinforcement learning for nonlinear zero-sum games with simultaneous explorations. 2016, http://ir.ia.ac.cn/handle/173211/14340.[82] Li Dong, Xia Zhongpu, Zhao Dongbin. A Perturbed Gaussian Process Regression with Chunk Sparsification for Tracking Non-stationary Systems. PROCEEDINGS OF THE 28TH CHINESE CONTROL AND DECISION CONFERENCE (2016 CCDC)null. 2016, 6639-6644, [83] 周彤, 李栋, 朱圆恒, 王成红, 刘德荣, 王海涛, 陈亚冉, 邵坤, 赵冬斌. 深度强化学习综述:兼论计算机围棋的发展. 控制理论与应用[J]. 2016, 33(6): 701-717, [84] Zhao Dongbin, Alippi Cesare, Bu Li. Ensemble LSDD-based change detection tests. 2016, http://ir.ia.ac.cn/handle/173211/14332.[85] 孙长银, 王成红, 胡跃明, 赵东斌, 周彤, 苏剑波. “机器智能、系统优化与最优决策”专刊前言. 控制理论与应用. 2016, 33(12): 1553-1554, http://lib.cqvip.com/Qikan/Article/Detail?id=7000119650.[86] Zhao Dongbin. Model-free iterative adaptive dynamic programming solving unknown nonlinear zero-sum game based on online measurement. IEEE Transactions on Neural Networks and Learning Systems. 2016, [87] Zhao, Dongbin, Zhang, Qichao, Wang, Ding, Zhu, Yuanheng. Experience Replay for Optimal Control of Nonzero-Sum Game Systems With Unknown Dynamics. IEEE TRANSACTIONS ON CYBERNETICS[J]. 2016, 46(3): 854-865, https://www.webofscience.com/wos/woscc/full-record/WOS:000370963500023.[88] Dongbin Zhao, Le Lv, Qingqiong Deng. Image clustering based on the deep sparse representations. Computational Intelligence (SSCI), 2016 IEEE Symposium Series onnull. 2016, 1-6, http://ir.ia.ac.cn/handle/173211/19423.[89] ZhuYuanheng, ShaoKun, WangHaitao, 赵冬斌. Deep reinforcement learning with Experience Replay based on SARSA. 2016, http://ir.ia.ac.cn/handle/173211/19877.[90] Chen, Yaran, Zhao, Dongbin, Lv, Le, Li, Chengdong, IEEE. A Visual Attention Based Convolutional Neural Network for Image Classification. PROCEEDINGS OF THE 2016 12TH WORLD CONGRESS ON INTELLIGENT CONTROL AND AUTOMATION (WCICA)null. 2016, 764-769, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000388373802067.[91] Xia, Zhongpu, Zhao, Dongbin. Online reinforcement learning control by Bayesian inference. IET CONTROL THEORY AND APPLICATIONS[J]. 2016, 10(12): 1331-1338, https://www.webofscience.com/wos/woscc/full-record/WOS:000381410000003.[92] 赵冬斌, 朱圆恒. 概率近似正确的强化学习算法解决连续状态空间控制问题. 控制理论与应用. 2016, 33(12): 1603-1613, http://lib.cqvip.com/Qikan/Article/Detail?id=7000119656.[93] Wang, Ding, Liu, Derong, Zhang, Qichao, Zhao, Dongbin. Data-Based Adaptive Critic Designs for Nonlinear Robust Optimal Control With Uncertain Dynamics. IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS[J]. 2016, 46(11): 1544-1555, https://www.webofscience.com/wos/woscc/full-record/WOS:000386225800006.[94] Zhao, Dongbin, Zhu, Yuanheng. MEC-A Near-Optimal Online Reinforcement Learning Algorithm for Continuous Deterministic Systems. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2015, 26(2): 346-356, http://www.irgrid.ac.cn/handle/1471x/980893.[95] Ni, Zhen, He, Haibo, Zhao, Dongbin, Xu, Xin, Prokhorov, Danil V. GrDHP: A General Utility Function Representation for Dual Heuristic Dynamic Programming. IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS[J]. 2015, 26(3): 614-627, https://www.webofscience.com/wos/woscc/full-record/WOS:000351834400016.[96] Zhao Dongbin, ZhangQichao, Li Chengdong, Wei Qinglai. Consensus of Heterogeneous Multi-agent Systems With Switching Topologies Using Input-output Feedback Linearization. 2015 34th Chinese control conference: CCC 2015, Hangzhou, China, 28-30 July 2015, pages 6414-7296, v.8null. 2015, 6872-6877, http://ir.ia.ac.cn/handle/173211/14338.[97] Zhang, Qichao, Zhao, Dongbin, Wei, Qinglai, Li, Chengdong, Zhao, Q, Liu, S. Consensus of Heterogeneous Multi-agent Systems With Switching Topologies Using Input-output Feedback Linearization. 2015 34TH CHINESE CONTROL CONFERENCE (CCC)null. 2015, 6872-6877, [98] Zhu, Yuanheng, Zhao, Dongbin, He, Haibo, Ji, Junhong. Convergence Proof of Approximate Policy Iteration for Undiscounted Optimal Control of Discrete-Time Systems. COGNITIVE COMPUTATION[J]. 2015, 7(6): 763-771, http://ir.ia.ac.cn/handle/173211/10525.[99] Squartini, Stefano, Liu, Derong, Piazza, Francesco, Zhao, Dongbin, He, Haibo. Computational Energy Management in Smart Grids. NEUROCOMPUTINGnull. 2015, 170: 267-269, http://dx.doi.org/10.1016/j.neucom.2015.05.110.[100] 王革, 刘广天, 汪海洪, 巩可欣, 赵冬斌. 能源存储:一种新的方法. 能源存储:一种新的方法null. 2015, http://ir.ia.ac.cn/handle/173211/19889.