发表论文
[1] Xinda Chen, Rongliang Fu, Junying Huang, Huawei Cao, Zhimin Zhang, Xiaochun Ye, Tsung-Yi Ho, Dongrui Fan. JRouter: A Multi-Terminal Hierarchical Length-Matching Router under Planar Manhattan Routing Model for RSFQ Circuits. GLSVLSI. 2023, 第 8 作者null(null): https://dl.acm.org/doi/abs/10.1145/3583781.3590267.[2] 范志华, 李文明, 王珎, 刘天雨, 吴海彬, 刘艳欢, 吴萌, 叶笑春, 范东睿, 安学军. Accelerating Convolutional Neural Networks by Exploiting the Sparsity of Output Activation. TPDS[J]. 2023, 第 9 作者34(12): 3253-3265, [3] 范志华, 李文明, 汤胜中, 安学军, 叶笑春, 范东睿. Improving Utilization of Dataflow Architectures Through Software and Hardware Co-Design. Euro-Par. 2023, 第 6 作者[4] 范志华, 吴欣欣, 李文明, 曹华伟, 安学军, 叶笑春, 范东睿. 面向低精度神经网络的数据流体系结构优化. 计算机研究与发展[J]. 2023, 第 7 作者60(1): 43-58, http://lib.cqvip.com/Qikan/Article/Detail?id=7108741862.[5] 范志华, 李文明, 王珎, 刘天雨, 吴海彬, 刘艳欢, 吴萌, 吴欣欣, 叶笑春, 范东睿, 孙凝晖, 安学军. Accelerating Convolutional Neural Networks by Exploiting the Sparsity of Output Activation. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS[J]. 2023, 第 10 作者[6] Junying Huang, Rongliang Fu, Xiaochun Ye, Dongrui Fan. A survey on superconducting computing technology: circuits, architectures and design tools. CCF Transactions on High Performance Computing[J]. 2022, 第 4 作者[7] Sun, Gongjian, Yan, Mingyu, Wang, Duo, Li, Han, Li, Wenming, Ye, Xiaochun, Fan, Dongrui, Xie, Yuan. Multi-node Acceleration for Large-scale GCNs. IEEE TRANSACTIONS ON COMPUTERS[J]. 2022, 第 7 作者https://ieeexplore.ieee.org/document/9893364.[8] Feng, YuJing, Li, DeJian, Tan, Xu, Ye, XiaoChun, Fan, DongRui, Li, WenMing, Wang, Da, Zhang, Hao, Tang, ZhiMin. Accelerating Data Transfer in Dataflow Architectures Through a Look-Ahead Acknowledgment Mechanism. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2022, 第 5 作者37(4): 942-959, http://sciencechina.cn/gw.jsp?action=detail.jsp&internal_id=7283413&detailType=1.[9] Zhihua Fan, Wenming Li, Tianyu Liu, Shengzhong Tang, Zhen Wang, Xuejun An, Xiaochun Ye, Dongrui Fan. A Loop Optimization Method for Dataflow. High Performance Computing and Communications. 2022, 第 8 作者[10] Yan, Mingyu, Zou, Mo, Yang, Xiaocheng, Li, Wenming, Ye, Xiaochun, Fan, Dongrui, Xie, Yuan. Characterizing and Understanding HGNNs on GPUs. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2022, 第 6 作者21(2): 69-72, http://dx.doi.org/10.1109/LCA.2022.3198281.[11] Liu, Xin, Yan, Mingyu, Deng, Lei, Li, Guoqi, Ye, Xiaochun, Fan, Dongrui, Pan, Shirui, Xie, Yuan. Survey on Graph Neural Network Acceleration: An Algorithmic Perspective. International Joint Conference on Artificial Intelligence. 2022, 第 6 作者http://arxiv.org/abs/2202.04822.[12] Lin, Haiyang, Yan, Mingyu, Yang, Xiaocheng, Zou, Mo, Li, Wenming, Ye, Xiaochun, Fan, Dongrui. Characterizing and Understanding Distributed GNN Training on GPUs. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2022, 第 7 作者21(1): 21-24, http://dx.doi.org/10.1109/LCA.2022.3168067.[13] Lin, Haiyang, Yan, Mingyu, Wang, Duo, Zou, Mo, Tu, Fengbin, Ye, Xiaochun, Fan, Dongrui, Xie, Yuan. Alleviating Datapath Conflicts and Design Centralization in Graph Analytics Acceleration. DESIGN AUTOMATION CONFERENCE. 2022, 第 7 作者https://dl.acm.org/doi/10.1145/3489517.3530524.[14] Wang, Yinshen, Li, Wenming, Liu, Tianyu, Zhou, Liangjiang, Wang, Bingnan, Fan, Zhihua, Ye, Xiaochun, Fan, Dongrui, Ding, Chibiao. Characterization and Implementation of Radar System Applications on a Reconfigurable Dataflow Architecture. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2022, 第 8 作者21(2): 121-124, [15] Xinxin Wu, Zhihua Fan, Tianyu Liu, Wenming Li, Xiaochun Ye, Dongrui Fan. LRP: Predictive output activation based on SVD approach for CNNs acceleration. Design, Automation and Test in Europe. 2022, 第 6 作者[16] Liu, Xin, Yan, Mingyu, Deng, Lei, Li, Guoqi, Ye, Xiaochun, Fan, Dongrui. Sampling Methods for Efficient Training of Graph Convolutional Networks: A Survey. IEEE-CAA JOURNAL OF AUTOMATICA SINICA. 2022, 第 6 作者9(2): 205-234, http://dx.doi.org/10.1109/JAS.2021.1004311.[17] Zou, Mo, Zhang, Mingzhe, Wang, Rujia, Sun, XianHe, Ye, Xiaochun, Fan, Dongrui, Tang, Zhimin. Accelerating Graph Processing With Lightweight Learning-Based Data Reordering. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2022, 第 6 作者21(1): 5-8, http://dx.doi.org/10.1109/LCA.2022.3151087.[18] Rongliang Fu, Junying Huang, Haibin Wu, Xiaochun Ye, Dongrui Fan, Tsung-Yi Ho. JBNN: A Hardware Design for Binarized Neural Networks Using Single-Flux-Quantum Circuits. IEEE TRANSACTIONS ON COMPUTERS[J]. 2022, 第 5 作者771(12): 3203-3214, [19] 严明玉, 李涵, 邓磊, 胡杏, 叶笑春, 张志敏, 范东睿, 谢源. 图计算加速架构综述. 计算机研究与发展[J]. 2021, 第 7 作者58(4): 862-887, http://lib.cqvip.com/Qikan/Article/Detail?id=7104271412.[20] Li, Yi, Wu, Meng, Ye, Xiaochun, Li, Wenming, Xue, Rui, Wang, Da, Zhang, Hao, Fan, Dongrui. An efficient scheduling algorithm for dataflow architecture using loop-pipelining. INFORMATION SCIENCES[J]. 2021, 第 8 作者547: 1136-1153, http://dx.doi.org/10.1016/j.ins.2020.09.029.[21] 范东睿. 数据流计算研究进展与概述. 数据与计算发展前沿. 2021, 第 1 作者 通讯作者 [22] 李涵, 严明玉, 吕征阳, 李文明, 叶笑春, 范东睿, 唐志敏. 图神经网络加速结构综述. 计算机研究与发展[J]. 2021, 第 6 作者58(6): 1204-1229, http://lib.cqvip.com/Qikan/Article/Detail?id=7104820799.[23] Li, Han, Yan, Mingyu, Yang, Xiaocheng, Deng, Lei, Li, Wenming, Ye, Xiaochun, Fan, Dongrui, Xie, Yuan. Hardware Acceleration for GCNs via Bidirectional Fusion. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2021, 第 7 作者20(1): [24] Chenglong Zhang, Huawei Cao, Xiaochun Ye, Guobo Wang, Qinfen Hao, Dongrui Fan. Highly Efficient Breadth-First Search on CPU-based Single-node System. INTERNATIONAL JOURNAL OF HYDROGEN ENERGY. 2021, 第 6 作者2066-2071, [25] Dongrui Fan. Scalable and Efficient Graph Traversal on High-Throughput Cluster. CCF Transaction on High Performance Computing (CCF THPC). 2021, 第 1 作者[26] 吴欣欣, 欧焱, 李文明, 王达, 张浩, 范东睿. 基于粗粒度数据流架构的稀疏卷积神经网络加速. 计算机研究与发展[J]. 2021, 第 6 作者58(7): 1504-1517, http://lib.cqvip.com/Qikan/Article/Detail?id=7105055136.[27] Cao, Dingyuan, Zhang, Mingzhe, Lu, Hang, Ye, Xiaochun, Fan, Dongrui, Che, Yuezhi, Wang, Rujia. Streamline Ring ORAM Accesses through Spatial and Temporal Optimization. 2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021). 2021, 第 5 作者14-25, [28] 李灵枝, 胡九川, 叶笑春, 范东睿, 严龙. 渗透缓存命中率诱导的缓存区域动态分配机制研究. 软件导刊[J]. 2020, 第 4 作者19(4): 1-8, http://lib.cqvip.com/Qikan/Article/Detail?id=7101773847.[29] Rongliang Fu, Zhimin Zhang, Guangming Tang, Junying Huang, Xiaochun Ye, Dongrui Fan, Ninghui Sun. Design Automation Methodology from RTL to Gate-level Netlist and Schematic for RSFQ Logic Circuits. Great Lakes Symposium on VLSI. 2020, 第 6 作者[30] Qu, PeiYao, Tang, GuangMing, Yang, JiaHong, Ye, XiaoChun, Fan, DongRui, Zhang, ZhiMin, Sun, NingHui. Design of an 8-bit Bit-Parallel RSFQ Microprocessor. IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY[J]. 2020, 第 5 作者30(7): [31] Yang, JiaHong, Tang, GuangMing, Zheng, XiangYu, Ye, XiaoChun, Fan, DongRui, Zhang, ZhiMin, Sun, NingHui. Distributed Self-Clock: A Suitable Architecture for SFQ Circuits. IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY[J]. 2020, 第 5 作者30(7): http://dx.doi.org/10.1109/TASC.2020.3007175.[32] Dongrui Fan. Pixel-Semantic Revising of Position: One-Stage Object Detector with Shared Encoder-Decoder. The 27th International Conference on Neural Information Processing (ICONIP2020). 2020, 第 1 作者[33] Wu, Xinxin, Li, Yi, Ou, Yan, Li, Wenming, Sun, Shibo, Xu, Wenxing, Fan, Dongrui, Qiu, M. Accelerating Sparse Convolutional Neural Networks Based on Dataflow Architecture. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT II. 2020, 第 7 作者12453: 14-31, [34] Dongrui Fan. Scalable and efcient graph traversal on high‑throughput cluster. CCF Transactions on High Performance Computing. 2020, 第 1 作者[35] Tang, GuangMing, Qu, PeiYao, Zheng, XiangYu, Yang, JiaHong, Ye, XiaoChun, Fan, DongRui, Sun, NingHui. Bit-Slice Butterfly Processing Units for 64-Point RSFQ FFT Processors. IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY[J]. 2020, 第 6 作者30(1): https://www.webofscience.com/wos/woscc/full-record/WOS:000482590700001.[36] Yan Mingyu, Deng Lei, Hu Xing, Liang Ling, Feng Yujing, Ye Xiaochun, Zhang Zhimin, Fan Dongrui, Xie Yuan. HyGCN: A GCN Accelerator with Hybrid Architecture. HPCA. 2020, 第 8 作者http://arxiv.org/abs/2001.02514.[37] Li, Qian, Guo, Nan, Ye, Xiaochun, Fan, Dongrui, Tang, Zhimin. Video Face Recognition System: RetinaFace-mnet-faster and Secondary Search. 2020, 第 4 作者http://arxiv.org/abs/2009.13167.[38] 范灵俊, 杨菲, 郑卫城, 洪学海, 范东睿. 构建城市“互联网+”新型基础设施发展战略研究. 中国工程科学[J]. 2020, 第 5 作者22(4): 106-113, http://lib.cqvip.com/Qikan/Article/Detail?id=7102599416.[39] Yan, Mingyu, Deng, Lei, Hu, Xing, Liang, Ling, Feng, Yujing, Ye, Xiaochun, Zhang, Zhimin, Fan, Dongrui, Xie, Yuan. HyGCN: A GCN Accelerator with Hybrid Architecture. 2020 IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2020). 2020, 第 8 作者15-29, [40] Yan, Mingyu, Chen, Zhaodong, Deng, Lei, Ye, Xiaochun, Zhang, Zhimin, Fan, Dongrui, Xie, Yuan. Characterizing and Understanding GCNs on GPU. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2020, 第 6 作者19(1): 22-25, http://dx.doi.org/10.1109/LCA.2020.2970395.[41] Ou, Yan, Shen, Chongfei, Feng, Yujing, Wu, Xinxin, Li, Wenming, Ye, Xiaochun, Fan, Dongrui, Qiu, M. CTA: A Critical Task Aware Scheduling Mechanism for Dataflow Architecture. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT I. 2020, 第 11 作者12452: 61-77, [42] Ye, Xiaochun, Tan, Xu, Wu, Meng, Feng, Yujing, Wang, Da, Zhang, Hao, Pei, Songwen, Fan, Dongrui. An efficient dataflow accelerator for scientific applications. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE[J]. 2020, 第 8 作者 通讯作者 112: 580-588, http://dx.doi.org/10.1016/j.future.2020.03.023.[43] 叶笑春, 李文明, 张洋, 张浩, 王达, 范东睿. 高通量众核处理器设计. 数据与计算发展前沿[J]. 2020, 第 6 作者2(1): 70-84, http://www.jfdc.cnic.cn/CN/10.11871/jfdc.issn.2096-742X.2020.01.006.[44] Hao, Qinfen, Hao, Kai, Xue, Haiyun, Han, Meng, Qi, Nan, Zhang, Kunming, Niu, Xingmao, Xiao, Limin, Fan, Dongrui, IEEE. A Chip-level Optical Interconnect for CPU. 2020 IEEE PHOTONICS CONFERENCE (IPC). 2020, 第 9 作者http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000612237500111.[45] 张承龙, 曹华伟, 王国波, 郝沁汾, 张洋, 叶笑春, 范东睿. 面向高通量计算机的图算法优化技术. 计算机研究与发展[J]. 2020, 第 7 作者57(6): 1152-1163, http://lib.cqvip.com/Qikan/Article/Detail?id=7101851458.[46] Rongyu Dong, Huawei Cao, Xiaochun Ye, Yuan Zhang, Qinfen Hao, Dongrui Fan. Highly Efficient and GPU-Friendly Implementation of BFS on Single-node System. 18th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA 2020). 2020, 第 6 作者null(null): https://ieeexplore.ieee.org/document/9443861.[47] Junying Huang, Jing Ye, Xiaochun Ye, Da Wang, Dongrui Fan, Huawei Li, Xiaowei Li, Zhimin Zhang. Instruction Vulnerability Test and Code Optimization against DVFS attack. 2019 IEEE INTERNATIONAL TEST CONFERENCE IN ASIA (ITC-ASIA 2019)[J]. 2019, 第 5 作者 通讯作者 49-54, [48] 范东睿, 叶笑春, 包云岗, 孙凝晖. 中国高通量计算机的自主研发之路. 中国科学院院刊[J]. 2019, 第 1 作者648-656, http://lib.cqvip.com/Qikan/Article/Detail?id=75898988504849574854484856.[49] Zokaee, Farzaneh, Zhang, Mingzhe, Ye, Xiaochun, Fan, Dongrui, Jiang, Lei, ACM. Magma: A Monolithic 3D Vertical Heterogeneous ReRAM-based Main Memory Architecture. PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC). 2019, 第 4 作者http://dx.doi.org/10.1145/3316781.3317858.[50] 张志敏. Balancing Memory Accesses for Energy-Efficient Graph Analytics Accelerators. ISLPED. 2019, [51] Li, Wenming, Ye, Xiaochun, Wang, Da, Zhang, Hao, Tang, Zhimin, Fan, Dongrui, Sun, Ninghui. PIM-WEAVER: A High Energy-efficient, General-purpose Acceleration Architecture for String Operations in Big Data Processing. SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS[J]. 2019, 第 6 作者21: 129-142, http://dx.doi.org/10.1016/j.suscom.2019.01.006.[52] 余世干, 唐志敏, 叶笑春, 范东睿. 基于推测机制异构多核处理器容错方法与仿真. 系统仿真学报[J]. 2019, 第 4 作者31(12): 2685-2695, http://lib.cqvip.com/Qikan/Article/Detail?id=7100565631.[53] Wenming Li, Xiaochun Ye, Da Wang, Hao Zhang, Zhimin Tang, Dongrui Fan, Ninghui Sun. PIM-WEAVER: A High Energy-efficient, General-purpose Acceleration Architecture for String Operations in Big Data Processing. SUSTAINABLE COMPUTING: INFORMATICS AND SYSTEMS. 2019, 第 6 作者21: 129-142, http://dx.doi.org/10.1016/j.suscom.2019.01.006.[54] Dongrui Fan. Applying CNN on a Scientific Application Accelerator Based on Dataflow Architecture. CCF Transaction on High Performance Computing (CCF THPC). 2019, 第 1 作者 通讯作者 [55] Gao Yan, Liu Boxiao, Guo Nan, Ye Xiaochun, Wan Fang, You Haihang, Fan Dongrui. Utilizing the Instability in Weakly Supervised Object Detection. 2019, 第 7 作者http://arxiv.org/abs/1906.06023.[56] Yan Mingyu, Hu Xing, Li Shuangchen, Basak Abanti, Li Han, Ma Xin, Akgun Itir, Peng Yujing, Gu Peng, Deng Lei, Ye Xiaochun, Zhang Zhimin, Fan Dongrui, Xie Yuan. Alleviating Irregularity in Graph Analytics Acceleration: a Hardware/Software Co-Design Approach. MICRO'52: THE 52ND ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE. 2019, 第 13 作者615-628, http://dx.doi.org/10.1145/3352460.3358318.[57] Gao, Yan, Liu, Boxiao, Guo, Nan, Ye, Xiaochun, Wan, Fang, You, Haihang, Fan, Dongrui, IEEE. C-MIDN: Coupled Multiple Instance Detection Network With Segmentation Guidance for Weakly Supervised Object Detection. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019). 2019, 第 11 作者9833-9842, [58] 向陶然, 叶笑春, 李文明, 冯煜晶, 谭旭, 张浩, 范东睿. 基于细粒度数据流架构的稀疏神经网络全连接层加速. 计算机研究与发展[J]. 2019, 第 7 作者56(6): 1192-1204, http://lib.cqvip.com/Qikan/Article/Detail?id=7002192926.[59] Dongrui Fan. iATPG: Instruction-level Automatic Test Program Generation for Vulnerability under DVFS Attack. 2019 IEEE 25th International Symposium on On-Line Testing and Robust System Design (IOLTS). 2019, 第 1 作者 通讯作者 [60] 李易, 常成娟, 卢圣健, 江道忠, 范东睿, 叶笑春. 面向数据流结构的指令映射优化方法. 计算机工程与科学[J]. 2019, 第 5 作者41(1): 9-13, http://lib.cqvip.com/Qikan/Article/Detail?id=7001148810.[61] Dongrui Fan. A Sharing Path Awareness Scheduling Algorithm for Dataflow Architecture. HPCC. 2019, 第 1 作者[62] 范东睿. 面向数据流结构的指令内存访存冲突优化研究. 计算机研究与发展. 2019, 第 1 作者[63] Dongrui Fan. C-MAP: Improving the Effectiveness of Mapping Method for CGRA by Reducing NoC Congestion. HPCC 2019. 2019, 第 1 作者[64] 欧焱, 冯煜晶, 李文明, 叶笑春, 王达, 范东睿. 面向数据流结构的指令内访存冲突优化研究. 计算机研究与发展[J]. 2019, 第 6 作者56(12): 2720-2732, http://lib.cqvip.com/Qikan/Article/Detail?id=7100658631.[65] NinghuiSUN, YungangBAO, DongruiFAN. The rise of high-throughput computing. Frontiers of Information Technology & Electronic Engineering[J]. 2018, 第 3 作者19(10): 1245-1250, https://journal.hep.com.cn/ckcest/fitee/EN/10.1631/FITEE.1800501.[66] Tang, GuangMing, Qu, PeiYao, Ye, XiaoChun, Fan, DongRui, Sun, NingHui. 32-Bit 4 x 4 Bit-Slice RSFQ Matrix Multiplier. IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY[J]. 2018, 第 4 作者28(7): https://www.webofscience.com/wos/woscc/full-record/WOS:000435190700001.[67] Xie, Xiaolong, Liang, Yun, Li, Xiuhong, Wu, Yudong, Sun, Guangyu, Wang, Tao, Fan, Dongrui. CRAT: Enabling Coordinated Register Allocation and Thread-Level Parallelism Optimization for GPUs. IEEE TRANSACTIONS ON COMPUTERS[J]. 2018, 第 7 作者67(6): 890-897, https://www.webofscience.com/wos/woscc/full-record/WOS:000431902600010.[68] Xiang Taoran, Feng Yujing, Ye Xiaochun, Tan Xu, Li Wenming, Zhu Yatao, Wu Meng, Zhang Hao, Fan Dongrui, IEEE. Accelerating CNN Algorithm with Fine-grained Dataflow Architectures. IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS). 2018, 第 9 作者243-251, http://dx.doi.org/10.1109/HPCC/SmartCity/DSS.2018.00063.[69] Feng Yujing, Li Han, Tan Xu, Ye Xiaochun, Fan Dongrui, Tang Zhimin, IEEE. Optimizing network efficiency of dataflow architectures through dynamic packet merging. 2018 NINTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC). 2018, 第 5 作者http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000484460900038.[70] Xu Tan, Xiao-Chun Ye, Xiao-Wei Shen, Yuan-Chao Xu, Da Wang, Lunkai Zhang, Wen-Ming Li, Dong-Rui Fan, Zhi-Min Tang. A Pipelining Loop Optimization Method for Dataflow Architecture. 计算机科学技术学报:英文版[J]. 2018, 第 8 作者33(1): 116-130, http://lib.cqvip.com/Qikan/Article/Detail?id=674567291.[71] Tang, GuangMing, Qu, PeiYao, Ye, XiaoChun, Fan, DongRui. Logic Design of a 16-bit Bit-Slice Arithmetic Logic Unit for 32-/64-bit RSFQ Microprocessors. IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY[J]. 2018, 第 4 作者28(4): https://www.webofscience.com/wos/woscc/full-record/WOS:000425742900001.[72] Tan, Xu, Ye, XiaoChun, Shen, XiaoWei, Xu, YuanChao, Wang, Da, Zhang, Lunkai, Li, WenMing, Fan, DongRui, Tang, ZhiMin. A Pipelining Loop Optimization Method for Dataflow Architecture. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2018, 第 8 作者33(1): 116-130, http://lib.cqvip.com/Qikan/Article/Detail?id=674567291.[73] 冯煜晶, 欧焱, 叶笑春, 范东睿, 谭旭, 唐志敏. 基于网络负载特征感知的数据流指令调度机制研究. 高技术通讯[J]. 2018, 第 4 作者28(11): 885-898, http://lib.cqvip.com/Qikan/Article/Detail?id=7001166774.[74] Ninghui SUN, Yungang BAO, Dongrui FAN. The rise of high-throughput computing. 信息与电子工程前沿:英文版[J]. 2018, 第 3 作者19(10): 1245-1250, http://lib.cqvip.com/Qikan/Article/Detail?id=676786551.[75] Tan, Xu, Shen, XiaoWei, Ye, XiaoChun, Wang, Da, Fan, DongRui, Zhang, Lunkai, Li, WenMing, Zhang, ZhiMin, Tang, ZhiMin. A Non-Stop Double Buffering Mechanism for Dataflow Architecture. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2018, 第 5 作者 通讯作者 33(1): 145-157, http://lib.cqvip.com/Qikan/Article/Detail?id=674567293.[76] Li Wenming, Ye Xiaochun, Wang Da, Zhang Hao, Wu Dongdong, Zhang Zhimin, Fan Dongrui, Chen JJ, Yang LT. WEAVER: An Energy Efficient, General-Purpose Acceleration Architecture for String Operations in Big Data Applications. 2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONS. 2018, 第 7 作者47-54, [77] 范东睿, 叶笑春. 众核处理器:高端计算的核心引擎. 前沿科学[J]. 2018, 第 1 作者12(4): 32-36, http://lib.cqvip.com/Qikan/Article/Detail?id=7001585981.[78] Feng Yujing, Xiang Taoran, Ye Xiaochun, Fan Dongrui, Wang Da, Wu Dongdong, Tang Zhimin, IEEE. Optimizing the efficiency of data transfer in dataflow architectures. IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS). 2018, 第 4 作者140-149, http://dx.doi.org/10.1109/HPCC/SmartCity/DSS.2018.00050.[79] Fan, Dongrui, Li, Wenming, Ye, Xiaochun, Wang, Da, Zhang, Hao, Tang, Zhimin, Sun, Ninghui, IEEE. SmarCo: An Efficient Many-Core Processor for High-Throughput Applications in Datacenters. 2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA). 2018, 第 1 作者596-607, [80] Shen, XiaoWei, Ye, XiaoChun, Tan, Xu, Wang, Da, Zhang, Lunkai, Li, WenMing, Zhang, ZhiMin, Fan, DongRui, Sun, NingHui. An Efficient Network-on-Chip Router for Dataflow Architecture. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2017, 第 8 作者32(1): 11-25, [81] 申小伟, 叶笑春, 王达, 张浩, 王飞, 谭旭, 张志敏, 范东睿, 唐志敏, 孙凝晖. 一种面向科学计算的数据流优化方法. 计算机学报[J]. 2017, 第 8 作者40(9): 2181-2196, http://lib.cqvip.com/Qikan/Article/Detail?id=673042586.[82] 张洋, 李文明, 叶笑春, 王达, 范东睿, 李宏亮, 唐志敏, 孙凝晖. LFF:一种面向大数据应用的众核处理器访存公平性调度机制. 高技术通讯[J]. 2017, 第 5 作者27(2): 103-111, http://lib.cqvip.com/Qikan/Article/Detail?id=672300314.[83] Dongrui Fan. An Adaptive Tuning Sparse Fast Fourier Transform. Pacific-Rim Conference on Multimedia (PCM). 2017, 第 1 作者[84] 胡九川, 范东睿, 李丹萍, 严龙, 叶笑春. 一种支持数据渗透迁移的片上缓存模型研究. 北京交通大学学报:自然科学版[J]. 2017, 第 2 作者41(5): 1-9, http://lib.cqvip.com/Qikan/Article/Detail?id=674102938.[85] 刘炳涛, 王达, 叶笑春, 范东睿, 张志敏, 唐志敏. 基于数据流块的空间指令调度方法. 计算机研究与发展[J]. 2017, 第 4 作者54(4): 750-763, http://lib.cqvip.com/Qikan/Article/Detail?id=7000192386.[86] Chu Yi, Luo Chuan, Huang Wenxuan, You Haihang, Fan Dongrui, IEEE. Hard Neighboring Variables Based Configuration Checking in Stochastic Local Search for Weighted Partial Maximum Satisfiability. 2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017). 2017, 第 5 作者139-146, [87] Zhu Yatao, Zhang Shuai, Ye Xiaochun, Wang Da, Tan Xu, Fan Dongrui, Zhang Zhimin, Li Hongliang, IEEE. An Energy-efficient Bandwidth Allocation Method for Single-chip Heterogeneous Processor. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC). 2016, 第 6 作者http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000402169700033.[88] Hu Jiuchuan, Fan Dongrui, Li Danping, Yan Long, Ye Xiaochun, IEEE. A Percolation Data Migration Schema in A Hybrid Cache Hierarchy. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC). 2016, 第 2 作者http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000402169700006.[89] Zhu Yatao, Ye Xiaochun, Wang Da, Li Wenming, Zhang Yang, Fan Dongrui, Zhang Zhimin, Tang Zhimin, IEEE. A Framework for Energy-efficient Optimization on Multi-Cores. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC). 2016, 第 6 作者http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000402169700032.[90] 张洋, 王达, 叶笑春, 朱亚涛, 范东睿, 李宏亮, 谢向辉. 众核处理器片上网络的层次化全局自适应路由机制. 计算机研究与发展[J]. 2016, 第 5 作者53(6): 1211-1220, http://lib.cqvip.com/Qikan/Article/Detail?id=669061058.[91] Shen Xiaowei, Ye Xiaochun, Tan Xu, Wang Da, Zhang Zhimin, Tang Zhimin, Fan Dongrui, IEEE. Memory Partition for SIMD in Streaming Dataflow Architectures. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC). 2016, 第 7 作者http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000402169700035.[92] Wang Fei, Wang Da, Yang Haigang, Xie Xianghui, Fan Dongrui. On-Chip Generating FPGA Test Configuration Bitstreams to Reduce Manufacturing Test Time. CHINESE JOURNAL OF ELECTRONICS[J]. 2016, 第 5 作者25(1): 64-70, http://lib.cqvip.com/Qikan/Article/Detail?id=667783130.[93] 刘炳涛, 王达, 叶笑春, 张浩, 范东睿, 张志敏. 一种缓存数据流信息的处理器前端设计. 计算机研究与发展[J]. 2016, 第 5 作者53(6): 1221-1237, http://lib.cqvip.com/Qikan/Article/Detail?id=669061059.[94] 刘炳涛, 王达, 叶笑春, 张浩, 范东睿, 张志敏. 一种缓存数据流信息的处理器前端设计. 计算机研究与发展[J]. 2016, 第 5 作者53(6): 1221-1237, http://lib.cqvip.com/Qikan/Article/Detail?id=669061059.[95] 李国杰, 范东睿. 面向高通量计算的可扩展、高效能并行微结构研究立项报告. 科技创新导报[J]. 2016, 第 2 作者13(9): 168-168, http://lib.cqvip.com/Qikan/Article/Detail?id=669805509.[96] Sheikh, Hafiz Fahad, Ahmad, Ishfaq, Fan, Dongrui. An Evolutionary Technique for Performance-Energy-Temperature Optimized Scheduling of Parallel Tasks on Multi-Core Processors. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS[J]. 2016, 第 3 作者 通讯作者 27(3): 668-681, http://dx.doi.org/10.1109/TPDS.2015.2421352.[97] Hu, Jiuchuan, Fan, Dongrui, Li, Danping, Yan, Long, Ye, Xiaochun, IEEE. On the Properties of Data Migration Based on Topology Pattern Keeping On Cache Hierarchy. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC). 2016, 第 2 作者http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000402169700007.[98] Shen Xiaowei, Ye Xiaochun, Tan Xu, Wang Da, Zhang Zhimin, Fan Dongrui, Tang Zhimin, IEEE. POSTER: An Optimization of Dataflow Architectures for Scientific Applications. 2016INTERNATIONALCONFERENCEONPARALLELARCHITECTUREANDCOMPILATIONTECHNIQUESPACT. 2016, 第 6 作者441-442, http://dx.doi.org/10.1145/2967938.2974054.[99] Qi Yuqiong, Ma Lina, Li Wenming, Ye Xiaochun, Wang Da, Fan Dongrui, Sun Ninghui, Chen J, Yang LT. ACCC: An Acceleration Mechanism for Character Operation based on Cache Computing in Big Data Applications. PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS). 2016, 第 6 作者608-615, http://dx.doi.org/10.1109/HPCC-SmartCity-DSS.2016.56.[100] 李文明, 叶笑春, 张洋, 宋风龙, 王达, 唐士斌, 范东睿, 谢向辉. BDSim:面向大数据应用的组件化高可配并行模拟框架. 计算机学报[J]. 2015, 第 7 作者38(10): 1959-1975, http://lib.cqvip.com/Qikan/Article/Detail?id=666506311.[101] Li Wenming, Fan Lingjun, Wang Zihou, Ye Xiaochun, Wang Da, Zhang Hao, Zhang Liang, Fan Dongrui, Xie Xianghui, IEEE. Thread ID Based Power Reduction Mechanism for Multi-thread Shared Set-associative Caches. 2015 SIXTH INTERNATIONAL GREEN COMPUTING CONFERENCE AND SUSTAINABLE COMPUTING CONFERENCE (IGSC). 2015, 第 8 作者http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000380428700018.[102] 高珂, 陈荔城, 范东睿, 刘志勇. 多核系统共享内存资源分配和管理研究. 计算机学报[J]. 2015, 第 3 作者38(5): 1020-1034, http://lib.cqvip.com/Qikan/Article/Detail?id=664815060.[103] Li Wenming, Zhang Liang, Ye Xiaochun, Wang Da, Zhang Hao, Wang Zihou, Fan Dongrui, IEEE. A High-Density Data Path Implementation fitting for HTC Applications. 2015 SIXTH INTERNATIONAL GREEN COMPUTING CONFERENCE AND SUSTAINABLE COMPUTING CONFERENCE (IGSC). 2015, 第 7 作者http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000380428700059.[104] 高珂, 范东睿, 刘志勇. 一种缓解多线程访存干扰的VRB内存机制. 计算机研究与发展[J]. 2015, 第 2 作者52(11): 2577-2588, http://lib.cqvip.com/Qikan/Article/Detail?id=666660942.[105] 李文明, 叶笑春, 王达, 郑方, 李宏亮, 林晗, 范东睿, 孙凝晖. MACT:高通量众核处理器离散访存请求批量处理机制. 计算机研究与发展[J]. 2015, 第 7 作者52(6): 1254-1265, http://lib.cqvip.com/Qikan/Article/Detail?id=665059268.[106] 朱亚涛, 张帅, 王达, 叶笑春, 张洋, 胡九川, 张志敏, 范东睿, 李宏亮. EOFDM:一种面向众核架构的最低能耗搜索方法. 计算机研究与发展[J]. 2015, 第 8 作者52(6): 1303-1315, http://lib.cqvip.com/Qikan/Article/Detail?id=665059273.[107] Gupta, Sandeep K S, Fan, Dongrui. Introduction to special issue on Selected Papers from 2013 International Green Computing Conference. SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS. 2015, 第 2 作者6: 1-2, http://dx.doi.org/10.1016/j.suscom.2015.01.001.[108] 朱亚涛, 张帅, 王达, 叶笑春, 张洋, 胡九川, 张志敏, 范东睿, 李宏亮. EOFDM:一种面向众核架构的最低能耗搜索方法. 计算机研究与发展[J]. 2015, 第 8 作者52(6): 1303-1315, http://lib.cqvip.com/Qikan/Article/Detail?id=665059273.[109] Xie Xiaolong, Liang Yun, Li Xiuhong, Wu Yudong, Sun Guangyu, Wang Tao, Fan Dongrui, ACM. Enabling Coordinated Register Allocation and Thread-level Parallelism Optimization for GPUs. PROCEEDINGS OF THE 48TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-48). 2015, 第 7 作者395-406, http://dx.doi.org/10.1145/2830772.2830813.[110] Sandeep K.S. Gupta, Dongrui Fan. Introduction to special issue on Selected Papers from 2013 International Green Computing Conference. SUSTAINABLECOMPUTINGINFORMATICSANDSYSTEMS. 2015, 第 2 作者6: 1-2, http://dx.doi.org/10.1016/j.suscom.2015.01.001.[111] 李文明, 叶笑春, 张洋, 宋风龙, 王达, 唐士斌, 范东睿, 谢向辉. BDSim:面向大数据应用的组件化高可配并行模拟框架. 计算机学报[J]. 2015, 第 7 作者38(10): 1959-1975, http://lib.cqvip.com/Qikan/Article/Detail?id=666506311.[112] 李文明, 叶笑春, 王达, 郑方, 李宏亮, 林晗, 范东睿, 孙凝晖. MACT:高通量众核处理器离散访存请求批量处理机制. 计算机研究与发展[J]. 2015, 第 7 作者52(6): 1254-1265, http://lib.cqvip.com/Qikan/Article/Detail?id=665059268.[113] 范东睿. HD-NoC:面向高通量应用的高密度片上网络实现机制. HPC-China. 2015, 第 1 作者[114] 唐士斌, 宋风龙, 张帅, 范东睿, 刘志勇. 基于全局同步逻辑时间的访存依赖约减方法. 计算机学报[J]. 2014, 第 4 作者37(7): 1487-1499, http://lib.cqvip.com/Qikan/Article/Detail?id=662044928.[115] 汤旭龙, 安虹, 范东睿. 主流视频编解码软件的硬件性能分析与设计. 计算机工程[J]. 2014, 第 3 作者40(6): 300-305, http://lib.cqvip.com/Qikan/Article/Detail?id=50016433.[116] Chen, Zheng, Gu, Huaxi, Yang, Yintang, Fan, Dongrui. A Hierarchical Optical Network-On-Chip Using Central-Controlled Subnet and Wavelength Assignment. JOURNAL OF LIGHTWAVE TECHNOLOGY[J]. 2014, 第 4 作者32(5): 930-938, https://www.webofscience.com/wos/woscc/full-record/WOS:000330129500008.[117] 魏海涛, 秦明康, 于俊清, 范东睿. 一种面向众核架构的数据流编译框架. 计算机学报[J]. 2014, 第 4 作者37(7): 1560-1569, http://lib.cqvip.com/Qikan/Article/Detail?id=662044935.[118] Zhang, Na, Gu, Huaxi, Yang, Yintang, Fan, Dongrui. QBNoC: QoS-aware bufferless NoC architecture. MICROELECTRONICS JOURNAL[J]. 2014, 第 4 作者45(6): 751-758, http://dx.doi.org/10.1016/j.mejo.2014.04.015.[119] Chen, Ke, Gu, Huaxi, Yang, Yintang, Fan, Dongrui. A Novel Two-Layer Passive Optical Interconnection Network for On-Chip Communication. JOURNAL OF LIGHTWAVE TECHNOLOGY[J]. 2014, 第 4 作者32(9): 1770-1776, https://www.webofscience.com/wos/woscc/full-record/WOS:000334741300004.[120] Zhang Lunkai, Strukov Dmitri, Saadeldeen Hebatallah, Fan Dongrui, Zhang Mingzhe, Franklin Diana, IEEE. SpongeDirectory: Flexible Sparse Directories Utilizing Multi-Level Memristors. PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14). 2014, 第 4 作者61-73, [121] 孙公瑾, 安虹, 范东睿. 多标准视频编码器下的运动估计评估. 计算机工程[J]. 2014, 第 3 作者40(4): 295-300,304, http://lib.cqvip.com/Qikan/Article/Detail?id=49246178.[122] Song, Fenglong, Tang, Shibin, Li, Wenming, Miao, Futao, Zhang, Hao, Fan, Dongrui, Liu, Zhiyong. CRANarch: A feasible processor micro-architecture for Cloud Radio Access Network. MICROPROCESSORS AND MICROSYSTEMS[J]. 2014, 第 6 作者38(8): 1025-1036, http://dx.doi.org/10.1016/j.micpro.2014.08.003.[123] 熊海泉, 刘志勇, 徐卫志, 唐士斌, 范东睿. VMM中Guest OS非陷入系统调用指令截获与识别. 计算机研究与发展[J]. 2014, 第 5 作者51(10): 2348-2359, http://lib.cqvip.com/Qikan/Article/Detail?id=662435628.[124] 张轮凯, 宋风龙, 王达, 范东睿, 孙凝晖. 提升稀疏目录缓存一致性系统性能的方法. 计算机研究与发展[J]. 2014, 第 4 作者51(9): 1955-1970, http://lib.cqvip.com/Qikan/Article/Detail?id=662178137.[125] Dongrui Fan. BDSim : A component-based high configurable parallel simulation framework for big-data application evaluation. CCF Bigdata2014. 2014, 第 1 作者 通讯作者 [126] 郑亚松, 王达, 叶笑春, 崔慧敏, 徐远超, 范东睿. MALK:一种高效处理大规模键值的MapReduce框架. 计算机研究与发展[J]. 2014, 第 6 作者51(12): 2711-2723, http://lib.cqvip.com/Qikan/Article/Detail?id=663245478.[127] 徐冉冉, 孟海波, 桂小琰, 申小伟, 安述倩. 面向门级网表的VLSI三模冗余加固设计. 计算机工程与科学[J]. 2014, 36(12): 2355-2360, http://lib.cqvip.com/Qikan/Article/Detail?id=663226939.[128] Song, Fenglong, Zheng, Yasong, Miao, Futao, Ye, Xiaochun, Zhang, Hao, Fan, Dongrui, Liu, Zhiyong, IEEE. Low Execution Efficiency: When General Multi-Core Processor Meets Wireless Communication Protocol. 2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC). 2013, 第 6 作者906-913, http://dx.doi.org/10.1109/HPCC.and.EUC.2013.129.[129] Zhang Shuai, Liu Zhiyong, Fan Dongrui, Song Fonglong, Zhang Mingzhe, IEEE. Energy-Performance Modeling and Optimization of Parallel Computing in On-Chip Networks. 2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013). 2013, 第 3 作者879-886, [130] Ye Xiaochun, Fan Dongrui, Sun Ninghui, Tang Shibin, Zhang Mingzhe, Zhang Hao, IEEE. SimICT: A Fast and Flexible Framework for Performance and Power Evaluation of Large-Scale Architecture. 2013IEEEINTERNATIONALSYMPOSIUMONLOWPOWERELECTRONICSANDDESIGNISLPED. 2013, 第 2 作者273-278, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000337238700048.[131] Ding, Hui, Gu, Huaxi, Yang, Yintang, Fan, Dongrui. 3D Networks-on-Chip mapping targeting minimum signal TSVs. IEICE ELECTRONICS EXPRESS[J]. 2013, 第 4 作者10(18): https://www.webofscience.com/wos/woscc/full-record/WOS:000326194900004.[132] Wei, Haitao, Qin, Mingkang, Zhang, Weiwei, Yu, Junqing, Fan, Dongrui, Gao, Guang R. StreamTMC: Stream compilation for tiled multi-core architectures. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING[J]. 2013, 第 5 作者73(4): 484-494, http://dx.doi.org/10.1016/j.jpdc.2012.12.001.[133] 吕慧伟, 程元, 白露, 陈明宇, 范东睿, 孙凝晖. 众核处理器和众核集群的并行模拟. 计算机研究与发展[J]. 2013, 第 5 作者50(5): 1110-1117, http://lib.cqvip.com/Qikan/Article/Detail?id=45617364.[134] Dongrui Fan. International Symposium on Low Power Electronics and Desig. International Symposium on Low Power Electronics and Design. 2013, 第 1 作者[135] Zhang Mingzhe, Wang Da, Ye Xiaochun, He Liqiang, Fan Dongrui, Liu Zhiyong, IEEE. A Path-Adaptive Opto-Electronic Hybrid NoC for Chip Multi-Processor. 2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013). 2013, 第 5 作者1198-1205, [136] 范涛, 刘高辉, 叶笑春, 李文明, 宋爽, 范东睿. SPARC平台模拟器源码级调试系统的研究与实现. 计算机工程与应用[J]. 2013, 第 6 作者49(4): 65-70, http://lib.cqvip.com/Qikan/Article/Detail?id=44810940.[137] Dongrui Fan. An Efficient Parallel Mechanism for Highly-Debuggable Multicore Simulator. International Conference on Advanced Parallel Processing Technology (APPT). 2013, 第 1 作者[138] 张帅, 宋风龙, 王栋, 刘志勇, 范东睿. 多核结构片上网络性能-能耗分析及优化方法. 计算机学报[J]. 2013, 第 5 作者36(5): 988-1003, http://lib.cqvip.com/Qikan/Article/Detail?id=45850220.[139] 范灵俊, 徐远超, 施巍松, 范东睿, 娄杰. 针对组相联缓存的无效缓存路访问混合过滤机制研究. 计算机学报[J]. 2013, 第 4 作者36(4): 799-807, http://lib.cqvip.com/Qikan/Article/Detail?id=45976851.[140] Peng, Liu, Tan, Guangming, Kalia, Rajiv K, Nakano, Aiichiro, Vashishta, Priya, Fan, Dongrui, Zhang, Hao, Song, Fenglong. Scalability study of molecular dynamics simulation on Godson-T many-core architecture. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING[J]. 2013, 第 6 作者73(11): 1469-1482, http://dx.doi.org/10.1016/j.jpdc.2012.07.007.[141] 范东睿. MALK——面向共享存储多核系统高效处理大规模键值的MapReduce框架. CCF BigData2013. 2013, 第 1 作者 通讯作者 [142] Cui, Huimin, Xue, Jingling, Wang, Lei, Yang, Yang, Feng, Xiaobing, Fan, Dongrui. Extendable Pattern-Oriented Optimization Directives. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION[J]. 2012, 第 6 作者9(3): [143] Jiao Shuai, Ienne Paolo, Ye Xiaochun, Wang Da, Fan Dongrui, Sun Ninghui, Kaklamanis C, Papatheodorou T, Spirakis PG. CRAW/P: A Workload Partition Method for the Efficient Parallel Simulation of Manycores. EURO-PAR 2012 PARALLEL PROCESSING. 2012, 第 5 作者7484: 102-114, [144] Xu Weizhi, Liu Zhiyong, Wu Jun, Ye Xiaochun, Jiao Shuai, Wang Da, Song Fenglong, Fan Dongrui, IEEE. Auto-Tuning GEMV on Many-Core GPU. PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012). 2012, 第 8 作者30-36, [145] Dongrui Fan. Self-correction trace model: A full-system simulator for optical network-on-chip. Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2012. 2012, 第 1 作者[146] Wang Da, Zhang Lunkai, Xu Weizhi, Fan Dongrui, Wang Fei, IEEE. A SAT-Based Diagnosis Pattern Generation Method for Timing Faults in Scan Chains. 2012 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 2012). 2012, 第 4 作者2308-2312, [147] Fan, Dongrui, Zhang, Hao, Wang, Da, Ye, Xiaochun, Song, Fenglong, Li, Guojie, Sun, Ninghui. GODSON-T: AN EFFICIENT MANY-CORE PROCESSOR EXPLORING THREAD-LEVEL PARALLELISM. IEEE MICRO[J]. 2012, 第 1 作者32(2): 38-47, https://www.webofscience.com/wos/woscc/full-record/WOS:000302458600007.[148] Dongrui Fan. Godson-T-- High-Efficient Architecture of Godson-T Many-Core Processor. HotChips. 2011, 第 1 作者[149] Dongrui Fan. An Efficient and Flexible Task Management for Many Cores. LNCS Transactions on High-Performance Embedded Architectures and Compilers. 2011, 第 1 作者[150] 马宜科, 常晓涛, 范东睿, 刘志勇. 混合体系结构中有状态硬件加速器的优化. 计算机学报[J]. 2011, 第 3 作者34(7): 1314-1322, http://lib.cqvip.com/Qikan/Article/Detail?id=38725757.[151] Da Wang, Dongrui Fan, Yu Hu. A Case Study: Low Power Design-for-Testability Features of a Multi-core Processor Godson-T. ADVANCED MATERIALS RESEARCH. 2011, 第 2 作者1359: [152] 范灵俊, 颜成钢, 宋风龙, 马宜科, 范东睿. H.264去块滤波算法在众核结构上的并行优化. 小型微型计算机系统[J]. 2011, 第 5 作者32(11): 2263-2267, http://xwxt.sict.ac.cn/CN/Y2011/V32/I11/2263.[153] 焦帅, 徐卫志, 唐士斌, 范东睿, 孙凝晖. PartitionSim:一个面向众核结构的并行模拟器. 计算机学报[J]. 2011, 第 4 作者34(11): 2084-2092, http://lib.cqvip.com/Qikan/Article/Detail?id=40083654.[154] Lei Yu, Zhi Yong Liu, Dong Rui Fan, Yike Ma, Feng Long Song, Xiao Chun Ye, Wei Zhi Xu. Study on the Mapping of Streaming Application on Many-Core Architecture. APPLIED MECHANICS AND MATERIALS. 2011, 第 3 作者1287: [155] Fan, DongRui, Li, XiaoWei, Li, GuoJie. New Methodologies for Parallel Architecture. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2011, 第 1 作者 通讯作者 26(4): 578-587, http://lib.cqvip.com/Qikan/Article/Detail?id=38447509.[156] Peng Liu, Tan Guangming, Kalia Rajiv K, Nakano Aiichiro, Vashishta Priya, Fang Dongrui, Sun Ninghui, Guarracino MR, Vivien F, Traff JL, Cannataro M, Danelutto M, Hast A, Perla F, Knupfer A, DiMartino B, Alexander M. Preliminary Investigation of Accelerating Molecular Dynamics Simulation on Godson-T Many-Core Processor. EURO-PAR 2010 PARALLEL PROCESSING WORKSHOPS. 2011, 6586: 349-356, [157] Cui, Huimin, Xue, Jingling, Wang, Lei, Yang, Yang, Feng, Xiaobing, Fan, Dongrui, IEEE. Extendable Pattern-Oriented Optimization Directives. 2011 9TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO). 2011, 第 6 作者107-118, [158] Dongrui Fan. Optimizing web browser on many-core architectures. 2011, 第 1 作者 通讯作者 [159] Peng, Liu, Nakano, Aiichiro, Tan, Guangming, Vashishta, Priya, Fan, Dongrui, Zhang, Hao, Kalia, Rajiv K, Song, Fenglong, ACM. Performance Analysis and Optimization of Molecular Dynamics Simulation on Godson-T Many-core Processor. PROCEEDINGS OF THE 2011 8TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS (CF 2011). 2011, 第 5 作者http://dx.doi.org/10.1145/2016604.2016643.[160] Lei Yu, Zhi Yong Liu, Dong Rui Fan, Yi Ke Ma, Feng Long Song, Xiao Chun Ye, Wei Zhi Xu. Mapping Routing Lookup Algorithm on Many-Core Architecture Based on SPM and Cache Mixed Method. APPLIED MECHANICS AND MATERIALS. 2011, 第 3 作者1287: [161] Dongrui Fan. Thread Owned Block Cache: Managing Latency in Many-Core Architecture. International Conference on Parallel Computing (Euro-Par). 2010, 第 1 作者[162] 包尔固德, 李伟生, 范东睿, 杨扬, 马啸宇. Godson-T众核体系结构上的Broadcast性能优化. 计算机研究与发展[J]. 2010, 第 3 作者524-531, http://lib.cqvip.com/Qikan/Article/Detail?id=33116075.[163] Silvano, Cristina, Fornaciari, William, Palermo, Gianluca, Zaccaria, Vittorio, Castro, Fabrizio, Martinez, Marcos, Bocchio, Sara, Zafalon, Roberto, Avasare, Prabhat, Vanmeerbeeck, Geert, YkmanCouvreur, Chantal, Wouters, Maryse, Kavka, Carlos, Onesti, Luka, Turco, Alessandro, Bondi, Umberto, Mariani, Giovanni, Posadas, Hector, Villar, Eugenio, Wu, Chris, Fan Dongrui, Hao, Zhang, Tang Shibin, IEEE Comp Soc. MULTICUBE: Multi-Objective Design Space Exploration of Multi-Core Architectures. IEEE ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2010). 2010, 第 21 作者488-493, [164] Cui, HuiMin, Wang, Lei, Fan, DongRui, Feng, XiaoBing. Landing Stencil Code on Godson-T. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2010, 第 3 作者25(4): 886-894, http://lib.cqvip.com/Qikan/Article/Detail?id=34470262.[165] 叶笑春, 林伟, 范东睿, 张浩. 蛋白质序列比对算法在众核结构上的并行优化. 软件学报[J]. 2010, 第 3 作者3094-3105, http://lib.cqvip.com/Qikan/Article/Detail?id=36056005.[166] Dongrui Fan. High Performance Comparison-Based Sorting Algorithm on Many-Core GPUs. International Parallel and Distributed Processing Symposium (IPDPS). 2010, 第 1 作者[167] 崔慧敏, 王蕾, 范东睿, 冯晓兵. Landing Stencil Code on Godson-T. 计算机科学技术学报(英文版)[J]. 2010, 第 3 作者886-894, http://lib.cqvip.com/Qikan/Article/Detail?id=34470262.[168] 徐卫志, 宋风龙, 范东睿, 余磊, 张帅, 刘志勇. 众核处理器片上同步机制和评估方法研究. 计算机学报[J]. 2010, 第 3 作者1777-1787, http://lib.cqvip.com/Qikan/Article/Detail?id=35344799.[169] Dongrui Fan. Efficient Address Mapping of Shared Cache for On-Chip Many-Core Architecture. International Conference on Parallel Computing (Euro-Par). 2010, 第 1 作者[170] Dongrui Fan. P-GAS: Parallelizing a cycle-accurate event-driven many-core processor simulator using parallel discrete event simulation. 2010, 第 1 作者 通讯作者 [171] Dongrui Fan. GVE: Godson-T verification engine for many-core architecture rapid prototyping and debugging. 2010, 第 1 作者 通讯作者 [172] Dongrui Fan. Minimal Multi-Threading: Finding and Removing Redundant Instructions in Multi-Threaded Processors. International Symposium on Microarchitecture (Micro). 2010, 第 1 作者 通讯作者 [173] Yu Lei, Liu Zhiyong, Fan Dongrui, Song Fenglong, Zhang Junchao, Yuan Nan, IEEE COMPUTER SOC. Study on Fine-grained Synchronization in Many-Core Architecture. SNPD 2009: 10TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCES, NETWORKING AND PARALLEL DISTRIBUTED COMPUTING, PROCEEDINGS. 2009, 第 3 作者524-529, http://dx.doi.org/10.1109/SNPD.2009.61.[174] Dongrui Fan. Evaluation method of synchronization for shared-memory on-chip many-core processor. Proceedings - 2009 IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2009. 2009, 第 1 作者[175] Yuan Nan, Zhou Yongbin, Tan Guangming, Zhang Junchao, Fan Dongrui, Sips H, Epema D, Lin HX. High Performance Matrix Multiplication on Many Cores. EURO-PAR 2009: PARALLEL PROCESSING, PROCEEDINGS. 2009, 第 5 作者5704: 948-959, [176] Dongrui Fan. Design of new hash mapping functions. 2009, 第 1 作者[177] Dongrui Fan. GFFC: The global feedback based flow control in the NoC design for many-core processor. NPC 2009 - 6th International Conference on Network and Parallel Computing. 2009, 第 1 作者[178] DongRui Fan, Nan Yuan, JunChao Zhang, YongBin Zhou, Wei Lin, FengLong Song, XiaoChun Ye, He Huang, Lei Yu, GuoPing Long, Hao Zhang, Lei Liu. Godson-T: An Efficient Many-Core Architecture for Parallel Program Executions. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY,[J]. 2009, 第 1 作者 通讯作者 24(6): 1061-1073, https://www.webofscience.com/wos/woscc/full-record/WOS:000271535700008.[179] Fan, DongRui, Yuan, Nan, Zhang, JunChao, Zhou, YongBin, Lin, Wei, Song, FengLong, Ye, XiaoChun, Huang, He, Yu, Lei, Long, GuoPing, Zhang, Hao, Liu, Lei. Godson-T: An Efficient Many-Core Architecture for Parallel Program Executions. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2009, 第 1 作者 通讯作者 24(6): 1061-1073, http://lib.cqvip.com/Qikan/Article/Detail?id=32022578.[180] Dongrui Fan. A fast linear-space sequence alignment algorithm with dynamic parallelization framework. Proceedings - IEEE 9th International Conference on Computer and Information Technology, CIT 2009. 2009, 第 1 作者[181] Dongrui Fan. A synchronization-based alternative to directory protocol. Proceedings - 2009 IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2009. 2009, 第 1 作者 通讯作者 [182] Long, Guoping, Fan, Dongrui, Zhang, Junchao. Architectural Support for Cilk Computations on Many-core Architectures. ACM SIGPLAN NOTICES[J]. 2009, 第 2 作者44(4): 285-286, https://www.webofscience.com/wos/woscc/full-record/WOS:000272014600032.[183] Dongrui Fan. A low-complexity synchronization based cache coherence solution for many cores. Proceedings - IEEE 9th International Conference on Computer and Information Technology, CIT 2009. 2009, 第 1 作者[184] Dongrui Fan. Software and hardware cooperate for 1-D FFT algorithm optimization on multicore processors. Proceedings - IEEE 9th International Conference on Computer and Information Technology, CIT 2009. 2009, 第 1 作者[185] 龙国平, 范东睿. LU分解在Godson—Tv1众核体系结构上的并行化研究. 计算机学报[J]. 2009, 第 2 作者2157-2167, http://lib.cqvip.com/Qikan/Article/Detail?id=32080304.[186] 龙国平, 范东睿. LU分解在Godson-Tvl众核体系结构上的半行化研究. 计算机学报[J]. 2009, 第 2 作者32(11): 2157-2167, http://dx.doi.org/10.3724/SP.J.1016.2009.02157.[187] 张浩, 林伟, 周永彬, 叶笑春, 范东睿. 通用处理器的高带宽访存流水线研究. 计算机学报[J]. 2009, 第 5 作者142-151, http://lib.cqvip.com/Qikan/Article/Detail?id=29336464.[188] Dongrui Fan. Characterizing and understanding the bandwidth behavior of workloads on multi-core processors. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2009, 第 1 作者[189] 宋风龙, 刘志勇, 范东睿, 张军超, 余磊. 一种片上众核结构共享Cache动态隐式隔离机制研究. 计算机学报[J]. 2009, 第 3 作者1896-1904, http://lib.cqvip.com/Qikan/Article/Detail?id=31781012.[190] Zhou Yongbin, Zhang Junchao, Zhang Shuai, Yuan Nan, Fan Dongrui, Liao XF, Jin H, Zheng R, Zou DQ. Data Management: The Spirit to Pursuit Peak Performance on Many-Core Processor. 2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS, PROCEEDINGS. 2009, 第 5 作者559-564, http://dx.doi.org/10.1109/ISPA.2009.22.[191] Dongrui Fan. A Performance Model of Dense Matrix Operations on Many-core Architectures. International Conference on Parallel Computing (Euro-Par). 2008, 第 1 作者[192] 袁楠, 范东睿. 高性能代价比的两层关联间接转移预测器设计. 计算机学报[J]. 2008, 第 2 作者31(11): 1898-1906, http://lib.cqvip.com/Qikan/Article/Detail?id=28668923.[193] 龙国平, 张军超, 范东睿. 众核体系结构对Cilk语言的硬件支持及评测研究. 计算机学报[J]. 2008, 第 3 作者31(11): 1975-1985, http://lib.cqvip.com/Qikan/Article/Detail?id=28668931.[194] 段振中, 范东睿. JTAG调试通信接口的软件模拟. 微电子学与计算机[J]. 2008, 第 2 作者25(2): 157-159, http://lib.cqvip.com/Qikan/Article/Detail?id=26550273.[195] 许彤, 王朋宇, 黄海林, 范东睿, 朱鹏飞, 郑保建, 曹非. 嵌入式处理器在片调试功能的验证. 计算机辅助设计与图形学学报[J]. 2007, 第 4 作者19(4): 502-507, http://lib.cqvip.com/Qikan/Article/Detail?id=24260721.[196] 范东睿, 黄海林, 唐志敏. 嵌入式处理器TLB设计方法研究. 计算机学报[J]. 2006, 第 1 作者29(1): 73-80, http://lib.cqvip.com/Qikan/Article/Detail?id=21072974.[197] 黄海林, 范东睿, 许彤, 唐志敏. 嵌入式处理器中访存部件的低功耗设计研究. 计算机学报[J]. 2006, 第 2 作者29(5): 815-821, http://lib.cqvip.com/Qikan/Article/Detail?id=21884374.[198] 黄海林, 许彤, 范东睿, 唐志敏. 嵌入式处理器中降低Cache缺失代价设计方法研究. 小型微型计算机系统[J]. 2006, 第 3 作者27(11): 2077-2081, https://d.wanfangdata.com.cn/periodical/xxwxjsjxt200611019.[199] 黄海林, 范东睿, 许彤, 朱鹏飞, 郑保建, 曹非, 陈亮. 嵌入式处理器在片调试功能的设计与实现. 计算机辅助设计与图形学学报[J]. 2006, 第 2 作者18(7): 1005-1010, http://lib.cqvip.com/Qikan/Article/Detail?id=22439361.[200] 常晓涛, 范东睿, 韩银和, 张志敏. 应用输入向量控制技术降低漏电功耗的快速算法. 计算机研究与发展[J]. 2006, 第 2 作者43(5): 946-952, http://lib.cqvip.com/Qikan/Article/Detail?id=21816504.[201] 范东睿. 嵌入式处理器中TLB 设计方法研究. 计算机学报,. 2006, 第 1 作者[202] Dongrui Fan. An Energy Efficient TLB Design Methodology. International Symposium on Low Power Electronics and Design (ISLPED). 2005, 第 1 作者[203] Fan, DR, Yang, HB, Gao, GR, Zhao, RC. Evaluation and choice of various branch predictors for low-power embedded processor. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2003, 18(6): 833-838, http://lib.cqvip.com/Qikan/Article/Detail?id=8906949.[204] 蒋敬旗, 周旭, 李文, 范东睿. 系统芯片中低功耗测试的几种方法. 微电子学与计算机[J]. 2002, 第 4 作者19(10): 20-23, http://lib.cqvip.com/Qikan/Article/Detail?id=6962753.[205] 李文, 周旭, 范东睿, 蒋敬旗. 可测试性设计中的功耗优化技术. 贵州工业大学学报:自然科学版[J]. 2002, 第 3 作者31(4): 1-7, http://lib.cqvip.com/Qikan/Article/Detail?id=6763121.