发表论文
[1] Xinda Chen, Rongliang Fu, Junying Huang, Huawei Cao, Zhimin Zhang, Xiaochun Ye, Tsung-Yi Ho, Dongrui Fan. JRouter: A Multi-Terminal Hierarchical Length-Matching Router under Planar Manhattan Routing Model for RSFQ Circuits. GLSVLSInull. 2023, [2] 范志华, 吴欣欣, 李文明, 曹华伟, 安学军, 叶笑春, 范东睿. 面向低精度神经网络的数据流体系结构优化. 计算机研究与发展[J]. 2023, 60(1): 43-58, http://lib.cqvip.com/Qikan/Article/Detail?id=7108741862.[3] Liu, Xin, Yan, Mingyu, Deng, Lei, Li, Guoqi, Ye, Xiaochun, Fan, Dongrui. Sampling Methods for Efficient Training of Graph Convolutional Networks: A Survey. IEEE-CAA JOURNAL OF AUTOMATICA SINICAnull. 2022, 9(2): 205-234, http://dx.doi.org/10.1109/JAS.2021.1004311.[4] Mo Zou, Mingyu Yan, Wenming Li, Zhimin Tang, Xiaochun Ye, Dongrui Fan. GEM: Execution-Aware Cache Management for Graph Analytics. ICA3PPnull. 2022, [5] Zou, Mo, Zhang, Mingzhe, Wang, Rujia, Sun, XianHe, Ye, Xiaochun, Fan, Dongrui, Tang, Zhimin. Accelerating Graph Processing With Lightweight Learning-Based Data Reordering. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2022, 21(1): 5-8, http://dx.doi.org/10.1109/LCA.2022.3151087.[6] Rongliang Fu, Junying Huang, Haibin Wu, Xiaochun Ye, Dongrui Fan, Tsung-Yi Ho. JBNN: A Hardware Design for Binarized Neural Networks Using Single-Flux-Quantum Circuits. IEEE TRANSACTIONS ON COMPUTERS[J]. 2022, 771(12): 3203-3214, [7] Zhihua Fan, Wenming Li, Tianyu Liu, Xuejun An, Xiaochun Ye, Dongrui Fan. A Routing-Aware Mapping Method for Dataflow Architectures. International Conference on Network and Parallel Computingnull. 2022, [8] Zhihua Fan, Wenming Li, Tianyu Liu, Shengzhong Tang, Zhen Wang, Xuejun An, Xiaochun Ye, Dongrui Fan. A Loop Optimization Method for Dataflow Architecture. HPCCnull. 2022, [9] Junying Huang, Rongliang Fu, Xiaochun Ye, Dongrui Fan. A survey on superconducting computing technology: circuits, architectures and design tools. CCF Transactions on High Performance Computing[J]. 2022, [10] Sun, Gongjian, Yan, Mingyu, Wang, Duo, Li, Han, Li, Wenming, Ye, Xiaochun, Fan, Dongrui, Xie, Yuan. Multi-node Acceleration for Large-scale GCNs. IEEE TRANSACTIONS ON COMPUTERS[J]. 2022, [11] Feng, YuJing, Li, DeJian, Tan, Xu, Ye, XiaoChun, Fan, DongRui, Li, WenMing, Wang, Da, Zhang, Hao, Tang, ZhiMin. Accelerating Data Transfer in Dataflow Architectures Through a Look-Ahead Acknowledgment Mechanism. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2022, 37(4): 942-959, [12] Zhihua Fan, Wenming Li, Tianyu Liu, Shengzhong Tang, Zhen Wang, Xuejun An, Xiaochun Ye, Dongrui Fan. A Loop Optimization Method for Dataflow. High Performance Computing and Communicationsnull. 2022, [13] Yan, Mingyu, Zou, Mo, Yang, Xiaocheng, Li, Wenming, Ye, Xiaochun, Fan, Dongrui, Xie, Yuan. Characterizing and Understanding HGNNs on GPUs. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2022, 21(2): 69-72, [14] Liu, Xin, Yan, Mingyu, Deng, Lei, Li, Guoqi, Ye, Xiaochun, Fan, Dongrui, Pan, Shirui, Xie, Yuan. Survey on Graph Neural Network Acceleration: An Algorithmic Perspective. International Joint Conference on Artificial Intelligencenull. 2022, http://arxiv.org/abs/2202.04822.[15] Lin, Haiyang, Yan, Mingyu, Wang, Duo, Zou, Mo, Tu, Fengbin, Ye, Xiaochun, Fan, Dongrui, Xie, Yuan. Alleviating Datapath Conflicts and Design Centralization in Graph Analytics Acceleration. DESIGN AUTOMATION CONFERENCEnull. 2022, [16] Lin, Haiyang, Yan, Mingyu, Yang, Xiaocheng, Zou, Mo, Li, Wenming, Ye, Xiaochun, Fan, Dongrui. Characterizing and Understanding Distributed GNN Training on GPUs. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2022, 21(1): 21-24, http://dx.doi.org/10.1109/LCA.2022.3168067.[17] Wang, Yinshen, Li, Wenming, Liu, Tianyu, Zhou, Liangjiang, Wang, Bingnan, Fan, Zhihua, Ye, Xiaochun, Fan, Dongrui, Ding, Chibiao. Characterization and Implementation of Radar System Applications on a Reconfigurable Dataflow Architecture. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2022, 21(2): 121-124, [18] Xinxin Wu, Zhihua Fan, Tianyu Liu, Wenming Li, Xiaochun Ye, Dongrui Fan. LRP: Predictive output activation based on SVD approach for CNNs acceleration. Design, Automation and Test in Europenull. 2022, [19] 轩伟, 曹华伟, 严明玉, 唐志敏, 叶笑春, 范东睿. BSR-TC: Adaptively Sampling for Accurate Triangle Counting over Evolving Graph Streams. International Journal of Software Engineering and Knowledge Engineering[J]. 2021, 31(11): 1561-1581, https://worldscientific.com/doi/10.1142/S021819402140012X.[20] 严明玉, 李涵, 邓磊, 胡杏, 叶笑春, 张志敏, 范东睿, 谢源. 图计算加速架构综述. 计算机研究与发展[J]. 2021, 58(4): 862-887, http://lib.cqvip.com/Qikan/Article/Detail?id=7104271412.[21] Li, Yi, Wu, Meng, Ye, Xiaochun, Li, Wenming, Xue, Rui, Wang, Da, Zhang, Hao, Fan, Dongrui. An efficient scheduling algorithm for dataflow architecture using loop-pipelining. INFORMATION SCIENCES[J]. 2021, 547: 1136-1153, http://dx.doi.org/10.1016/j.ins.2020.09.029.[22] 范东睿. 数据流计算研究进展与概述. 数据与计算发展前沿. 2021, [23] 李涵, 严明玉, 吕征阳, 李文明, 叶笑春, 范东睿, 唐志敏. 图神经网络加速结构综述. 计算机研究与发展[J]. 2021, 58(6): 1204-1229, http://lib.cqvip.com/Qikan/Article/Detail?id=7104820799.[24] Li, Han, Yan, Mingyu, Yang, Xiaocheng, Deng, Lei, Li, Wenming, Ye, Xiaochun, Fan, Dongrui, Xie, Yuan. Hardware Acceleration for GCNs via Bidirectional Fusion. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2021, 20(1): [25] Chenglong Zhang, Huawei Cao, Xiaochun Ye, Guobo Wang, Qinfen Hao, Dongrui Fan. Highly Efficient Breadth-First Search on CPU-based Single-node System. INTERNATIONAL JOURNAL OF HYDROGEN ENERGYnull. 2021, 2066-2071, [26] Dongrui Fan. Scalable and Efficient Graph Traversal on High-Throughput Cluster. CCF Transaction on High Performance Computing (CCF THPC). 2021, [27] 吴欣欣, 欧焱, 李文明, 王达, 张浩, 范东睿. 基于粗粒度数据流架构的稀疏卷积神经网络加速. 计算机研究与发展[J]. 2021, 58(7): 1504-1517, http://lib.cqvip.com/Qikan/Article/Detail?id=7105055136.[28] Cao, Dingyuan, Zhang, Mingzhe, Lu, Hang, Ye, Xiaochun, Fan, Dongrui, Che, Yuezhi, Wang, Rujia. Streamline Ring ORAM Accesses through Spatial and Temporal Optimization. 2021 27TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2021)null. 2021, 14-25, [29] 李灵枝, 胡九川, 叶笑春, 范东睿, 严龙. 渗透缓存命中率诱导的缓存区域动态分配机制研究. 软件导刊[J]. 2020, 19(4): 1-8, http://lib.cqvip.com/Qikan/Article/Detail?id=7101773847.[30] Rongliang Fu, Zhimin Zhang, Guangming Tang, Junying Huang, Xiaochun Ye, Dongrui Fan, Ninghui Sun. Design Automation Methodology from RTL to Gate-level Netlist and Schematic for RSFQ Logic Circuits. Great Lakes Symposium on VLSInull. 2020, [31] Qu, PeiYao, Tang, GuangMing, Yang, JiaHong, Ye, XiaoChun, Fan, DongRui, Zhang, ZhiMin, Sun, NingHui. Design of an 8-bit Bit-Parallel RSFQ Microprocessor. IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY[J]. 2020, 30(7): [32] Yang, JiaHong, Tang, GuangMing, Zheng, XiangYu, Ye, XiaoChun, Fan, DongRui, Zhang, ZhiMin, Sun, NingHui. Distributed Self-Clock: A Suitable Architecture for SFQ Circuits. IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY[J]. 2020, 30(7): http://dx.doi.org/10.1109/TASC.2020.3007175.[33] Dongrui Fan. Pixel-Semantic Revising of Position: One-Stage Object Detector with Shared Encoder-Decoder. The 27th International Conference on Neural Information Processing (ICONIP2020). 2020, [34] Wu, Xinxin, Li, Yi, Ou, Yan, Li, Wenming, Sun, Shibo, Xu, Wenxing, Fan, Dongrui, Qiu, M. Accelerating Sparse Convolutional Neural Networks Based on Dataflow Architecture. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT IInull. 2020, 12453: 14-31, [35] Dongrui Fan. Scalable and efcient graph traversal on high‑throughput cluster. CCF Transactions on High Performance Computing. 2020, [36] Tang, GuangMing, Qu, PeiYao, Zheng, XiangYu, Yang, JiaHong, Ye, XiaoChun, Fan, DongRui, Sun, NingHui. Bit-Slice Butterfly Processing Units for 64-Point RSFQ FFT Processors. IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY[J]. 2020, 30(1): https://www.webofscience.com/wos/woscc/full-record/WOS:000482590700001.[37] Yan Mingyu, Deng Lei, Hu Xing, Liang Ling, Feng Yujing, Ye Xiaochun, Zhang Zhimin, Fan Dongrui, Xie Yuan. HyGCN: A GCN Accelerator with Hybrid Architecture. 2020, http://arxiv.org/abs/2001.02514.[38] Li, Qian, Guo, Nan, Ye, Xiaochun, Fan, Dongrui, Tang, Zhimin. Video Face Recognition System: RetinaFace-mnet-faster and Secondary Search. 2020, http://arxiv.org/abs/2009.13167.[39] 范灵俊, 杨菲, 郑卫城, 洪学海, 范东睿. 构建城市“互联网+”新型基础设施发展战略研究. 中国工程科学[J]. 2020, 22(4): 106-113, http://lib.cqvip.com/Qikan/Article/Detail?id=7102599416.[40] Yan, Mingyu, Deng, Lei, Hu, Xing, Liang, Ling, Feng, Yujing, Ye, Xiaochun, Zhang, Zhimin, Fan, Dongrui, Xie, Yuan, IEEE. HyGCN: A GCN Accelerator with Hybrid Architecture. 2020 IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2020)null. 2020, 15-29, [41] Yan, Mingyu, Chen, Zhaodong, Deng, Lei, Ye, Xiaochun, Zhang, Zhimin, Fan, Dongrui, Xie, Yuan. Characterizing and Understanding GCNs on GPU. IEEE COMPUTER ARCHITECTURE LETTERS[J]. 2020, 19(1): 22-25, http://dx.doi.org/10.1109/LCA.2020.2970395.[42] Ye, Xiaochun, Tan, Xu, Wu, Meng, Feng, Yujing, Wang, Da, Zhang, Hao, Pei, Songwen, Fan, Dongrui. An efficient dataflow accelerator for scientific applications. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE[J]. 2020, 112: 580-588, http://dx.doi.org/10.1016/j.future.2020.03.023.[43] Ou, Yan, Shen, Chongfei, Feng, Yujing, Wu, Xinxin, Li, Wenming, Ye, Xiaochun, Fan, Dongrui, Qiu, M. CTA: A Critical Task Aware Scheduling Mechanism for Dataflow Architecture. ALGORITHMS AND ARCHITECTURES FOR PARALLEL PROCESSING, ICA3PP 2020, PT Inull. 2020, 12452: 61-77, [44] Hao, Qinfen, Hao, Kai, Xue, Haiyun, Han, Meng, Qi, Nan, Zhang, Kunming, Niu, Xingmao, Xiao, Limin, Fan, Dongrui, IEEE. A Chip-level Optical Interconnect for CPU. 2020 IEEE PHOTONICS CONFERENCE (IPC)null. 2020, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000612237500111.[45] 叶笑春, 李文明, 张洋, 张浩, 王达, 范东睿. 高通量众核处理器设计. 数据与计算发展前沿[J]. 2020, 2(1): 70-84, https://kns.cnki.net/KCMS/detail/detail.aspx?dbcode=CJFQ&dbname=CJFDLAST2020&filename=KYXH202001006&v=MDU4OTk4ZVgxTHV4WVM3RGgxVDNxVHJXTTFGckNVUjd1Zlp1Wm5GaXZuVUwzTkxqVFRackc0SE5ITXJvOUZZb1I=.[46] 张承龙, 曹华伟, 王国波, 郝沁汾, 张洋, 叶笑春, 范东睿. 面向高通量计算机的图算法优化技术. 计算机研究与发展[J]. 2020, 57(6): 1152-1163, http://lib.cqvip.com/Qikan/Article/Detail?id=7101851458.[47] 董荣育, 曹华伟, 叶笑春, 张园, 郝沁汾, 范东睿. Highly Efficient and GPU-Friendly Implementation of BFS on Single-node System. International Symposium on Parallel and Distributed Processing with Applications (ISPA 2017)null. 2020, https://ieeexplore.ieee.org/document/9443861.[48] Dongrui Fan. iATPG: Instruction-level Automatic Test Program Generation for Vulnerability under DVFS Attack. 2019 IEEE 25th International Symposium on On-Line Testing and Robust System Design (IOLTS). 2019, [49] 李易, 常成娟, 卢圣健, 江道忠, 范东睿, 叶笑春. 面向数据流结构的指令映射优化方法. 计算机工程与科学[J]. 2019, 41(1): 9-13, http://lib.cqvip.com/Qikan/Article/Detail?id=7001148810.[50] Dongrui Fan. A Sharing Path Awareness Scheduling Algorithm for Dataflow Architecture. HPCC. 2019, [51] 范东睿. 面向数据流结构的指令内存访存冲突优化研究. 计算机研究与发展. 2019, [52] Dongrui Fan. C-MAP: Improving the Effectiveness of Mapping Method for CGRA by Reducing NoC Congestion. HPCC 2019. 2019, [53] 欧焱, 冯煜晶, 李文明, 叶笑春, 王达, 范东睿. 面向数据流结构的指令内访存冲突优化研究. 计算机研究与发展[J]. 2019, 56(12): 2720-2732, http://lib.cqvip.com/Qikan/Article/Detail?id=7100658631.[54] Junying Huang, Jing Ye, Xiaochun Ye, Da Wang, Dongrui Fan, Huawei Li, Xiaowei Li, Zhimin Zhang. Instruction Vulnerability Test and Code Optimization against DVFS attack. 2019 IEEE INTERNATIONAL TEST CONFERENCE IN ASIA (ITC-ASIA 2019)[J]. 2019, 49-54, [55] 范东睿, 叶笑春, 包云岗, 孙凝晖. 中国高通量计算机的自主研发之路. 中国科学院院刊[J]. 2019, 648-656, http://lib.cqvip.com/Qikan/Article/Detail?id=75898988504849574854484856.[56] Zokaee, Farzaneh, Zhang, Mingzhe, Ye, Xiaochun, Fan, Dongrui, Jiang, Lei, ACM. Magma: A Monolithic 3D Vertical Heterogeneous ReRAM-based Main Memory Architecture. PROCEEDINGS OF THE 2019 56TH ACM/EDAC/IEEE DESIGN AUTOMATION CONFERENCE (DAC)null. 2019, http://dx.doi.org/10.1145/3316781.3317858.[57] 张志敏. Balancing Memory Accesses for Energy-Efficient Graph Analytics Accelerators. ISLPED. 2019, [58] Li, Wenming, Ye, Xiaochun, Wang, Da, Zhang, Hao, Tang, Zhimin, Fan, Dongrui, Sun, Ninghui. PIM-WEAVER: A High Energy-efficient, General-purpose Acceleration Architecture for String Operations in Big Data Processing. SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMS[J]. 2019, 21: 129-142, http://dx.doi.org/10.1016/j.suscom.2019.01.006.[59] Wenming Li, Xiaochun Ye, Da Wang, Hao Zhang, Zhimin Tang, Dongrui Fan, Ninghui Sun. PIM-WEAVER: A High Energy-efficient, General-purpose Acceleration Architecture for String Operations in Big Data Processing. SUSTAINABLE COMPUTING: INFORMATICS AND SYSTEMS. 2019, 21: 129-142, http://dx.doi.org/10.1016/j.suscom.2019.01.006.[60] 余世干, 唐志敏, 叶笑春, 范东睿. 基于推测机制异构多核处理器容错方法与仿真. 系统仿真学报[J]. 2019, 31(12): 2685-2695, http://lib.cqvip.com/Qikan/Article/Detail?id=7100565631.[61] Dongrui Fan. Applying CNN on a Scientific Application Accelerator Based on Dataflow Architecture. CCF Transaction on High Performance Computing (CCF THPC). 2019, [62] Gao Yan, Liu Boxiao, Guo Nan, Ye Xiaochun, Wan Fang, You Haihang, Fan Dongrui. Utilizing the Instability in Weakly Supervised Object Detection. 2019, http://arxiv.org/abs/1906.06023.[63] Yan Mingyu, Hu Xing, Li Shuangchen, Basak Abanti, Li Han, Ma Xin, Akgun Itir, Peng Yujing, Gu Peng, Deng Lei, Ye Xiaochun, Zhang Zhimin, Fan Dongrui, Xie Yuan, Assoc Comp Machinery. Alleviating Irregularity in Graph Analytics Acceleration: a Hardware/Software Co-Design Approach. MICRO'52: THE 52ND ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTUREnull. 2019, 615-628, http://dx.doi.org/10.1145/3352460.3358318.[64] Gao, Yan, Liu, Boxiao, Guo, Nan, Ye, Xiaochun, Wan, Fang, You, Haihang, Fan, Dongrui, IEEE. C-MIDN: Coupled Multiple Instance Detection Network With Segmentation Guidance for Weakly Supervised Object Detection. 2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019)null. 2019, 9833-9842, [65] 向陶然, 叶笑春, 李文明, 冯煜晶, 谭旭, 张浩, 范东睿. 基于细粒度数据流架构的稀疏神经网络全连接层加速. 计算机研究与发展[J]. 2019, 56(6): 1192-1204, http://lib.cqvip.com/Qikan/Article/Detail?id=7002192926.[66] Sun, NingHui, Bao, YunGang, Fan, DongRui. The rise of high-throughput computing. FRONTIERS OF INFORMATION TECHNOLOGY & ELECTRONIC ENGINEERING[J]. 2018, 19(10): 1245-1250, http://lib.cqvip.com/Qikan/Article/Detail?id=676786551.[67] Tang, GuangMing, Qu, PeiYao, Ye, XiaoChun, Fan, DongRui, Sun, NingHui. 32-Bit 4 x 4 Bit-Slice RSFQ Matrix Multiplier. IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY[J]. 2018, 28(7): https://www.webofscience.com/wos/woscc/full-record/WOS:000435190700001.[68] Xie, Xiaolong, Liang, Yun, Li, Xiuhong, Wu, Yudong, Sun, Guangyu, Wang, Tao, Fan, Dongrui. CRAT: Enabling Coordinated Register Allocation and Thread-Level Parallelism Optimization for GPUs. IEEE TRANSACTIONS ON COMPUTERS[J]. 2018, 67(6): 890-897, https://www.webofscience.com/wos/woscc/full-record/WOS:000431902600010.[69] Xiang Taoran, Feng Yujing, Ye Xiaochun, Tan Xu, Li Wenming, Zhu Yatao, Wu Meng, Zhang Hao, Fan Dongrui, IEEE. Accelerating CNN Algorithm with Fine-grained Dataflow Architectures. IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS)null. 2018, 243-251, http://dx.doi.org/10.1109/HPCC/SmartCity/DSS.2018.00063.[70] Feng Yujing, Li Han, Tan Xu, Ye Xiaochun, Fan Dongrui, Tang Zhimin, IEEE. Optimizing network efficiency of dataflow architectures through dynamic packet merging. 2018 NINTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC)null. 2018, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000484460900038.[71] Xu Tan, Xiao-Chun Ye, Xiao-Wei Shen, Yuan-Chao Xu, Da Wang, Lunkai Zhang, Wen-Ming Li, Dong-Rui Fan, Zhi-Min Tang. A Pipelining Loop Optimization Method for Dataflow Architecture. 计算机科学技术学报:英文版[J]. 2018, 33(1): 116-130, http://lib.cqvip.com/Qikan/Article/Detail?id=674567291.[72] Tang, GuangMing, Qu, PeiYao, Ye, XiaoChun, Fan, DongRui. Logic Design of a 16-bit Bit-Slice Arithmetic Logic Unit for 32-/64-bit RSFQ Microprocessors. IEEE TRANSACTIONS ON APPLIED SUPERCONDUCTIVITY[J]. 2018, 28(4): https://www.webofscience.com/wos/woscc/full-record/WOS:000425742900001.[73] Tan, Xu, Ye, XiaoChun, Shen, XiaoWei, Xu, YuanChao, Wang, Da, Zhang, Lunkai, Li, WenMing, Fan, DongRui, Tang, ZhiMin. A Pipelining Loop Optimization Method for Dataflow Architecture. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2018, 33(1): 116-130, http://lib.cqvip.com/Qikan/Article/Detail?id=674567291.[74] 冯煜晶, 欧焱, 叶笑春, 范东睿, 谭旭, 唐志敏. 基于网络负载特征感知的数据流指令调度机制研究. 高技术通讯[J]. 2018, 28(11): 885-898, http://lib.cqvip.com/Qikan/Article/Detail?id=7001166774.[75] Ninghui SUN, Yungang BAO, Dongrui FAN. The rise of high-throughput computing. 信息与电子工程前沿:英文版[J]. 2018, 19(10): 1245-1250, http://lib.cqvip.com/Qikan/Article/Detail?id=676786551.[76] Tan, Xu, Shen, XiaoWei, Ye, XiaoChun, Wang, Da, Fan, DongRui, Zhang, Lunkai, Li, WenMing, Zhang, ZhiMin, Tang, ZhiMin. A Non-Stop Double Buffering Mechanism for Dataflow Architecture. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2018, 33(1): 145-157, http://lib.cqvip.com/Qikan/Article/Detail?id=674567293.[77] 范东睿, 叶笑春. 众核处理器:高端计算的核心引擎. 前沿科学[J]. 2018, 12(4): 32-36, http://lib.cqvip.com/Qikan/Article/Detail?id=7001585981.[78] Li Wenming, Ye Xiaochun, Wang Da, Zhang Hao, Wu Dongdong, Zhang Zhimin, Fan Dongrui, Chen JJ, Yang LT. WEAVER: An Energy Efficient, General-Purpose Acceleration Architecture for String Operations in Big Data Applications. 2018 IEEE INT CONF ON PARALLEL & DISTRIBUTED PROCESSING WITH APPLICATIONS, UBIQUITOUS COMPUTING & COMMUNICATIONS, BIG DATA & CLOUD COMPUTING, SOCIAL COMPUTING & NETWORKING, SUSTAINABLE COMPUTING & COMMUNICATIONSnull. 2018, 47-54, [79] Feng Yujing, Xiang Taoran, Ye Xiaochun, Fan Dongrui, Wang Da, Wu Dongdong, Tang Zhimin, IEEE. Optimizing the efficiency of data transfer in dataflow architectures. IEEE 20TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS / IEEE 16TH INTERNATIONAL CONFERENCE ON SMART CITY / IEEE 4TH INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS)null. 2018, 140-149, http://dx.doi.org/10.1109/HPCC/SmartCity/DSS.2018.00050.[80] Fan, Dongrui, Li, Wenming, Ye, Xiaochun, Wang, Da, Zhang, Hao, Tang, Zhimin, Sun, Ninghui, IEEE. SmarCo: An Efficient Many-Core Processor for High-Throughput Applications in Datacenters. 2018 24TH IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA)null. 2018, 596-607, [81] Shen, XiaoWei, Ye, XiaoChun, Tan, Xu, Wang, Da, Zhang, Lunkai, Li, WenMing, Zhang, ZhiMin, Fan, DongRui, Sun, NingHui. An Efficient Network-on-Chip Router for Dataflow Architecture. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2017, 32(1): 11-25, [82] 申小伟, 叶笑春, 王达, 张浩, 王飞, 谭旭, 张志敏, 范东睿, 唐志敏, 孙凝晖. 一种面向科学计算的数据流优化方法. 计算机学报[J]. 2017, 40(9): 2181-2196, http://lib.cqvip.com/Qikan/Article/Detail?id=673042586.[83] 张洋, 李文明, 叶笑春, 王达, 范东睿, 李宏亮, 唐志敏, 孙凝晖. LFF:一种面向大数据应用的众核处理器访存公平性调度机制. 高技术通讯[J]. 2017, 27(2): 103-111, http://lib.cqvip.com/Qikan/Article/Detail?id=672300314.[84] 胡九川, 范东睿, 李丹萍, 严龙, 叶笑春. 一种支持数据渗透迁移的片上缓存模型研究. 北京交通大学学报:自然科学版[J]. 2017, 41(5): 1-9, http://lib.cqvip.com/Qikan/Article/Detail?id=674102938.[85] Dongrui Fan. An Adaptive Tuning Sparse Fast Fourier Transform. Pacific-Rim Conference on Multimedia (PCM). 2017, [86] 刘炳涛, 王达, 叶笑春, 范东睿, 张志敏, 唐志敏. 基于数据流块的空间指令调度方法. 计算机研究与发展[J]. 2017, 54(4): 750-763, http://lib.cqvip.com/Qikan/Article/Detail?id=7000192386.[87] Chu Yi, Luo Chuan, Huang Wenxuan, You Haihang, Fan Dongrui, IEEE. Hard Neighboring Variables Based Configuration Checking in Stochastic Local Search for Weighted Partial Maximum Satisfiability. 2017 IEEE 29TH INTERNATIONAL CONFERENCE ON TOOLS WITH ARTIFICIAL INTELLIGENCE (ICTAI 2017)null. 2017, 139-146, [88] Sheikh, Hafiz Fahad, Ahmad, Ishfaq, Fan, Dongrui. An Evolutionary Technique for Performance-Energy-Temperature Optimized Scheduling of Parallel Tasks on Multi-Core Processors. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS[J]. 2016, 27(3): 668-681, https://www.webofscience.com/wos/woscc/full-record/WOS:000370926400005.[89] Hu, Jiuchuan, Fan, Dongrui, Li, Danping, Yan, Long, Ye, Xiaochun, IEEE. On the Properties of Data Migration Based on Topology Pattern Keeping On Cache Hierarchy. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC)null. 2016, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000402169700007.[90] Shen Xiaowei, Ye Xiaochun, Tan Xu, Wang Da, Zhang Zhimin, Fan Dongrui, Tang Zhimin, IEEE. POSTER: An Optimization of Dataflow Architectures for Scientific Applications. 2016INTERNATIONALCONFERENCEONPARALLELARCHITECTUREANDCOMPILATIONTECHNIQUESPACTnull. 2016, 441-442, http://dx.doi.org/10.1145/2967938.2974054.[91] Qi Yuqiong, Ma Lina, Li Wenming, Ye Xiaochun, Wang Da, Fan Dongrui, Sun Ninghui, Chen J, Yang LT. ACCC: An Acceleration Mechanism for Character Operation based on Cache Computing in Big Data Applications. PROCEEDINGS OF 2016 IEEE 18TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS; IEEE 14TH INTERNATIONAL CONFERENCE ON SMART CITY; IEEE 2ND INTERNATIONAL CONFERENCE ON DATA SCIENCE AND SYSTEMS (HPCC/SMARTCITY/DSS)null. 2016, 608-615, http://dx.doi.org/10.1109/HPCC-SmartCity-DSS.2016.56.[92] Zhu Yatao, Zhang Shuai, Ye Xiaochun, Wang Da, Tan Xu, Fan Dongrui, Zhang Zhimin, Li Hongliang, IEEE. An Energy-efficient Bandwidth Allocation Method for Single-chip Heterogeneous Processor. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC)null. 2016, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000402169700033.[93] Hu Jiuchuan, Fan Dongrui, Li Danping, Yan Long, Ye Xiaochun, IEEE. A Percolation Data Migration Schema in A Hybrid Cache Hierarchy. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC)null. 2016, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000402169700006.[94] Zhu Yatao, Ye Xiaochun, Wang Da, Li Wenming, Zhang Yang, Fan Dongrui, Zhang Zhimin, Tang Zhimin, IEEE. A Framework for Energy-efficient Optimization on Multi-Cores. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC)null. 2016, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000402169700032.[95] 张洋, 王达, 叶笑春, 朱亚涛, 范东睿, 李宏亮, 谢向辉. 众核处理器片上网络的层次化全局自适应路由机制. 计算机研究与发展[J]. 2016, 53(6): 1211-1220, http://lib.cqvip.com/Qikan/Article/Detail?id=669061058.[96] Wang Fei, Wang Da, Yang Haigang, Xie Xianghui, Fan Dongrui. On-Chip Generating FPGA Test Configuration Bitstreams to Reduce Manufacturing Test Time. CHINESE JOURNAL OF ELECTRONICS[J]. 2016, 25(1): 64-70, http://lib.cqvip.com/Qikan/Article/Detail?id=667783130.[97] Shen Xiaowei, Ye Xiaochun, Tan Xu, Wang Da, Zhang Zhimin, Tang Zhimin, Fan Dongrui, IEEE. Memory Partition for SIMD in Streaming Dataflow Architectures. 2016 SEVENTH INTERNATIONAL GREEN AND SUSTAINABLE COMPUTING CONFERENCE (IGSC)null. 2016, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000402169700035.[98] 刘炳涛, 王达, 叶笑春, 张浩, 范东睿, 张志敏. 一种缓存数据流信息的处理器前端设计. 计算机研究与发展[J]. 2016, 53(6): 1221-1237, http://lib.cqvip.com/Qikan/Article/Detail?id=669061059.[99] 刘炳涛, 王达, 叶笑春, 张浩, 范东睿, 张志敏. 一种缓存数据流信息的处理器前端设计. 计算机研究与发展[J]. 2016, 53(6): 1221-1237, http://lib.cqvip.com/Qikan/Article/Detail?id=669061059.[100] 李国杰, 范东睿. 面向高通量计算的可扩展、高效能并行微结构研究立项报告. 科技创新导报[J]. 2016, 13(9): 168-168, http://lib.cqvip.com/Qikan/Article/Detail?id=669805509.[101] 李文明, 叶笑春, 张洋, 宋风龙, 王达, 唐士斌, 范东睿, 谢向辉. BDSim:面向大数据应用的组件化高可配并行模拟框架. 计算机学报[J]. 2015, 38(10): 1959-1975, http://lib.cqvip.com/Qikan/Article/Detail?id=666506311.[102] 高珂, 陈荔城, 范东睿, 刘志勇. 多核系统共享内存资源分配和管理研究. 计算机学报[J]. 2015, 38(5): 1020-1034, http://lib.cqvip.com/Qikan/Article/Detail?id=664815060.[103] Li Wenming, Fan Lingjun, Wang Zihou, Ye Xiaochun, Wang Da, Zhang Hao, Zhang Liang, Fan Dongrui, Xie Xianghui, IEEE. Thread ID Based Power Reduction Mechanism for Multi-thread Shared Set-associative Caches. 2015 SIXTH INTERNATIONAL GREEN COMPUTING CONFERENCE AND SUSTAINABLE COMPUTING CONFERENCE (IGSC)null. 2015, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000380428700018.[104] 李文明, 叶笑春, 王达, 郑方, 李宏亮, 林晗, 范东睿, 孙凝晖. MACT:高通量众核处理器离散访存请求批量处理机制. 计算机研究与发展[J]. 2015, 52(6): 1254-1265, http://lib.cqvip.com/Qikan/Article/Detail?id=665059268.[105] Li Wenming, Zhang Liang, Ye Xiaochun, Wang Da, Zhang Hao, Wang Zihou, Fan Dongrui, IEEE. A High-Density Data Path Implementation fitting for HTC Applications. 2015 SIXTH INTERNATIONAL GREEN COMPUTING CONFERENCE AND SUSTAINABLE COMPUTING CONFERENCE (IGSC)null. 2015, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000380428700059.[106] 高珂, 范东睿, 刘志勇. 一种缓解多线程访存干扰的VRB内存机制. 计算机研究与发展[J]. 2015, 52(11): 2577-2588, http://lib.cqvip.com/Qikan/Article/Detail?id=666660942.[107] 朱亚涛, 张帅, 王达, 叶笑春, 张洋, 胡九川, 张志敏, 范东睿, 李宏亮. EOFDM:一种面向众核架构的最低能耗搜索方法. 计算机研究与发展[J]. 2015, 52(6): 1303-1315, http://lib.cqvip.com/Qikan/Article/Detail?id=665059273.[108] Gupta, Sandeep K S, Fan, Dongrui. Introduction to special issue on Selected Papers from 2013 International Green Computing Conference. SUSTAINABLE COMPUTING-INFORMATICS & SYSTEMSnull. 2015, 6: 1-2, http://dx.doi.org/10.1016/j.suscom.2015.01.001.[109] 朱亚涛, 张帅, 王达, 叶笑春, 张洋, 胡九川, 张志敏, 范东睿, 李宏亮. EOFDM:一种面向众核架构的最低能耗搜索方法. 计算机研究与发展[J]. 2015, 52(6): 1303-1315, http://lib.cqvip.com/Qikan/Article/Detail?id=665059273.[110] Xie Xiaolong, Liang Yun, Li Xiuhong, Wu Yudong, Sun Guangyu, Wang Tao, Fan Dongrui, ACM. Enabling Coordinated Register Allocation and Thread-level Parallelism Optimization for GPUs. PROCEEDINGS OF THE 48TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO-48)null. 2015, 395-406, http://dx.doi.org/10.1145/2830772.2830813.[111] Sandeep K.S. Gupta, Dongrui Fan. Introduction to special issue on Selected Papers from 2013 International Green Computing Conference. SUSTAINABLECOMPUTINGINFORMATICSANDSYSTEMS. 2015, 6: 1-2, http://dx.doi.org/10.1016/j.suscom.2015.01.001.[112] 李文明, 叶笑春, 张洋, 宋风龙, 王达, 唐士斌, 范东睿, 谢向辉. BDSim:面向大数据应用的组件化高可配并行模拟框架. 计算机学报[J]. 2015, 38(10): 1959-1975, http://lib.cqvip.com/Qikan/Article/Detail?id=666506311.[113] 李文明, 叶笑春, 王达, 郑方, 李宏亮, 林晗, 范东睿, 孙凝晖. MACT:高通量众核处理器离散访存请求批量处理机制. 计算机研究与发展[J]. 2015, 52(6): 1254-1265, http://lib.cqvip.com/Qikan/Article/Detail?id=665059268.[114] 范东睿. HD-NoC:面向高通量应用的高密度片上网络实现机制. HPC-China. 2015, [115] 唐士斌, 宋风龙, 张帅, 范东睿, 刘志勇. 基于全局同步逻辑时间的访存依赖约减方法. 计算机学报[J]. 2014, 37(7): 1487-1499, http://lib.cqvip.com/Qikan/Article/Detail?id=662044928.[116] 汤旭龙, 安虹, 范东睿. 主流视频编解码软件的硬件性能分析与设计. 计算机工程[J]. 2014, 40(6): 300-305, http://lib.cqvip.com/Qikan/Article/Detail?id=50016433.[117] Chen, Zheng, Gu, Huaxi, Yang, Yintang, Fan, Dongrui. A Hierarchical Optical Network-On-Chip Using Central-Controlled Subnet and Wavelength Assignment. JOURNAL OF LIGHTWAVE TECHNOLOGY[J]. 2014, 32(5): 930-938, https://www.webofscience.com/wos/woscc/full-record/WOS:000330129500008.[118] 魏海涛, 秦明康, 于俊清, 范东睿. 一种面向众核架构的数据流编译框架. 计算机学报[J]. 2014, 37(7): 1560-1569, http://lib.cqvip.com/Qikan/Article/Detail?id=662044935.[119] Zhang, Na, Gu, Huaxi, Yang, Yintang, Fan, Dongrui. QBNoC: QoS-aware bufferless NoC architecture. MICROELECTRONICS JOURNAL[J]. 2014, 45(6): 751-758, http://dx.doi.org/10.1016/j.mejo.2014.04.015.[120] Chen, Ke, Gu, Huaxi, Yang, Yintang, Fan, Dongrui. A Novel Two-Layer Passive Optical Interconnection Network for On-Chip Communication. JOURNAL OF LIGHTWAVE TECHNOLOGY[J]. 2014, 32(9): 1770-1776, https://www.webofscience.com/wos/woscc/full-record/WOS:000334741300004.[121] Zhang Lunkai, Strukov Dmitri, Saadeldeen Hebatallah, Fan Dongrui, Zhang Mingzhe, Franklin Diana, IEEE. SpongeDirectory: Flexible Sparse Directories Utilizing Multi-Level Memristors. PROCEEDINGS OF THE 23RD INTERNATIONAL CONFERENCE ON PARALLEL ARCHITECTURES AND COMPILATION TECHNIQUES (PACT'14)null. 2014, 61-73, [122] 孙公瑾, 安虹, 范东睿. 多标准视频编码器下的运动估计评估. 计算机工程[J]. 2014, 40(4): 295-300,304, http://lib.cqvip.com/Qikan/Article/Detail?id=49246178.[123] Song, Fenglong, Tang, Shibin, Li, Wenming, Miao, Futao, Zhang, Hao, Fan, Dongrui, Liu, Zhiyong. CRANarch: A feasible processor micro-architecture for Cloud Radio Access Network. MICROPROCESSORS AND MICROSYSTEMS[J]. 2014, 38(8): 1025-1036, http://dx.doi.org/10.1016/j.micpro.2014.08.003.[124] 熊海泉, 刘志勇, 徐卫志, 唐士斌, 范东睿. VMM中Guest OS非陷入系统调用指令截获与识别. 计算机研究与发展[J]. 2014, 51(10): 2348-2359, http://lib.cqvip.com/Qikan/Article/Detail?id=662435628.[125] 张轮凯, 宋风龙, 王达, 范东睿, 孙凝晖. 提升稀疏目录缓存一致性系统性能的方法. 计算机研究与发展[J]. 2014, 51(9): 1955-1970, http://lib.cqvip.com/Qikan/Article/Detail?id=662178137.[126] Dongrui Fan. BDSim : A component-based high configurable parallel simulation framework for big-data application evaluation. CCF Bigdata2014. 2014, [127] 郑亚松, 王达, 叶笑春, 崔慧敏, 徐远超, 范东睿. MALK:一种高效处理大规模键值的MapReduce框架. 计算机研究与发展[J]. 2014, 51(12): 2711-2723, http://lib.cqvip.com/Qikan/Article/Detail?id=663245478.[128] 徐冉冉, 孟海波, 桂小琰, 申小伟, 安述倩. 面向门级网表的VLSI三模冗余加固设计. 计算机工程与科学[J]. 2014, 36(12): 2355-2360, http://lib.cqvip.com/Qikan/Article/Detail?id=663226939.[129] Song, Fenglong, Zheng, Yasong, Miao, Futao, Ye, Xiaochun, Zhang, Hao, Fan, Dongrui, Liu, Zhiyong, IEEE. Low Execution Efficiency: When General Multi-Core Processor Meets Wireless Communication Protocol. 2013 IEEE 15TH INTERNATIONAL CONFERENCE ON HIGH PERFORMANCE COMPUTING AND COMMUNICATIONS & 2013 IEEE INTERNATIONAL CONFERENCE ON EMBEDDED AND UBIQUITOUS COMPUTING (HPCC_EUC)null. 2013, 906-913, http://dx.doi.org/10.1109/HPCC.and.EUC.2013.129.[130] Zhang Shuai, Liu Zhiyong, Fan Dongrui, Song Fonglong, Zhang Mingzhe, IEEE. Energy-Performance Modeling and Optimization of Parallel Computing in On-Chip Networks. 2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013)null. 2013, 879-886, [131] Ye Xiaochun, Fan Dongrui, Sun Ninghui, Tang Shibin, Zhang Mingzhe, Zhang Hao, IEEE. SimICT: A Fast and Flexible Framework for Performance and Power Evaluation of Large-Scale Architecture. 2013IEEEINTERNATIONALSYMPOSIUMONLOWPOWERELECTRONICSANDDESIGNISLPEDnull. 2013, 273-278, http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000337238700048.[132] Wei, Haitao, Qin, Mingkang, Zhang, Weiwei, Yu, Junqing, Fan, Dongrui, Gao, Guang R. StreamTMC: Stream compilation for tiled multi-core architectures. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING[J]. 2013, 73(4): 484-494, http://dx.doi.org/10.1016/j.jpdc.2012.12.001.[133] Ding, Hui, Gu, Huaxi, Yang, Yintang, Fan, Dongrui. 3D Networks-on-Chip mapping targeting minimum signal TSVs. IEICE ELECTRONICS EXPRESS[J]. 2013, 10(18): https://www.webofscience.com/wos/woscc/full-record/WOS:000326194900004.[134] 吕慧伟, 程元, 白露, 陈明宇, 范东睿, 孙凝晖. 众核处理器和众核集群的并行模拟. 计算机研究与发展[J]. 2013, 50(5): 1110-1117, http://lib.cqvip.com/Qikan/Article/Detail?id=45617364.[135] Dongrui Fan. International Symposium on Low Power Electronics and Desig. International Symposium on Low Power Electronics and Design. 2013, [136] Zhang Mingzhe, Wang Da, Ye Xiaochun, He Liqiang, Fan Dongrui, Liu Zhiyong, IEEE. A Path-Adaptive Opto-Electronic Hybrid NoC for Chip Multi-Processor. 2013 12TH IEEE INTERNATIONAL CONFERENCE ON TRUST, SECURITY AND PRIVACY IN COMPUTING AND COMMUNICATIONS (TRUSTCOM 2013)null. 2013, 1198-1205, [137] 范涛, 刘高辉, 叶笑春, 李文明, 宋爽, 范东睿. SPARC平台模拟器源码级调试系统的研究与实现. 计算机工程与应用[J]. 2013, 49(4): 65-70, http://lib.cqvip.com/Qikan/Article/Detail?id=44810940.[138] Dongrui Fan. An Efficient Parallel Mechanism for Highly-Debuggable Multicore Simulator. International Conference on Advanced Parallel Processing Technology (APPT). 2013, [139] 张帅, 宋风龙, 王栋, 刘志勇, 范东睿. 多核结构片上网络性能-能耗分析及优化方法. 计算机学报[J]. 2013, 36(5): 988-1003, http://lib.cqvip.com/Qikan/Article/Detail?id=45850220.[140] 范灵俊, 徐远超, 施巍松, 范东睿, 娄杰. 针对组相联缓存的无效缓存路访问混合过滤机制研究. 计算机学报[J]. 2013, 36(4): 799-807, http://lib.cqvip.com/Qikan/Article/Detail?id=45976851.[141] Peng, Liu, Tan, Guangming, Kalia, Rajiv K, Nakano, Aiichiro, Vashishta, Priya, Fan, Dongrui, Zhang, Hao, Song, Fenglong. Scalability study of molecular dynamics simulation on Godson-T many-core architecture. JOURNAL OF PARALLEL AND DISTRIBUTED COMPUTING[J]. 2013, 73(11): 1469-1482, http://dx.doi.org/10.1016/j.jpdc.2012.07.007.[142] 范东睿. MALK——面向共享存储多核系统高效处理大规模键值的MapReduce框架. CCF BigData2013. 2013, [143] Cui, Huimin, Xue, Jingling, Wang, Lei, Yang, Yang, Feng, Xiaobing, Fan, Dongrui. Extendable Pattern-Oriented Optimization Directives. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION[J]. 2012, 9(3): [144] Jiao Shuai, Ienne Paolo, Ye Xiaochun, Wang Da, Fan Dongrui, Sun Ninghui, Kaklamanis C, Papatheodorou T, Spirakis PG. CRAW/P: A Workload Partition Method for the Efficient Parallel Simulation of Manycores. EURO-PAR 2012 PARALLEL PROCESSINGnull. 2012, 7484: 102-114, [145] Xu Weizhi, Liu Zhiyong, Wu Jun, Ye Xiaochun, Jiao Shuai, Wang Da, Song Fenglong, Fan Dongrui, IEEE. Auto-Tuning GEMV on Many-Core GPU. PROCEEDINGS OF THE 2012 IEEE 18TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS 2012)null. 2012, 30-36, [146] Dongrui Fan. Self-correction trace model: A full-system simulator for optical network-on-chip. Proceedings of the 2012 IEEE 26th International Parallel and Distributed Processing Symposium Workshops, IPDPSW 2012. 2012, [147] Wang Da, Zhang Lunkai, Xu Weizhi, Fan Dongrui, Wang Fei, IEEE. A SAT-Based Diagnosis Pattern Generation Method for Timing Faults in Scan Chains. 2012 IEEE INTERNATIONAL SYMPOSIUM ON CIRCUITS AND SYSTEMS (ISCAS 2012)null. 2012, 2308-2312, [148] Fan, Dongrui, Zhang, Hao, Wang, Da, Ye, Xiaochun, Song, Fenglong, Li, Guojie, Sun, Ninghui. GODSON-T: AN EFFICIENT MANY-CORE PROCESSOR EXPLORING THREAD-LEVEL PARALLELISM. IEEE MICRO[J]. 2012, 32(2): 38-47, https://www.webofscience.com/wos/woscc/full-record/WOS:000302458600007.[149] Peng, Liu, Nakano, Aiichiro, Tan, Guangming, Vashishta, Priya, Fan, Dongrui, Zhang, Hao, Kalia, Rajiv K, Song, Fenglong, ACM. Performance Analysis and Optimization of Molecular Dynamics Simulation on Godson-T Many-core Processor. PROCEEDINGS OF THE 2011 8TH ACM INTERNATIONAL CONFERENCE ON COMPUTING FRONTIERS (CF 2011)null. 2011, http://dx.doi.org/10.1145/2016604.2016643.[150] Lei Yu, Zhi Yong Liu, Dong Rui Fan, Yi Ke Ma, Feng Long Song, Xiao Chun Ye, Wei Zhi Xu. Mapping Routing Lookup Algorithm on Many-Core Architecture Based on SPM and Cache Mixed Method. APPLIED MECHANICS AND MATERIALS. 2011, 1287: [151] Dongrui Fan. Godson-T-- High-Efficient Architecture of Godson-T Many-Core Processor. HotChips. 2011, [152] Dongrui Fan. An Efficient and Flexible Task Management for Many Cores. LNCS Transactions on High-Performance Embedded Architectures and Compilers. 2011, [153] 马宜科, 常晓涛, 范东睿, 刘志勇. 混合体系结构中有状态硬件加速器的优化. 计算机学报[J]. 2011, 34(7): 1314-1322, http://lib.cqvip.com/Qikan/Article/Detail?id=38725757.[154] Da Wang, Dongrui Fan, Yu Hu. A Case Study: Low Power Design-for-Testability Features of a Multi-core Processor Godson-T. ADVANCED MATERIALS RESEARCH. 2011, 1359: [155] 焦帅, 徐卫志, 唐士斌, 范东睿, 孙凝晖. PartitionSim:一个面向众核结构的并行模拟器. 计算机学报[J]. 2011, 34(11): 2084-2092, http://lib.cqvip.com/Qikan/Article/Detail?id=40083654.[156] 范灵俊, 颜成钢, 宋风龙, 马宜科, 范东睿. H.264去块滤波算法在众核结构上的并行优化. 小型微型计算机系统[J]. 2011, 32(11): 2263-2267, http://lib.cqvip.com/Qikan/Article/Detail?id=39785223.[157] Lei Yu, Zhi Yong Liu, Dong Rui Fan, Yike Ma, Feng Long Song, Xiao Chun Ye, Wei Zhi Xu. Study on the Mapping of Streaming Application on Many-Core Architecture. APPLIED MECHANICS AND MATERIALS. 2011, 1287: [158] Fan, DongRui, Li, XiaoWei, Li, GuoJie. New Methodologies for Parallel Architecture. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2011, 26(4): 578-587, http://lib.cqvip.com/Qikan/Article/Detail?id=38447509.[159] Peng Liu, Tan Guangming, Kalia Rajiv K, Nakano Aiichiro, Vashishta Priya, Fang Dongrui, Sun Ninghui, Guarracino MR, Vivien F, Traff JL, Cannataro M, Danelutto M, Hast A, Perla F, Knupfer A, DiMartino B, Alexander M. Preliminary Investigation of Accelerating Molecular Dynamics Simulation on Godson-T Many-Core Processor. EURO-PAR 2010 PARALLEL PROCESSING WORKSHOPSnull. 2011, 6586: 349-356, [160] Cui, Huimin, Xue, Jingling, Wang, Lei, Yang, Yang, Feng, Xiaobing, Fan, Dongrui, IEEE. Extendable Pattern-Oriented Optimization Directives. 2011 9TH ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON CODE GENERATION AND OPTIMIZATION (CGO)null. 2011, 107-118, [161] Dongrui Fan. Optimizing web browser on many-core architectures. 2011, [162] Dongrui Fan. Thread Owned Block Cache: Managing Latency in Many-Core Architecture. International Conference on Parallel Computing (Euro-Par). 2010, [163] 包尔固德, 李伟生, 范东睿, 杨扬, 马啸宇. Godson-T众核体系结构上的Broadcast性能优化. 计算机研究与发展[J]. 2010, 524-531, http://lib.cqvip.com/Qikan/Article/Detail?id=33116075.[164] Silvano, Cristina, Fornaciari, William, Palermo, Gianluca, Zaccaria, Vittorio, Castro, Fabrizio, Martinez, Marcos, Bocchio, Sara, Zafalon, Roberto, Avasare, Prabhat, Vanmeerbeeck, Geert, YkmanCouvreur, Chantal, Wouters, Maryse, Kavka, Carlos, Onesti, Luka, Turco, Alessandro, Bondi, Umberto, Mariani, Giovanni, Posadas, Hector, Villar, Eugenio, Wu, Chris, Fan Dongrui, Hao, Zhang, Tang Shibin, IEEE Comp Soc. MULTICUBE: Multi-Objective Design Space Exploration of Multi-Core Architectures. IEEE ANNUAL SYMPOSIUM ON VLSI (ISVLSI 2010)null. 2010, 488-493, [165] Cui, HuiMin, Wang, Lei, Fan, DongRui, Feng, XiaoBing. Landing Stencil Code on Godson-T. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2010, 25(4): 886-894, http://lib.cqvip.com/Qikan/Article/Detail?id=34470262.[166] 叶笑春, 林伟, 范东睿, 张浩. 蛋白质序列比对算法在众核结构上的并行优化. 软件学报[J]. 2010, 3094-3105, http://lib.cqvip.com/Qikan/Article/Detail?id=36056005.[167] Dongrui Fan. High Performance Comparison-Based Sorting Algorithm on Many-Core GPUs. International Parallel and Distributed Processing Symposium (IPDPS). 2010, [168] 崔慧敏, 王蕾, 范东睿, 冯晓兵. Landing Stencil Code on Godson-T. 计算机科学技术学报(英文版)[J]. 2010, 886-894, http://lib.cqvip.com/Qikan/Article/Detail?id=34470262.[169] 徐卫志, 宋风龙, 范东睿, 余磊, 张帅, 刘志勇. 众核处理器片上同步机制和评估方法研究. 计算机学报[J]. 2010, 1777-1787, http://lib.cqvip.com/Qikan/Article/Detail?id=35344799.[170] Dongrui Fan. Efficient Address Mapping of Shared Cache for On-Chip Many-Core Architecture. International Conference on Parallel Computing (Euro-Par). 2010, [171] Dongrui Fan. P-GAS: Parallelizing a cycle-accurate event-driven many-core processor simulator using parallel discrete event simulation. 2010, [172] Dongrui Fan. GVE: Godson-T verification engine for many-core architecture rapid prototyping and debugging. 2010, [173] Dongrui Fan. Minimal Multi-Threading: Finding and Removing Redundant Instructions in Multi-Threaded Processors. International Symposium on Microarchitecture (Micro). 2010, [174] Yu Lei, Liu Zhiyong, Fan Dongrui, Song Fenglong, Zhang Junchao, Yuan Nan, IEEE COMPUTER SOC. Study on Fine-grained Synchronization in Many-Core Architecture. SNPD 2009: 10TH ACIS INTERNATIONAL CONFERENCE ON SOFTWARE ENGINEERING, ARTIFICIAL INTELLIGENCES, NETWORKING AND PARALLEL DISTRIBUTED COMPUTING, PROCEEDINGSnull. 2009, 524-529, http://dx.doi.org/10.1109/SNPD.2009.61.[175] Dongrui Fan. Evaluation method of synchronization for shared-memory on-chip many-core processor. Proceedings - 2009 IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2009. 2009, [176] Yuan Nan, Zhou Yongbin, Tan Guangming, Zhang Junchao, Fan Dongrui, Sips H, Epema D, Lin HX. High Performance Matrix Multiplication on Many Cores. EURO-PAR 2009: PARALLEL PROCESSING, PROCEEDINGSnull. 2009, 5704: 948-959, [177] Dongrui Fan. Design of new hash mapping functions. 2009, [178] Dongrui Fan. GFFC: The global feedback based flow control in the NoC design for many-core processor. NPC 2009 - 6th International Conference on Network and Parallel Computing. 2009, [179] DongRui Fan, Nan Yuan, JunChao Zhang, YongBin Zhou, Wei Lin, FengLong Song, XiaoChun Ye, He Huang, Lei Yu, GuoPing Long, Hao Zhang, Lei Liu. Godson-T: An Efficient Many-Core Architecture for Parallel Program Executions. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY,[J]. 2009, 24(6): 1061-1073, https://www.webofscience.com/wos/woscc/full-record/WOS:000271535700008.[180] Fan, DongRui, Yuan, Nan, Zhang, JunChao, Zhou, YongBin, Lin, Wei, Song, FengLong, Ye, XiaoChun, Huang, He, Yu, Lei, Long, GuoPing, Zhang, Hao, Liu, Lei. Godson-T: An Efficient Many-Core Architecture for Parallel Program Executions. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2009, 24(6): 1061-1073, http://lib.cqvip.com/Qikan/Article/Detail?id=32022578.[181] Dongrui Fan. A fast linear-space sequence alignment algorithm with dynamic parallelization framework. Proceedings - IEEE 9th International Conference on Computer and Information Technology, CIT 2009. 2009, [182] Dongrui Fan. A synchronization-based alternative to directory protocol. Proceedings - 2009 IEEE International Symposium on Parallel and Distributed Processing with Applications, ISPA 2009. 2009, [183] Long, Guoping, Fan, Dongrui, Zhang, Junchao. Architectural Support for Cilk Computations on Many-core Architectures. ACM SIGPLAN NOTICES[J]. 2009, 44(4): 285-286, https://www.webofscience.com/wos/woscc/full-record/WOS:000272014600032.[184] Dongrui Fan. A low-complexity synchronization based cache coherence solution for many cores. Proceedings - IEEE 9th International Conference on Computer and Information Technology, CIT 2009. 2009, [185] Dongrui Fan. Software and hardware cooperate for 1-D FFT algorithm optimization on multicore processors. Proceedings - IEEE 9th International Conference on Computer and Information Technology, CIT 2009. 2009, [186] 龙国平, 范东睿. LU分解在Godson—Tv1众核体系结构上的并行化研究. 计算机学报[J]. 2009, 2157-2167, http://lib.cqvip.com/Qikan/Article/Detail?id=32080304.[187] 龙国平, 范东睿. LU分解在Godson-Tvl众核体系结构上的半行化研究. 计算机学报[J]. 2009, 32(11): 2157-2167, http://dx.doi.org/10.3724/SP.J.1016.2009.02157.[188] Dongrui Fan. Characterizing and understanding the bandwidth behavior of workloads on multi-core processors. Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics). 2009, [189] 宋风龙, 刘志勇, 范东睿, 张军超, 余磊. 一种片上众核结构共享Cache动态隐式隔离机制研究. 计算机学报[J]. 2009, 1896-1904, http://lib.cqvip.com/Qikan/Article/Detail?id=31781012.[190] 张浩, 林伟, 周永彬, 叶笑春, 范东睿. 通用处理器的高带宽访存流水线研究. 计算机学报[J]. 2009, 142-151, http://lib.cqvip.com/Qikan/Article/Detail?id=29336464.[191] Zhou Yongbin, Zhang Junchao, Zhang Shuai, Yuan Nan, Fan Dongrui, Liao XF, Jin H, Zheng R, Zou DQ. Data Management: The Spirit to Pursuit Peak Performance on Many-Core Processor. 2009 IEEE INTERNATIONAL SYMPOSIUM ON PARALLEL AND DISTRIBUTED PROCESSING WITH APPLICATIONS, PROCEEDINGSnull. 2009, 559-564, http://dx.doi.org/10.1109/ISPA.2009.22.[192] Dongrui Fan. A Performance Model of Dense Matrix Operations on Many-core Architectures. International Conference on Parallel Computing (Euro-Par). 2008, [193] 袁楠, 范东睿. 高性能代价比的两层关联间接转移预测器设计. 计算机学报[J]. 2008, 31(11): 1898-1906, http://lib.cqvip.com/Qikan/Article/Detail?id=28668923.[194] 段振中, 范东睿. JTAG调试通信接口的软件模拟. 微电子学与计算机[J]. 2008, 25(2): 157-159, http://lib.cqvip.com/Qikan/Article/Detail?id=26550273.[195] 龙国平, 张军超, 范东睿. 众核体系结构对Cilk语言的硬件支持及评测研究. 计算机学报[J]. 2008, 31(11): 1975-1985, http://lib.cqvip.com/Qikan/Article/Detail?id=28668931.[196] 许彤, 王朋宇, 黄海林, 范东睿, 朱鹏飞, 郑保建, 曹非. 嵌入式处理器在片调试功能的验证. 计算机辅助设计与图形学学报[J]. 2007, 19(4): 502-507, http://lib.cqvip.com/Qikan/Article/Detail?id=24260721.[197] 范东睿, 黄海林, 唐志敏. 嵌入式处理器TLB设计方法研究. 计算机学报[J]. 2006, 29(1): 73-80, http://lib.cqvip.com/Qikan/Article/Detail?id=21072974.[198] 黄海林, 范东睿, 许彤, 唐志敏. 嵌入式处理器中访存部件的低功耗设计研究. 计算机学报[J]. 2006, 29(5): 815-821, http://lib.cqvip.com/Qikan/Article/Detail?id=21884374.[199] 黄海林, 许彤, 范东睿, 唐志敏. 嵌入式处理器中降低Cache缺失代价设计方法研究. 小型微型计算机系统[J]. 2006, 27(11): 2077-2081, http://dx.doi.org/10.3969/j.issn.1000-1220.2006.11.019.[200] 黄海林, 范东睿, 许彤, 朱鹏飞, 郑保建, 曹非, 陈亮. 嵌入式处理器在片调试功能的设计与实现. 计算机辅助设计与图形学学报[J]. 2006, 18(7): 1005-1010, http://lib.cqvip.com/Qikan/Article/Detail?id=22439361.[201] 常晓涛, 范东睿, 韩银和, 张志敏. 应用输入向量控制技术降低漏电功耗的快速算法. 计算机研究与发展[J]. 2006, 43(5): 946-952, http://lib.cqvip.com/Qikan/Article/Detail?id=21816504.[202] 范东睿. 嵌入式处理器中TLB 设计方法研究. 计算机学报,. 2006, [203] Dongrui Fan. An Energy Efficient TLB Design Methodology. International Symposium on Low Power Electronics and Design (ISLPED). 2005, [204] Fan, DR, Yang, HB, Gao, GR, Zhao, RC. Evaluation and choice of various branch predictors for low-power embedded processor. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2003, 18(6): 833-838, http://lib.cqvip.com/Qikan/Article/Detail?id=8906949.[205] 蒋敬旗, 周旭, 李文, 范东睿. 系统芯片中低功耗测试的几种方法. 微电子学与计算机[J]. 2002, 19(10): 20-23, http://lib.cqvip.com/Qikan/Article/Detail?id=6962753.[206] 李文, 周旭, 范东睿, 蒋敬旗. 可测试性设计中的功耗优化技术. 贵州工业大学学报:自然科学版[J]. 2002, 31(4): 1-7, http://lib.cqvip.com/Qikan/Article/Detail?id=6763121.