发表论文
[1] 陈岳涛, 邱柯妮, 陈莉, 贾海鹏, 张云泉, 肖利民, 刘磊. Smart Scheduler: an Adaptive NVM-Aware Thread Scheduling Approach on NUMA Systems. CCF Transactions on High Performance Computing (THPC)[J]. 2022, [2] Shang, Honghui, Duan, Xiaohui, Li, Fang, Zhang, Libo, Xu, Zhiqian, Liu, Kan, Luo, Haiwen, Ji, Yingrui, Zhao, Wenxuan, Xue, Wei, Chen, Li, Zhang, Yunquan. Many-core acceleration of the first-principles all-electron quantum perturbation calculations. COMPUTER PHYSICS COMMUNICATIONS[J]. 2021, 267: http://dx.doi.org/10.1016/j.cpc.2021.108045.[3] Li Chen, Shenglin Tang, You Fu, Xiran Gao, Jie Guo, Shangzhi Jiang. AceMesh: A Structured Data Driven Programming Language for High Performance Computing. CCF Transactions on High Performance Computing[J]. 2020, [4] 姜尚志, 唐生林, 高希然, 花嵘, 陈莉, 刘颖. “神威·太湖之光”上Tend_lin应用的并行优化研究. 计算机工程与科学[J]. 2020, 42(10): 1842-1851, http://lib.cqvip.com/Qikan/Article/Detail?id=7103095078.[5] Shengjie Yang, Xinyu Li, Xinglei Dou, Xiaoli Gong, Hao Liu, Li Chen, Lei Liu. Monitoring Memory Behaviors and Mitigating NUMA Drawbacks on Tiered NVM Systems. The 17th IFIP Intl. Conf. on Network and Par. Computing[J]. 2020, [6] 郭杰, 高希然, 陈莉, 傅游, 刘颖. 用数据驱动的编程模型并行多重网格应用. 计算机科学[J]. 2020, 47(8): 32-40, http://lib.cqvip.com/Qikan/Article/Detail?id=7102493869.[7] Shengjie Yang, 李新宇, 窦星磊, 刘磊, 陈莉. 监控分层NVM系统中的内存行为以减轻NUMA影响. The 17th IFIP Intl. Conf. on Network and Par. Computingnull. 2020, https://link.springer.com/chapter/10.1007/978-3-030-79478-1_33.[8] 陈莉, 唐生林, 刘艳娜. 一个面向任务图并行程序的错误检查工具. 计算机科学[J]. 2017, 44(3): 38-41, http://lib.cqvip.com/Qikan/Article/Detail?id=671506025.[9] 刘颖, 吕方, 王蕾, 陈莉, 崔慧敏, 冯晓兵. 异构并行编程模型研究与进展. 软件学报[J]. 2014, 25(7): 1459-1475, http://lib.cqvip.com/Qikan/Article/Detail?id=50166787.[10] 刘雷, 李晶, 陈莉, 冯晓兵. 基于进程投机并行的运行时系统设计与优化. 计算机工程[J]. 2014, 40(3): 99-102,112, http://lib.cqvip.com/Qikan/Article/Detail?id=48839168.[11] 李恒杰, 何文婷, 陈莉, 刘雷, 吴承勇. 支持算法组件自动替换的编程范式及编译框架. 高技术通讯[J]. 2013, 23(11): 1131-1138, http://lib.cqvip.com/Qikan/Article/Detail?id=48009078.[12] 王蕾, 崔慧敏, 陈莉, 冯晓兵. 任务并行编程模型研究与进展. 软件学报[J]. 2013, 24(1): 77-90, http://lib.cqvip.com/Qikan/Article/Detail?id=44331446.[13] 陈莉, 寿宝江, 侯雄辉, 黄磊. A compiler-assisted runtime-prefetching scheme for heterogeneous platforms. Proceedings of the 8th International Workshop on OpenMP[J]. 2012, [14] 徐世雄, 陈莉. Shared work list: hacking amorphous data parallelism in UPC. 2012 International Workshop on Programming Models and Applications for Multicores and Manycores (in conjunction with PPoPP2012)[J]. 2012, https://dlnext.acm.org/doi/abs/10.1145/2141702.2141716.[15] Chen Li, Liu Lei, Tang Shenglin, Huang Lei, Jing Zheng, Xu Shixiong, Zhang Dingfei, Shou Baojiang, Cooper K, MellorCrummey J, Sarkar V. Unified Parallel C for GPU Clusters: Language Extensions and Compiler Implementation. LANGUAGES AND COMPILERS FOR PARALLEL COMPUTINGnull. 2011, 6548: 151-+, [16] Han Dongni, Xu Shixiong, Chen Li, Huang Lei, IEEE. PADS: A Pattern-Driven Stencil Compiler-Based Tool for Reuse of Optimizations on GPGPUs. 2011 IEEE 17TH INTERNATIONAL CONFERENCE ON PARALLEL AND DISTRIBUTED SYSTEMS (ICPADS)[J]. 2011, 308-315, [17] 徐世雄, 韩冬妮, 陈莉. Computation Pattern Driven Reuse of Mannul Optimizations for GPGPUs. 12THINTERNATIONALCONFERENCEONPARALLELANDDISTRIBUTEDCOMPUTINGAPPLICATIONSANDTECHNOLOGIES[J]. 2011, https://dl.acm.org/doi/10.1109/PDCAT.2011.30.[18] 米伟, 李玉祥, 陈莉, 冯晓兵, 张兆庆. 带类型恢复的编译器源源翻译技术. 计算机研究与发展[J]. 2010, 1145-1155, http://lib.cqvip.com/Qikan/Article/Detail?id=34504080.[19] 卢兴敬, 商磊, 陈莉. POM:一个MPI程序的进程优化映射工具. 计算机工程与科学[J]. 2009, 31(A01): 201-205, http://lib.cqvip.com/Qikan/Article/Detail?id=32021597.[20] 李玉祥, 施慧, 陈莉. 面向非多媒体程序的SIMD向量化算法的研究及改进. 小型微型计算机系统[J]. 2009, 1927-1935, http://lib.cqvip.com/Qikan/Article/Detail?id=31680206.[21] Mi, Wei, Feng, XiaoBing, Jia, YaoCang, Chen, Li, Xue, JingLing. PARBLO: Page-Allocation-Based DRAM Row Buffer Locality Optimization. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2009, 24(6): 1086-1097, http://lib.cqvip.com/Qikan/Article/Detail?id=32022580.[22] 李玉祥, 施慧, 陈莉. 面向向量化的局部数据重组. 小型微型计算机系统[J]. 2009, 1528-1534, http://lib.cqvip.com/Qikan/Article/Detail?id=31169422.[23] 刘雷, 陈莉, 冯晓兵. Global loop tiling for distributed memory systems. Euro-Par[J]. 2008, [24] Liu Lei, Chen Li, Wu Cheng Yong, Feng Xiaobing, Luque E, Margalef T, Benitez D. Global tiling for communication minimal parallelization on distributed memory systems. EURO-PAR 2008 PARALLEL PROCESSING, PROCEEDINGSnull. 2008, 5168: 382-391, [25] 刘雷, 张定飞, 李恒杰, 陈莉. Automatic Implementation of Multi-partitioning Using Global Tiling. 14th IEEE International Conference on Parallel and Distributed Systems[J]. 2008, [26] 王轶然, 陈莉, 冯晓兵. 全局部分重复计算划分. 计算机研究与发展[J]. 2006, 2158-2165, [27] Feng, XB, Chen, L, Wang, YR, An, XM, Ma, L, Sang, CL, Zhang, ZQ. Integrating parallelizing compilation technologies for SMP clusters. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY[J]. 2005, 20(1): 125-133, http://lib.cqvip.com/Qikan/Article/Detail?id=11714278.[28] 马琳, 陈莉, 冯晓兵. 基于动态profiling技术的流水粒度调优. 计算机研究与发展[J]. 2005, 42(6): 1065-1072, http://lib.cqvip.com/Qikan/Article/Detail?id=15707303.[29] 王轶然, 陈莉, 张兆庆. Global Partial Replicate Computation Partitioning. International Conference on Parallel Processing[J]. 2004, https://ieeexplore.ieee.org/document/1327910/authors#authors.[30] 陈莉, 张兆庆, 冯晓兵. 分布存储系统中优化通信的冗余计算分割. 计算机学报[J]. 2003, 26(2): 180-187, http://lib.cqvip.com/Qikan/Article/Detail?id=7412034.[31] 陈莉, 张兆庆, 冯晓兵. 分布内存系统中节点间软流水优化技术. 计算机科学[J]. 2002, 29(11): 24-28, http://lib.cqvip.com/Qikan/Article/Detail?id=7768436.