发表论文
(1) AceMesh:一个面向高性能计算、结构化、数据驱动的编程语言, AceMesh: A Structured Data Driven Programming Language for High Performance Computing, CCF Transactions on High Performance Computing, 2020, 第 1 作者(2) 用数据驱动的编程模型并行多重网格应用, Parallelizing a Multigrid application using Data-Driven Programming Model, 计算机科学, 2020, 第 3 作者(3) 一个面向任务图并行程序的错误检查工具, An Error Checking Tool for DAG-based Task Parallel Programs, 计算机科学, 2017, 第 2 作者(4) 异构并行编程模型研究与进展, Research on Heterogeneous Parallel Programming Model, 软件学报, 2014, 第 4 作者(5) 任务并行编程模型研究与进展, Research on Task Parallel Programming Model, 软件学报, 2013, 第 3 作者(6) 支持算法组件自动替换的编程范式及编译框架, a programming paradigm and compiler framework for automatic replacement of algorithm components, 高技术通讯, 2013, 第 3 作者(7) 用共享工作表在UPC语言中支持无定形的数据并行性, Shared work list: hacking amorphous data parallelism in UPC, 2012 International Workshop on Programming Models and Applications for Multicores and Manycores (in conjunction with PPoPP2012) , 2012, 第 2 作者(8) 异构平台上编译辅助的运行时预取, A compiler-assisted runtime-prefetching scheme for heterogeneous platforms, Proceedings of the 8th International Workshop on OpenMP , 2012, 第 1 作者(9) PADS:基于编译技术的stencil优化工具, PADS: A Pattern-Driven Stencil Compiler-Based Tool for Reuse of Optimizations on GPGPUs, IEEE 17th International Conference on Parallel and Distributed Systems , 2011, 第 3 作者(10) 计算模式驱动的OpenMP程序的自动编译优化, Computation Pattern Driven Reuse of Mannul Optimizations for GPGPUs, 12th International Conference on Parallel and Distributed Computing, Applications and Technologies, 2011, 第 3 作者(11) GPU集群上的UPC语言扩展和编译实现, Unified Parallel C for GPU Clusters: Language Extensions and Compiler Implementation, 23rd International Workshop on Languages and Compilers for Parallel Computing, 2010, 第 1 作者(12) 带类型恢复的编译器源源翻译技术, A source-to-source translation method with type restoration in a compiler, 计算机研究与发展, 2010, 第 3 作者(13) 面向向量化的局部数据重组, Vectorization-oriented local data regrouping, 小型微型计算机系统, 2009, 第 3 作者(14) 面向非多媒体程序的SIMD向量化算法的研究及改进, Research and improvement of SIMD vectorization algorithms on non-multimedia applications, 小型微型计算机系统, 2009, 第 3 作者(15) 基于页分配面向DRAM行缓冲的局部性优化, PARBLO: Page-Allocation-Based DRAM Row Buffer Locality Optimization, Journal of Computer Science and Technology, 2009, 第 4 作者(16) 分布主存系统上的全局tiling技术, Global loop tiling for distributed memory systems, 14th International Euro-Par Conference on Parallel and Distributed Computing, 2008, 第 2 作者(17) 利用全局循环分块实现多分割, Automatic Implementation of Multi-partitioning Using Global Tiling, 14th IEEE International Conference on Parallel and Distributed Systems, 2008, 第 4 作者(18) 全局部分重复计算划分, Global partial replicated computation partitioning, 计算机研究与发展, 2006, 第 2 作者(19) 基于动态profiling技术的流水粒度调优, Tuning Pipeline granularity based on dynamic profiling framework, 计算机研究与发展, 2005, 第 2 作者(20) 针对SMP集群的并行化编译技术, Integrating Parallelizing Compilation Technologies for SMP Clusters, Journal of Computer Science and Technology , 2005, 第 2 作者(21) 全局部分冗余的计算划分技术, Global Partial Replicate Computation Partitioning, International Conference on Parallel Processing, 2004, 第 2 作者