Publications
[1] Wang, Peng, Yu, Zhibin. RayBench: An Advanced NVIDIA-Centric GPU Rendering Benchmark Suite for Optimal Performance Analysis. ELECTRONICS. 2023, 12(19). http://dx.doi.org/10.3390/electronics12194124
[2] 陈九屹, 陈玲, 喻之斌. Accelerating path tracing rendering with Multi-GPU in Blender cycles. ICACT 2023. 2023.
[3] 李乐乐, 刘江佾, 樊建平, 钱学海, 黄铠, 钟叶青, 喻之斌. SOCA-DOM: A Mobile System-on-Chip Array System for Analyzing Big Data on the Move. Journal of Computer Science and Technology. 2022.
[4] 辛锦瀚, Kai Hwang, 喻之斌. LOCAT: Low-Overhead Online Configuration Auto-Tuning of Spark SQL Applications. ACM SIGMOD CONFERENCE 2022. 2022.
[5] Guo, Yijin, Shan, Huasong, Huang, Shixin, Hwang, Kai, Fan, Jianping, Yu, Zhibin. GML: Efficiently Auto-Tuning Flink's Configurations Via Guided Machine Learning. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS. 2021, 32(12): 2921-2935.
[6] Bo Dong, Zheng Wang, Wenxuan Chen, Chao Chen, Yongkui Yang, Zhibin Yu. OR-ML: Enhancing Reliability for Machine Learning Accelerator with Opportunistic Redundancy. Design, Automation & Test in Europe Conference & Exhibition (DATE). 2021.
[7] 贝振东, Nam Sung Kim, Kai Hwang, 喻之斌. OSC: An Online Self-Configuring Big Data Framework for Optimization of QoS. IEEE Transactions on Computers. 2021.
[8] Yin, Yan, Wu, Junmin, Zhou, Xu, Eeckhout, Lieven, Qouneh, Amer, Li, Tao, Yu, Zhibin. COPA: Highly Cost-Effective Power Back-Up for Green Datacenters. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS. 2020, 31(4): 967-980. http://dx.doi.org/10.1109/TPDS.2019.2948336
[9] 喻之斌. Characterizing Blockchain Systems at the Micro-architecture Level through Machine Learning and Fuzzy Sets. International Symposium on High-Performance Computer Architecture. 2020.
[10] Zhu, Liang, Chen, Chao, Su, Zihao, Chen, Weiguang, Li, Tao, Yu, Zhibin, IEEE. BBS: Micro-architecture Benchmarking Blockchain Systems through Machine Learning and Fuzzy Set. 2020 IEEE INTERNATIONAL SYMPOSIUM ON HIGH PERFORMANCE COMPUTER ARCHITECTURE (HPCA 2020). 2020, 411-423.
[11] Gao, Lan, Xu, Yunlong, Wang, Rui, Luan, Zhongzhi, Yu, Zhibin, Qian, Depei. Thread-Level Locking for SIMT Architectures. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS. 2020, 31(5): 1121-1136. https://www.webofscience.com/wos/woscc/full-record/WOS:000526526100009
[12] 王玉钊, 李乐乐, You Wu, Junqing Yu, 喻之斌, 钱学海. A Spatio-Temporally Shared Task Scheduling Abstraction. Proceedings of the International Symposium on Computer Architecture (ISCA). 2019.
[13] Zhao, Xia, Adileh, Almutaz, Yu, Zhibin, Wang, Zhiying, Jaleel, Aamer, Eeckhout, Lieven, ACM. Adaptive Memory-Side Last-Level GPU Caching. PROCEEDINGS OF THE 2019 46TH INTERNATIONAL SYMPOSIUM ON COMPUTER ARCHITECTURE (ISCA '19). 2019, 411-423. http://dx.doi.org/10.1145/3307650.3322235
[14] Liu, Qixiao, Chen, Zhifeng, Yu, Zhibin. MiC: Multi-level Characterization and Optimization of GPGPU Kernels. ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS. 2019, 15(3). https://www.webofscience.com/wos/woscc/full-record/WOS:000495421500003
[15] 尹燕, Junmin Wu, Lieven Eeckhout, Amer Qouneh, 喻之斌. A Highly Cost-Effective Power Backup Scheme for Green Power Centers. IEEE Transactions on Parallel and Distributed Systems (TPDS). 2019.
[16] Li Lele, Yu Zhibin, Assoc Comp Machinery. SMHC: A Synthetic Metric for Heterogeneous Resources in Cloud Computing. ICBDC 2019: PROCEEDINGS OF 2019 4TH INTERNATIONAL CONFERENCE ON BIG DATA AND COMPUTING. 2019, 97-101. http://dx.doi.org/10.1145/3335484.3335519
[17] 李诗逸, 古亮, 喻之斌. End-to-End Verification Based on a Stacked Distributed File System. Journal of Integration Technology (集成技术). 2019, 13-25. http://lib.cqvip.com/Qikan/Article/Detail?id=74677473504849574853484851
[18] Yu Zhibin, Bei Zhendong, Qian Xuehai. Datasize-Aware High Dimensional Configurations Auto-Tuning of In-Memory Cluster Computing. ACM SIGPLAN NOTICES. 2018, 53(2): 564-577.
[19] Yu, Zhibin, Wang, Jing, Eeckhout, Lieven, Xu, Chengzhong. QIG: Quantifying the Importance and Interaction of GPGPU Architecture Parameters. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS. 2018, 37(6): 1211-1224. http://dx.doi.org/10.1109/TCAD.2017.2698026
[20] 吕依蓉, Sun Bin, Luo Qinyi, Wang Jing, Yu Zhibin, Qian Xuehai, IEEE. CounterMiner: Mining Big Performance Data from Hardware Counters. 2018 51ST ANNUAL IEEE/ACM INTERNATIONAL SYMPOSIUM ON MICROARCHITECTURE (MICRO). 2018, 613-626. http://dx.doi.org/10.1109/MICRO.2018.00056
[21] Bei, Zhendong, Yu, Zhibin, Luo, Ni, Jiang, Chuntao, Xu, Chengzhong, Feng, Shengzhong. Configuring in-memory cluster computing using random forest. FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE. 2018, 79: 1-15. http://dx.doi.org/10.1016/j.future.2017.08.011
[22] Yu, Zhibin, Xiong, Wen, Eeckhout, Lieven, Bei, Zhendong, Mendelson, Avi, Xu, Chengzhong. MIA: Metric Importance Analysis for Big Data Workload Characterization. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS. 2018, 29(6): 1371-1384. https://www.webofscience.com/wos/woscc/full-record/WOS:000432007000013
[23] Liu Qixiao, Yu Zhibin, ACM. The Elasticity and Plasticity in Semi-Containerized Co-locating Cloud Workload: a View from Alibaba Trace. PROCEEDINGS OF THE 2018 ACM SYMPOSIUM ON CLOUD COMPUTING (SOCC '18). 2018, 347-360. http://dx.doi.org/10.1145/3267809.3267830
[24] 喻之斌. Data-Aware High-Dimensional Configuration Parameter Optimization for In-Memory Big Data. Proceedings of the 23rd ACM International Conference on Architectural Support for Programming Languages and Operating Systems (ASPLOS). 2018.
[25] 廖小飞, 沈立, 喻之斌, 陈海波. Trends in Compilation Technology: From General-Purpose Processors to Specialized Accelerators. Frontier Science (前沿科学). 2018, 12(4): 76-81. http://lib.cqvip.com/Qikan/Article/Detail?id=7001585990
[26] Liu Yuxi, Zhao Xia, Yu Zhibin, Wang Zhenlin, Wang Xiaolin, Luo Yingwei, Eeckhout Lieven, IEEE. BACM: Barrier-Aware Cache Management for Irregular Memory-Intensive GPGPU Workloads. 2017 IEEE 35TH INTERNATIONAL CONFERENCE ON COMPUTER DESIGN (ICCD). 2017, 633-640.
[27] Vijay Janapa Reddi, Zhibin Yu, Jing Wang, Lieven Eeckhout, Chengzhong Xu, Jingwen Leng. QIG: Quantifying the Importance and Interaction of GPGPU Architecture Parameters. IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS. 2017. http://ir.siat.ac.cn:8080/handle/172644/12532
[28] Bei, Zhendong, Yu, Zhibin, Liu, Qixiao, Xu, Chengzhong, Feng, Shengzhong, Song, Shuang. MEST: A Model-Driven Efficient Searching Approach for MapReduce Self-Tuning. IEEE ACCESS. 2017, 5: 3580-3593. http://www.chinair.org.cn/handle/1471x/1747852
[29] Xiong, Wen, Bei, Zhengdong, Xu, Chengzhong, Yu, Zhibin. ATH: Auto-Tuning HBase's Configuration via Ensemble Learning. IEEE ACCESS. 2017, 5: 13157-13170. http://dx.doi.org/10.1109/ACCESS.2017.2716441
[30] Avi Mendelson, Zhibin Yu, Wen Xiong, Lieven Eeckhout, Zhengdong Bei, Chengzhong Xu. MIA: Metric Importance Analysis for Big Data Workload Characterization. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS (TPDS). 2017. http://ir.siat.ac.cn:8080/handle/172644/12535
[31] 郭宁宁, 寇喆, 杨晓西, 杨静文. Research on Performance Analysis Methods for Electric Energy Meters Based on Big Data Mining Technology. Shaanxi Electric Power (陕西电力). 2017, 45(4): 84-87. http://lib.cqvip.com/Qikan/Article/Detail?id=672002699
[32] Jia Dan, Wang Rui, Xu Chengzhong, Yu Zhibin, Gao GR, Qian DP, Gao XB, Chapman B, Chen W. QIM: Quantifying Hyperparameter Importance for Deep Learning. NETWORK AND PARALLEL COMPUTING. 2016, 9966: 180-188.
[33] 喻之斌. Two-Level Hybrid Sampled Simulation of Multithreaded Programs. ACM Transactions on Architecture and Code Optimization. 2016.
[34] Xiong, Wen, Yu, Zhibin, Eeckhout, Lieven, Bei, Zhengdong, Zhang, Fan, Xu, Chengzhong. ShenZhen transportation system (SZTS): a novel big data benchmark suite. JOURNAL OF SUPERCOMPUTING. 2016, 72(11): 4337-4364. https://www.webofscience.com/wos/woscc/full-record/WOS:000387234200017
[35] Yu Zhibin, Eeckhout Lieven, Xu Chengzhong, IEEE. Thread Similarity Matrix: Visualizing Branch Divergence in GPGPU programs. PROCEEDINGS 45TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING - ICPP 2016. 2016, 179-184.
[36] Jiang, Chuntao, Yu, Zhibin, Eeckhout, Lieven, Jin, Hai, Liao, Xiaofei, Xu, Chengzhong. Two-Level Hybrid Sampled Simulation of Multithreaded Applications. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION. 2016, 12(4). https://www.webofscience.com/wos/woscc/full-record/WOS:000367950500007
[37] Bei, Zhendong, Yu, Zhibin, Zhang, Huiling, Xiong, Wen, Xu, Chengzhong, Eeckhout, Lieven, Feng, Shengzhong. RFHOC: A Random-Forest Approach to Auto-Tuning Hadoop's Configuration. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS. 2016, 27(5): 1470-1483. http://dx.doi.org/10.1109/TPDS.2015.2449299
[38] Yu, Zhibin, Eeckhout, Lieven, Goswami, Nilanjan, Li, Tao, John, Lizy K, Jin, Hai, Xu, Chengzhong, Wu, Junmin. GPGPU-MiniBench: Accelerating GPGPU Micro-Architecture Simulation. IEEE TRANSACTIONS ON COMPUTERS. 2015, 64(11): 3153-3166. http://dx.doi.org/10.1109/TC.2015.2395427
[39] Xiong Wen, Yu Zhibin, Eeckhout Lieven, Bei Zhengdong, Zhang Fan, Xu Chengzhong, IEEE. SZTS: A Novel Big Data Transportation System Benchmark Suite. 2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP). 2015, 819-828.
[40] Jiang Chuntao, Yu Zhibin, Jin Hai, Liao Xiaofei, Eeckhout Lieven, Zeng Yonggang, Xu Chengzhong, IEEE. Shorter On-Line Warmup for Sampled Simulation of Multi-Threaded Applications. 2015 44TH INTERNATIONAL CONFERENCE ON PARALLEL PROCESSING (ICPP). 2015, 350-359.
[41] Chandio, Aftab Ahmed, Bilal, Kashif, Tziritas, Nikos, Yu, Zhibin, Jiang, Qingshan, Khan, Samee U, Xu, ChengZhong. A comparative study on resource allocation and energy efficient job scheduling strategies in large-scale parallel computing systems. CLUSTER COMPUTING-THE JOURNAL OF NETWORKS SOFTWARE TOOLS AND APPLICATIONS. 2014, 17(4): 1349-1367. https://www.webofscience.com/wos/woscc/full-record/WOS:000345077400019
[42] 熊文, 喻之斌, 须成忠. Research on Methods for Constructing Big Data Benchmark Suites. Journal of Integration Technology (集成技术). 2014, 1-9. http://lib.cqvip.com/Qikan/Article/Detail?id=661779189
[43] Xu Chengzhong. Accelerating GPGPU architecture simulation. 2013.
[44] Xiong Wen, Yu Zhibin, Bei Zhendong, Zhao Juanjuan, Zhang Fan, Zou Yubin, Bai Xue, Li Ye, Xu Chengzhong, Hu X, Lin TY, Raghavan V, Wah B, BaezaYates R, Fox G, Shahabi C, Smith M, Yang Q, Ghani R, Fan W, Lempel R, Nambiar R. A Characterization of Big Data Benchmarks. 2013 IEEE INTERNATIONAL CONFERENCE ON BIG DATA. 2013.
[45] Jiang, Chuntao, Yu, Zhibin, Jin, Hai, Xu, Chengzhong, Eeckhout, Lieven, Heirman, Wim, Carlson, Trevor E, Liao, Xiaofei. PCantorSim: Accelerating Parallel Architecture Simulation through Fractal-Based Sampling. ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION. 2013, 10(4). https://www.webofscience.com/wos/woscc/full-record/WOS:000330509300028
[46] 喻之斌. FractalMRC: An Online Cache Miss Rate Curve Generating Approach for Commodity Systems. Proceedings of IPDPS 2012 (top-tier international conference). 2013.
[47] 须成忠, 喻之斌. Research on Core Technologies and Applications of Cloud Computing. Journal of Integration Technology (集成技术). 2012, 1-3. http://lib.cqvip.com/Qikan/Article/Detail?id=3000000025
[48] 熊文, 喻之斌, 须成忠. Characterization and Performance Comparison of Several Common Distributed File Systems (in English). Journal of Integration Technology (集成技术). 2012, 58-63. http://lib.cqvip.com/Qikan/Article/Detail?id=3000000035
[49] 喻之斌. The Energy Consumption Bottleneck of Computing Systems. 10000 Scientific Problems: Information Science Volume. 2011.
[50] 喻之斌. Hierarchically Characterizing CUDA Program Behavior. Proceedings of IISWC 2011 (important international conference). 2011.
[51] 喻之斌. MT-Profiler: A Parallel Dynamic Analysis Framework based on Two-Stage Sampling. Proceedings of APPT 2011. 2011.
[52] 喻之斌. System-level Max Power (SYMPO) - A systematic approach for escalating system-level power consumption using synthetic benchmarks. Proceedings of PACT 2010 (top-tier international conference). 2010.
[53] 喻之斌. CantorSim: Simplifying Acceleration of Micro-architecture Simulation. Proceedings of MASCOTS 2010 (top-tier international conference). 2010.
[54] 喻之斌. Simple and Fast Micro-architecture Simulation: A Trisection Cantor Fractal Approach. Proceedings of SIGMETRICS 2009 (top-tier international conference). 2009.
[55] Yu Zhibin, Jin Hai, Chen Jian, John Lizy K, Bradley JT, Conrad JM, Field AJ, Harder U, Knottenbelt WJ, Riley GF. TSS: Applying Two-Stage Sampling in Micro-architecture Simulations. 2009 IEEE INTERNATIONAL SYMPOSIUM ON MODELING, ANALYSIS & SIMULATION OF COMPUTER AND TELECOMMUNICATION SYSTEMS (MASCOTS). 2009, 463-+. http://apps.webofknowledge.com/CitedFullRecord.do?product=UA&colName=WOS&SID=5CCFccWmJJRAuMzNPjj&search_mode=CitedFullRecord&isickref=WOS:000275140200045
[56] 喻之斌, 金海, 邹南海. Software Simulation Techniques for Computer Architecture. Journal of Software (软件学报). 2008, 19(4): 1051-1068. http://lib.cqvip.com/Qikan/Article/Detail?id=26886681
[57] Rui Zhang, Mingcong Song, Tao Li, Zhibin Yu, Yuting Dai, Xiaoguang Liu, Gang Wang. Democratic learning: A hardware/software co-design for lightweight blockchain-secured on-device machine learning. JOURNAL OF SYSTEMS ARCHITECTURE. http://dx.doi.org/10.1016/j.sysarc.2021.102205
[58] Barrier-Aware Warp Scheduling for Throughput Processors. http://ir.siat.ac.cn:8080/handle/172644/10317
[59] RFHOC: A Random-Forest Approach to Auto-Tuning Hadoop’s Configuration. http://ir.siat.ac.cn:8080/handle/172644/10232