General

JUNMIN XIAO

ASSOCIATE PROFESSOR

Institute of Computing Technology, Chinese Academy of Sciences

No.6 Kexueyuan South Road, Zhongguancun, Haidian District, Beijing, China

Work Phone: 8610-62600343 

E-mail: xiaojunmin@ict.ac.cn




Research Areas

My research focuses on high-performance parallel algorithm design for distributed deep learning and on kernel optimization for neural networks.

Education

(1) September 2007 -- July 2012, Academy of Mathematics and Systems Science, Chinese Academy of Sciences

Ph.D. in Computational Mathematics


(2) September 2003 -- July 2007, Xiangtan University

B.S. in Mathematics

Experience

   
Work Experience

(1) September 2018 -- Present, Institute of Computing Technology, Chinese Academy of Sciences

Associate Professor


(2) April 2015 -- September 2018, Institute of Computing Technology, Chinese Academy of Sciences

Assistant Professor


(3) July 2012 -- April 2015, National Astronomical Observatories, Chinese Academy of Sciences

Postdoctoral Researcher

Publications

JOURNAL ARTICLES

(1) Xun Wang, Ruibao Song, Junmin Xiao, Tong Li, and Xueqi Li, “Accelerating k-Shape Time Series Clustering Algorithm Using GPU”, IEEE Transactions on Parallel and Distributed Systems, 2023, 34(10), pp. 2718--2734.

(2) Hang Cao, Liang Yuan, He Zhang, Yunquan Zhang, Baodong Wu, Kun Li, Shigang Li, Minghua Zhang, Pengqi Lu, and Junmin Xiao, “AGCM-3DLF: Accelerating Atmospheric General Circulation Model via 3-D Parallelization and Leap-Format”, IEEE Transactions on Parallel and Distributed Systems, 2023, 34(3), pp. 766--780.

(3) Di Cai, Xuehai Hong, Junmin Xiao, and Guangming Tan, “Parallel Optimization for Large-Scale Ocean Data Assimilation”, Journal of Computer Research and Development, 2023, 60(5), pp. 1177--1190.

(4) Zhongzhe Hu, Junmin Xiao, Ninghui Sun, and Guangming Tan, “Fast and Accurate Variable Batch Size Convolution Neural Network Training on Large Scale Distributed Systems”, Concurrency and Computation: Practice and Experience, Wiley, 2022, 34(21), e7119.

(5) Junmin Xiao and Jian Peng, “Tradeoffs between Computation, Communication, and Synchronization in Stencil-collective Alternate Update”, CCF Transactions on High Performance Computing, Springer, 2019, 1(2), pp. 144--160.

(6) Junmin Xiao, Guizhao Zhang, Yanan Gao, Xuehai Hong, and Guangming Tan, “Fast Data-obtaining Algorithm for Data Assimilation with Large Data Set”, International Journal of Parallel Programming, Springer, 2019, 47, pp. 1--21.

(7) Junmin Xiao, Jun Zhang, Ting Li, and Shuhong Yang, “Dark Ribbons Propagating and Sweeping Across Extreme Ultraviolet Structures after Filament Eruptions”, The Astrophysical Journal, 2015, 805(1), pp. 25--37.

(8) Huadong Chen, Jun Zhang, Suli Ma, Shuhong Yang, Leping Li, Xin Huang, and Junmin Xiao, “Confined Flares in Solar Active Region 12192 from 2014 October 18 to 29”, The Astrophysical Journal Letters, 2015, 808, pp. L24--L31.

(9) Junmin Xiao and Qiya Hu, “Multilevel Correction for Collocation Solutions of Volterra Integral Equations with Proportional Delays”, Advances in Computational Mathematics, 2013, 39(3-4), pp. 611--644.

(10) Xingding Chen, Qiya Hu, and Junmin Xiao, “On the Enhanced Strain Finite Element Method for Incompressible Linear Elasticity”, Applied Numerical Mathematics, 2013, 72, pp. 131--142.


CONFERENCE PROCEEDINGS

(1) Mingyi Li, Junmin Xiao, Kewei Zhang, Zhiheng Lin, Chaoyang Shui, Ke Meng, Zehua Wang, Yunfei Pang, and Guangming Tan, “A Coordinated Strategy for GNN Combining Computational Graph and Operator Optimizations”, 38th ACM International Conference on Supercomputing (ICS’24), Kyoto, Japan, 2024-06-05~2024-06-07.

(2) Zhiheng Lin, Ke Meng, Chaoyang Shui, Kewei Zhang, Junmin Xiao, and Guangming Tan, “Exploiting Fine-Grained Redundancy in Set-Centric Graph Pattern Mining”, 29th ACM SIGPLAN Annual Symposium on Principles and Practice of Parallel Programming (PPoPP’24), Edinburgh, United Kingdom, 2024-03-02~2024-03-06.

(3) Junmin Xiao, Chaoyang Shui, Di Cai, Kangyu Wang, Yunfei Pang, Mingyi Li, Hui Ma, and Guangming Tan, “Adaptive Workload-Balanced Scheduling Strategy for Global Ocean Data Assimilation on Massive GPUs”, The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC’23), Denver, CO, USA, 2023-11-12~2023-11-17.

(4) Kewei Zhang, Junmin Xiao, Zhiheng Lin, Ke Meng, Chaoyang Shui, Mingyi Li, and Guangming Tan, “GraphPar: Efficient Workload-Aware Subgraph Matching System on Multiple GPUs”, IEEE 29th International Conference on Parallel and Distributed Systems (ICPADS’23), Ocean Flower Island, Hainan, China, 2023-12-17~2023-12-21. 

(5) Junmin Xiao, Yunfei Pang, Qing Xue, Chaoyang Shui, Ke Meng, Hui Ma, Mingyi Li, Xiaoyang Zhang, and Guangming Tan, “W-Cycle SVD: A Multilevel Algorithm for Batched SVD on GPUs”, The International Conference for High Performance Computing, Networking, Storage, and Analysis (SC’22), Dallas, Texas, USA, 2022-11-13~2022-11-18.

(6) Zhongzhe Hu, Junmin Xiao, Zheye Deng, Mingyi Li, Kewei Zhang, Xiaoyang Zhang, Ke Meng, Ninghui Sun, and Guangming Tan, “MegTaiChi: Dynamic Tensor-based Memory Management Optimization for DNN Training”, 36th ACM International Conference on Supercomputing (ICS’22), Virtual Event, USA, 2022-06-28~2022-06-30.

(7) Junmin Xiao, Qing Xue, Hui Ma, Xiaoyang Zhang, and Guangming Tan, “POSTER: A W-cycle Algorithm for Efficient Batched SVD on GPUs”, 27th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP’22), Seoul, Republic of Korea, 2022-04-02~2022-04-06.

(8) Xiaoyang Zhang, Junmin Xiao, and Guangming Tan, “I/O Lower Bounds for Auto-tuning of Convolutions in CNNs”, 26th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP’21), Virtual Event, Republic of Korea, 2021-02-27~2021-03-03.

(9) Xiaoyang Zhang, Junmin Xiao, and Guangming Tan, “Brief Announcement: Communication Lower Bounds of Convolutions in CNNs”, 32nd ACM Symposium on Parallelism in Algorithms and Architectures (SPAA’20), Virtual Event, USA, 2020-07-15~2020-07-17. 

(10) Junmin Xiao, Shijie Wang, Weiqiang Wan, Xuehai Hong, and Guangming Tan, “S-EnKF: Co-designing for Scalable Ensemble Kalman Filter”, 24th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming (PPoPP’19), Washington, D.C., USA, 2019-02-16~2019-02-20. 

(11) Guizhao Zhang, Junmin Xiao, Xuehai Hong, and Guangming Tan, “Fast Data-obtaining Algorithm for Data Assimilation with Large Data Set”, 16th International Conference on Network and Parallel Computing (NPC’19), Hohhot, Inner Mongolia, China, 2019-08-23~2019-08-24.

(12) Zhongzhe Hu, Junmin Xiao, Zhongbo Tian, Xiaoyang Zhang, Chengji Yao, Ninghui Sun, and Guangming Tan, “A Variable Batch Size Strategy for Large Scale Distributed DNN Training”, 17th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA’19), Xiamen, Fujian, China, 2019-12-16~2019-12-19 (BEST PAPER AWARD).

(13) Xiaoyang Zhang, Junmin Xiao, Xiaobin Zhang, Zhongzhe Hu, Hongrui Zhu, Zhongbo Tian, and Guangming Tan, “Tensor Layout Optimization of Convolution for Inference on Digital Signal Processor”, 17th IEEE International Symposium on Parallel and Distributed Processing with Applications (ISPA’19), Xiamen, Fujian, China, 2019-12-16~2019-12-19.

(14) Junmin Xiao, Shigang Li, Baodong Wu, He Zhang, Kun Li, Erlin Yao, Yunquan Zhang, and Guangming Tan, “Communication-avoiding for Dynamical Core of Atmospheric General Circulation Model”, 47th ACM International Conference on Parallel Processing (ICPP’18), Eugene, Oregon, USA, 2018-08-13~2018-08-16.

(15) Baodong Wu, Shigang Li, Hang Cao, Yunquan Zhang, He Zhang, Junmin Xiao, and Minghua Zhang, “AGCM3D: A Highly Scalable Finite-Difference Dynamical Core of Atmospheric General Circulation Model based on 3D Decomposition”, 24th IEEE International Conference on Parallel and Distributed Systems (ICPADS’18), Sentosa, Singapore, 2018-12-11~2018-12-13.