General

Kejiang Ye

PhD, Professor, Director

Center for Cloud Computing

SIAT, CAS

kj.ye@siat.ac.cn

Biography

Kejiang Ye is currently a Professor and the Director of the Research Center for Cloud Computing, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences. He received his B.S and Ph.D degree both from Zhejiang University and was a Post Doctoral Research Associate at Carnegie Mellon University (CMU). His research interests include AI system acceleration and optimization. He is a Senior Member of IEEE and a Distinguished Member of China Computer Federation (CCF).

Research Interests

Cloud computing, edge computing and on-device computing

AI system acceleration and optimization

Projects

[1] National Key R&D Program of China, 2025.12-2028.11, PI

[2] National Key R&D Program of China, 2021.12-2024.11, PI

[3] NSFC Project, 2023.1-2025.12, PI

[4] NSFC Project, 2021.1-2024.12, PI

[5] NSFC Project, 2018.1-2020.12, PI

Recent Conference Papers

  1. Y. Lin, B. Chen, X. Zhang, C. Xu, K. Ye. DynoPipe: Heterogeneous Edge-Cloud LLM Serving with Dynamically Orchestrated Pipeline Boundaries, ISCA 2026.
  2. Y. Lin, S. Peng, C. Lu, C. Xu, K. Ye. FlexPipe: Adapting Dynamic LLM Serving Through Inflight Pipeline Refactoring in Fragmented Serverless Clusters, EuroSys 2026.
  3. W. Chen, C. Lu, H. Xu, K. Ye, C. Xu. High Throughput and Low Latency LLM Serving via Adaptive KV Caching, EuroSys 2026.
  4. Y. Lin, S. Wu, S. Luo, H. Xu, H. Shen, C. Ma, M. Shen, L. Chen, C. Xu, L. Qu, K. Ye. Understanding Diffussion Model Serving in Production: A Top-Down Analysis of Workload, Scheduling, and Resource Efficiency, SoCC 2025.
  5. W. Chen, C. Lu, H. Xu, K. Ye, C. Xu. Multiplexing Dynamic Deep Learning Workloads with SLO-awareness in GPU Clusters, EuroSys 2025.
  6. L. Chen, S. Luo, C. Lin, Z. Mo, H. Xu, K. Ye, C. Xu. Derm: SLA-aware Resource Management for Highly Dynamic Microservices, ISCA 2024.
  7. C. Lu, H. Xu, Y. Li, W. Chen, K. Ye, C. Xu. SMIless: Serving DAG-based Inference with Dynamic Invocations under Serverless Computing, SC 2024
  8. W. Chen, Z. Mo, H. Xu, K. Ye, C. Xu. Interference-aware Multiplexing for Deep Learning in GPU Clusters: A Middleware Approach, SC 2023.
  9. C. Lu, H. Xu, K. Ye, G. Xu, L. Zhang, G. Yang, C. Xu. Understanding and Optimizing Workloads for Unified Resource Management in Large Cloud Platforms, EuroSys 2023
  10. S. Luo, H. Xu, K. Ye, G. Xu, L. Zhang, J. He, G. Yang, C. Xu. Erms: Efficient Resource Management for Shared Microservices with SLA Guarantees, ASPLOS 2023
  11. S. Luo, H. Xu, K. Ye, G. Xu, L. Zhang, G. Yang, C. Xu. The Power of Prediction: Microservice Auto Scaling via Workload Learning, SoCC 2022.
  12. S. Luo, H. Xu, C. Lu, K. Ye, G. Xu, L. Zhang, Y. Ding, J. He, C. Xu. Characterizing Microservice Dependency and Performance: Alibaba trace analysis, SoCC 2021.(Best Paper Award