Zhenhua Han 韩震华

Logo

Senior Researcher

Microsoft Research Asia (Shanghai)

Zhenhua.Han [at] microsoft [dot] com

I am a Senior Researcher in Microsoft Research Asia (Shanghai). I obtained Ph.D. degree from The University of Hong Kong (HKU) in 2020, advised by Prof. Francis C.M. Lau. Before that, I obtained B.Eng in Electronic and Information Engineering from University of Electronic Science and Technology of China in 2014. My research interests are resource management, systems for machine learning, and cloud computing.

Selected Publications (Google Scholar)

+ Student I advised

Machine Learning Systems

  1. Parrot: Efficient Serving of LLM-based Applications with Semantic Variable
    Chaofan Lin+, Zhenhua Han*, Chengruidong Zhang, Yuqing Yang, Fan Yang, Chen Chen, Lili Qiu
    (* Corresponding author)
    USENIX OSDI 2024 [Paper] [Code]

  2. PIT: Optimization of Dynamic Sparse Deep Learning Models via Permutation Invariant Transformation
    Ningxin Zheng, Huiqiang Jiang, Quanlu Zhang*, Zhenhua Han*, Lingxiao Ma, Yuqing Yang*, Fan Yang, Chengruidong Zhang, Lili Qiu, Mao Yang, Lidong Zhou
    (* Corresponding author)
    ACM SOSP 2023

  3. Optimizing Dynamic Neural Networks with Brainstorm
    Weihao Cui+, Zhenhua Han, Lingji Ouyang+, Yichuan Wang, Ningxin Zheng, Lingxiao Ma, Yuqing Yang, Fan Yang, Jilong Xue, Lili Qiu, Lidong Zhou, Quan Chen, Haisheng Tan, Minyi Guo
    (* Corresponding author)
    USENIX OSDI 2023

  4. Dynamic Resource Allocation for Deep Learning Clusters with Separated Compute and Storage
    Mingxia Li+, Zhenhua Han, Chi Zhang, Ruiting Zhou, Yuanchi Liu, Haisheng Tan
    IEEE INFOCOM 2023

  5. ElasticFlow: An Elastic Serverless Training Platform for Distributed Deep Learning
    Diandian Gu, Yihao Zhao, Yinmin Zhong , Yifan Xiong, Zhenhua Han, Peng Cheng, Fan Yang, Gang Huang, Xin Jin, Xuanzhe Liu
    ASPLOS 2023

  6. SiloD: A Co-design of Caching and Scheduling for Deep Learning Clusters
    Hanyu Zhao*, Zhenhua Han*, Zhi Yang, Quanlu Zhang, Mingxia Li+, Fan Yang, Qianxi Zhang, Binyang Li, Yuqing Yang, Lili Qiu, Lintao Zhang, Lidong Zhou
    (*co-first author)
    EuroSys 2023 [Paper]

  7. PilotFish: Harvesting Free Cycles of Cloud Gaming with Deep Learning Training
    Wei Zhang+, Binghao Chen, Zhenhua Han, Quan Chen, Peng Cheng, Fan Yang, Ran Shu, Yuqing Yan, Minyi Guo
    (* Corresponding author)
    USENIX ATC ‘22 [Paper] [Code] [Demo Video]

  8. HiveD: Sharing a GPU Cluster for Deep Learning with Guarantees
    Hanyu Zhao*, Zhenhua Han*, Zhi Yang, Quanlu Zhang, Fan Yang, Lidong Zhou, Mao Yang, Francis C.M. Lau, Yuqi Wang, Yifan Xiong, Bin Wang
    (*co-first author)
    USENIX OSDI 2020 [Paper] [Github]

  9. Retiarii: A Deep Learning Exploratory-Training Framework
    Quanlu Zhang, Zhenhua Han, Fan Yang, Yuge Zhang, Zhe Liu, Mao Yang, Lidong Zhou
    USENIX OSDI 2020 [Artifact Code]

  10. Automating Cloud Deployment for Deep Learning Inference of Real-time Online Services
    Yang Li*, Zhenhua Han*, Quanlu Zhang, Zhenhua Li, Haisheng Tan
    (*Co-first authors)
    IEEE INFOCOM 2020
    Full version accepted by IEEE/ACM Transaction on Networking, 2023

  11. Gandiva: Introspective Cluster Scheduling for Deep Learning
    Wencong Xiao, Romil Bhardwaj , Ramachandran Ramjee, Muthian Sivathanu, Nipun Kwatra, Zhenhua Han, Pratyush Patel, Xuan Peng, Hanyu Zhao, Quanlu Zhang, Fan Yang, Lidong Zhou
    USENIX OSDI 2018 [Paper]

Scheduling

  1. Online File Caching in Latency-Sensitive Systems with Delayed Hits and Bypassing
    Chi Zhang, Haisheng Tan, Guopeng Li, Zhenhua Han, Shaofeng H-C. Jiang, Xiang-Yang Li
    IEEE INFOCOM 2022

  2. Regularization-Based Coflow Scheduling in Optical Circuit Switches
    Haisheng Tan; Chi Zhang; Chao Xu; Yupeng Li; Zhenhua Han; Xiang-Yang Li.
    To appear on IEEE/ACM Transaction on Networking, 2021

  3. Online Dispatching and Scheduling of Jobs with Heterogeneous Utilities in Edge Computing
    Chi Zhang, Haisheng Tan, Haoqiang Huang, Zhenhua Han, Shaofeng H.-C. Jiang, Nikolaos Freris, Xiang-Yang Li.
    ACM MobiHoc 2020

  4. Scheduling Placement-Sensitive BSP Jobs with Inaccurate Execution Time Estimation
    Zhenhua Han, Haisheng Tan, Shaofeng H.-C. Jiang, Xiaoming Fu, Wanli Cao, Francis C.M. Lau
    To appear on IEEE/ACM Transaction on Networking, 2021
    Previously accepted by IEEE INFOCOM 2020

  5. OnDisc: Online Latency-Sensitive Job Dispatching and Scheduling in Heterogeneous Edge-Clouds
    Zhenhua Han, Haisheng Tan, Xiang-Yang Li, Shaofeng H.-C. Jiang, Yupeng Li, Francis C.M. Lau.
    IEEE/ACM Transaction on Networking Dec. 2019
    Previously accepted by IEEE INFOCOM 2017

  6. Joint Online Coflow Routing and Scheduling in Data Center Networks
    Haisheng Tan, Shaofeng Jiang, Yupeng Li, Xiang-Yang Li, Chenzi Zhang, Zhenhua Han, Francis C.M. Lau
    IEEE/ACM Transaction on Networking Aug. 2019

  7. Camul: Online Caching on Multiple Caches with Relaying and Bypassing
    Haisheng Tan, Shaofeng Jiang, Zhenhua Han*, Liuyan Liu, Kai Han, Qinglin Zhao
    (* Corresponding author)
    IEEE INFOCOM 2019.

  8. Energy Efficient Dynamic Virtual Machine Management in Data Centers
    Zhenhua Han, Haisheng Tan, Rui Wang, Guihai Chen, Yupeng Li, Francis C.M. Lau
    IEEE/ACM Transaction on Networking Jan. 2019
    Previously accepted by IEEE INFOCOM 2016

Wireless Communication

  1. Efficient Online Learning Based Cross-Tier Uplink Scheduling in HetNets
    Zhenhua Han, Haisheng Tan, Rui Wang, Shaojie Tang, Francis C.M. Lau.
    To appear on IEEE/ACM Transaction on Networking, 2022
    Previously accepted by IEEE INFOCOM 2018

  2. Congestion Game with Agent and Resource Failures
    Yupeng Li, Yongzheng Jia, Haisheng Tan, Rui Wang, Zhenhua Han, Francis C.M. Lau.
    IEEE Journal on Selected Areas in Communications (JSAC) 2017

Services

  1. IEEE INFOCOM 2021, 2022 TPC Member
  2. MSN 2020 TPC Member
  3. Reviewer of IEEE/ACM Transaction on Networking