WebbTechnical Papers Archive QoS-Aware Irregular Collaborative Inference for Improving Throughput of DNN Services. Authors: Kaihua Fu, Jiuchen Shi, and Quan Chen (Shanghai Jiao Tong University); Ningxin Zheng (Microsoft Research Asia); Wei Zhang (Shanghai Jiao Tong University); Deze Zeng (China University of Geosciences); and Minyi Guo … WebbEnable Simultaneous DNN Services Based on Deterministic Operator Overlap and Precise Latency Prediction. Authors: Weihao Cui, Han Zhao, and Quan Chen (Shanghai Jiao Tong University); Ningxin Zheng (Microsoft Research Asia); Jingwen Leng and Jieru Zhao (Shanghai Jiao Tong University); Zhuo Song, Tao Ma, and Yong Yang (Alibaba Cloud); …
Ningxin Zheng - Home - Author DO Series
WebbSpaceEvo: Searching Hardware-Friendly Search Space for Efficient Int8 Inference. Li Lyna Zhang, Xudong Wang, Jiahang Xu, Quanlu Zhang, Yuqing Yang, Ningxin Zheng, Ting … Webb1 nov. 2024 · nni Public Forked from microsoft/nni An open source AutoML toolkit for automate machine learning lifecycle, including feature engineering, neural architecture … cdl classes in midland tx
Ningxin Zheng - Publications - Author DO Series
WebbWei Zhang, Quan Chen, Kaihua Fu, Ningxin Zheng, Zhiyi Huang, Jingwen Leng, Chao Li, Wenli Zheng, Minyi Guo: Towards QoS-Aware and Resource-Efficient GPU Microservices Based on Spatial Multitasking GPUs In Datacenters. CoRR abs/2005.02088 (2024) 2010 – 2024. see FAQ. What is the meaning of the colors in the publication lists? WebbSparTA: Deep-Learning Model Sparsity via Tensor-with-Sparsity-Attribute WebbNingxin Zheng, Bin Lin, Quanlu Zhang, Lingxiao Ma, Yuqing Yang, Fan Yang, Yang Wang, Mao Yang, Lidong Zhou. OSDI 2024 July 2024 View Publication. CoDL: … cdl classes in cleveland ohio