Teng Yu, Wenlai Zhao*, Pan Liu, etc., “Large-Scale Automatic K-Means Clustering for Heterogeneous Many-Core Supercomputer”[J]. IEEE Transactions on Parallel and Distributed Systems (TPDS) 2020
Yixue Hao, Min Chen, Donggang Cao, Wenlai Zhao, Ivan Petrov, Vitaly Antonenko, Ruslan Smeliansky, “Cognitive-Caching: Cognitive Wireless Mobile Caching by Learning Fine-Grained Caching-Aware Indicators”[J], IEEE Wireless Communications 2020
Liang Qiao, Hongkun Yu, Kunpeng Wang, Ruixin Sun, Wenlai Zhao*, Guangwen Yang,“Large-scale Parallel Design for Cryo-EM Structure Determination on Heterogeneous Many-core Architectures”[C]. Proceedings of the IEEE International Conference on Bioinformatics and Biomedicine (BIBM) 2019
Ouyi Li, Wenlai Zhao*, Xuancheng Huang, etc., “Scaling the Training of Recurrent Neural Networks on Sunway TaihuLight Supercomputer”[C]. International Conference on Computational Science (ICCS) 2019
Wei Gao, Jiarui Fang, Wenlai Zhao, Jinzhe Yang, etc., “swATOP: Automatically optimizing deep learning operators on SW26010 many-core processor”[C]. Proceedings of the 48th International Conference on Parallel Processing (ICPP) 2019
Wenlai Zhao, Haohuan Fu, Jiarui Fang, etc., “Optimizing Convolutional Neural Networks on Sunway TaihuLight Supercomputer”[J], ACM Transactions on Architecture and Code Optimization (TACO) 2018
Liandeng Li, Teng Yu, Wenlai Zhao, Haohuan Fu, etc., “Large-Scale Hierarchical k-means for Heterogeneous Many-Core Supercomputers”[C], Supercomputing (SC) 2018
Jiarui Fang, Haohuan Fu, Wenlai Zhao, Bingwei Chen, Weijie Zheng, Guangwen Yang, “swDNN: A Library for Accelerating Deep Learning Applications on Sunway TaihuLight Supercomputer”[C], 31st IEEE International Parallel \& Distributed Processing Symposium (IPDPS) 2017
Wenlai Zhao, Haohuan Fu, Wayne Luk, and etc. “F-CNN: An FPGA-based Framework for Training Convolutional Neural Networks”, Application-specific Systems, Architectures and Processors (
ASAP) 2016
Wenlai Zhao, Haohuan Fu, Wayne Luk and Guangwen Yang, “Patra: Parallel Tree-reweighted Message Passing Architecture”[C], Field Programmable Logic and Applications (FPL) 2014
Wenlai Zhao, Haohuan Fu and Guangwen Yang, “A Fully-Pipelined FPGA Design for Tree-reweighted Message Passing Algorithm”[C], Field-Programmable Custom Computing Machines (FCCM) 2014