I am an experienced researcher specializing in computer vision and machine learning with 10+ years of dedicated work. My research focuses on multimodal representation learning, human perception (face & body), and AI-powered maintenance for automated condition monitoring. I have a publication record in top venues (such as CVPR/ICCV/ECCV), and am passionate in translating research into real-world applications. I have successfully led cross-functional teams from designing, implemention to delivering solutions for renowned institutions such as Tencent, Docomo, and Anding Hospital in Beijing. Currently, I am an associate professor at Beijing University of Posts and Telecommunications. I spent one year at the Robotics Institute of CMU with Fernando De la Torre and Jeff F. Cohn, and another year at the Department of Electrical and Computer Engineering of OSU with Aleix M. Martinez.
News
- [2022-12] Congrats to Lanfei Wang on the acceptance of two papers on Neural Architecture Search (with Huawei)!
- [2022-08] I am honored to be selected as a Doctoral Advisor (outstanding Associate Professors recognized for examining grant proposals and PhD theses).
- [2022-06] Congrats to Shi Pu for presenting zero-shot video classification at CVPR 2022! This work has been adopted in Tencent's Short Video Recommendation System.
- [2021-10] Congrats to Mingfei Cheng on the Curvilinear Structure Segmentation work presented at ICCV 2021!
- [2021-09] Our grant on "Cross-domain Facial Action Unit Detection" is accepted by NSFC of China with a funding rate of 17%.
- [2020-10] Congrats to Xiaolin Song for presenting Occludded Pedestrian Detection work at ECCV 2020.
- [2019-12] I was promoted to Associate Professor at the School of Artifical Intelligence and the School of Information and Communication Engineering.
Publications
Video Attribute Prototype Network: A New Perspective for Zero-Shot Video Classification Bo Wang, Kaili Zhao, Hongyang Zhao, Shi Pu, Bo Xiao, Jun Guo ICCV'W 2023 | |
---|---|
AB-Net: Enhancing Anchor-Free Human Detection through Cascade Design and Bi-Center Strategy Hongyang Zhao, Kaili Zhao Under review | |
Alignment-Uniformity Representation Learning for Zero-shot Video Classification Kaili Zhao*, Shi Pu*, Mao Zheng (*equal contribution) CVPR 2022 Paper Slides Code | |
M2NAS: Joint Neural Architecture Optimization System with Network Transmission Lanfei Wang, Lingxi Xie, Kaifeng Bi, Kaili Zhao, Jun Guo, Qi Tian IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems 2022 Paper | |
Regularized Differentiable Architecture Search Lanfei Wang, Lingxi Xie, Kaili Zhao, Jun Guo, Qi Tian IEEE Embedded Systems Letters 2022 Paper | |
Joint Topology-preserving and Feature-refinement Network for Curvilinear Structure Segmentation Kaili Zhao*, Mingfei Cheng*, Xuhong Guo, Yajing Xu, Jun Guo (*equal contribution) ICCV 2021 Paper Slides Code | |
Progressive Refinement Network for Occludded Pedestrian Detection Kaili Zhao*, Xiaolin Song*, Wen-Sheng Chu, Honggang Zhang, Jun Guo (*equal contribution) ECCV 2020 Paper Slides Code Video | |
Robust visual tracking by embedding combination and weighted-gradient optimization Jin Feng, Peng Xu, Shi Pu, Kaili Zhao, Honggang Zhang Pattern Recognition 2020 Paper | |
Enhanced Initialization with Multi-Stage Learning for Robust Visual Tracking Jin Feng, Shi Pu, Kaili Zhao, Honggang Zhang, Tianming Du IEEE Visual Communications and Image Processing 2019 (oral) Paper | |
Learning Facial Action Units from Web Images with Scalable Weakly Supervised Clustering Kaili Zhao, Wen-Sheng Chu, Aleix M. Martinez CVPR 2018 Paper Poster Code | |
Deep Region and Multi-label Learning for Facial Action Unit Detection Kaili Zhao, Wen-Sheng Chu, Honggang Zhang CVPR 2016 Paper Poster Code | |
Joint Patch and Multi-label Learning for Facial Action Unit and Holistic Expression Recognition Kaili Zhao, Wen-Sheng Chu, Fernando De la Torre, Jeffery F. Cohn, Honggang Zhang IEEE Transactions on Image Processing 2016 Paper Code | |
Joint Patch and Multi-label Learning for Facial Action Unit Detection Kaili Zhao, Wen-Sheng Chu, Fernando De la Torre, Jeffery F. Cohn, Honggang Zhang CVPR 2015 Paper Poster Code | |
Industrial Grants
- PI, “Metallic Corrosion Detection,” with DOCOMO Beijing Communications Laboratories Co., Ltd., 11/2021–now
- PI, “Cross-domain Facial Action Unit Detection,” with National Natural Science Foundation of China, 01/2021–12/2024
- PI, “Human Attention and Counting for Indoor Watching,” with DOCOMO Beijing Communications Laboratories Co., Ltd., 10/2019–06/2020
- PI, “Crowd Counting on Street-view Images,” with DOCOMO Beijing Communications Laboratories Co., Ltd., 03/2019–08/2019
- PI, “Drone-based Image Processing for Crack Inspection,” with DOCOMO Beijing Communications Laboratories Co., Ltd., 06/2018–08/2019
- PI, “Weakly-supervised Spectral Clustering and Its Application to Recognize Facial Expressions in 1 Million Facial Images,” with National Natural Science Foundation of China, 09/2017–12/2020
- PI, “Multi-label Learning for Facial Action Unit Detection,” with Fundamental Research Funds for the Central Universities, 07/2017–10/2018
- Co-PI, “Depression Detection based on Automatic Facial Action Unit Analysis,” with Institute of Mental Health, Beijing Anding Hospital (top psychiatry and mental health hospital in China), 11/2017–11/2021
Students
PhD Students- Xiaolin Song (2023), now in Alibaba
- Lanfei Wang (2022), now in Huawei
- Jin Feng (2021), now in JD
- Shi Pu (2020), now in Tencent
- Yuqi Liao (2025)
- Yishan Chen (2025)
- Wenqi Xu (2024)
- Hongyang Zhao (2023), now in Baidu
- Bo Wang (2023), now in ByteDance
- Mingfei Cheng (2022), now PhD at Singapore Management Univerisity
Teaching
- Introduction to Artificial Intelligence (2021): 110 undergrad students.
- Intelligent Image Analysis (2020, 2021): 270 graduate students.
- Digital Image Processing (2017, 2018): 190 undergrad students.