Publications
Up-to-date list can also be found on my Google Scholar. Feel free to contact me via LinkedIn or email if you have questions or would like to share collaboration/career opportunities.
[42] Adaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization
Yuanhao Zhai, Le Wang, Wei Tang, Qilin Zhang, Nanning Zheng, David Doermann, Junsong Yuan, and Gang Hua, âAdaptive Two-Stream Consensus Network for Weakly-Supervised Temporal Action Localization,â in IEEE Transactions on Pattern Analysis and Machine Intelligence, (T-PAMI), 2022. [Link], [PDF], [BibTex]
[41] Adaptive Ladder Loss for Learning Coherent Visual-Semantic Embedding
Le Wang, Mo Zhou, Zhenxing Niu, Qilin Zhang, and Nanning Zheng, âAdaptive Ladder Loss for Learning Coherent Visual-Semantic Embeddingâ, IEEE Transactions on Multimedia (T-MM), December 2021. [Link], [PDF], [BibTex]
[40] Practical Relative Order Attack in Deep Ranking
Mo Zhou, Le Wang, Zhenxing Niu, Qilin Zhang, Yinghui Xu, Nanning Zheng, and Gang Hua, âPractical Relative Order Attack in Deep Rankingâ, in Proc. IEEE International Conference on Computer Vision (ICCV 2021), Virtual, Oct. 11-17, 2021. [Link], [arXiv], [PDF], [BibTex], [Code]
[39] Weakly Supervised Temporal Action Localization through Contrast based Evaluation Networks
Ziyi Liu, Le Wang, Qilin Zhang, Wei Tang, Nanning Zheng, and Gang Hua, âWeakly Supervised Temporal Action Localization through Contrast based Evaluation Networksâ, IEEE Transactions on Pattern Analysis and Machine Intelligence (T-PAMI), May 2021. [Link], [PDF], [BibTex]
[38] Action Coherence Network for Weakly-Supervised Temporal Action Localization
Yuanhao Zhai, Le Wang, Wei Tang, Qilin Zhang, Nanning Zheng and Gang Hua, âAction Coherence Network for Weakly-Supervised Temporal Action Localizationâ, IEEE Transactions on Multimedia (T-MM), Vol. 24, pp. 1857-1870, 2022. [Link], [PDF], [BibTex]
[37] ACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localization
Ziyi Liu, Le Wang, Qilin Zhang, Wei Tang, Junsong Yuan, Nanning Zheng, and Gang Hua, âACSNet: Action-Context Separation Network for Weakly Supervised Temporal Action Localizationâ, 35th AAAI Conference on Artificial Intelligence (AAAIâ2021), Virtual Conference, February 2-9, 2021. [Link], [PDF], [arXiv], [BibTex]
[36] Giant Panda Identification
Le Wang, Rizhi Ding, Yuanhao Zhai, Qilin Zhang, Wei Tang, Nanning Zheng, and Gang Hua, âGiant Panda Identificationâ, IEEE Transactions on Image Processing, February 2021. [Link], [PDF], [Dataset], [BibTex]
[35] Graph-based Temporal Action Co-Localization from an Untrimmed Video
Le Wang, Changbo Zhai, Qilin Zhang, Wei Tang, Nanning Zheng, and Gang Hua, âGraph-based Temporal Action Co-Localization from an Untrimmed Videoâ, Neurocomputing, January 2021. [Link], [PDF], [BibTex]
[34] Multi-label X-ray Imagery Classification via Bottom-up Attention and Meta Fusion
Benyi Hu, Chi Zhang, Le Wang, Qilin Zhang, Yuehu Liu, âMulti-label X-ray Imagery Classification via Bottom-up Attention and Meta Fusionâ, in Proc. 15th Asian Conference on Computer Vision (ACCV), Nov. 30-Dec. 4, 2020, Virtual Kyoto. [CVF Link], [PDF], [Code], [BibTex]
[33] Two-Stream Consensus Networks for Weakly-Supervised Temporal Action Localization
Yuanhao Zhai, Le Wang, Wei Tang, Qilin Zhang, Junsong Yuan, Gang Hua, âTwo-Stream Consensus Networks for Weakly-Supervised Temporal Action Localizationâ, in Proc. 16th European Conference on Computer Vision (ECCV), 23-28 August 2020. [Link], [PDF], [BibTex]
[32] Adversarial Ranking Attack and Defense
Mo Zhou, Zhenxing Niu, Le Wang, Qilin Zhang, Gang Hua, âAdversarial Ranking Attack and Defenseâ, in Proc. 16th European Conference on Computer Vision (ECCV), 23-28 August 2020. [Link], [PDF], [arXiv], [BibTex]
[31] Object Cosegmentation in Noisy Videos with Multilevel Hypergraph
Le Wang, Xin Lv, Qilin Zhang, Zhenxing Niu, Nanning Zheng, Gang Hua, âObject Cosegmentation in Noisy Videos with Multilevel Hypergraphâ, IEEE Transactions on Multimedia (T-MM), May 2020. [Link], [PDF], [BibTex]
[30] Joint Multi-Object Detection and Segmentation from an Untrimmed Video
Xinling Liu, Le Wang, Qilin Zhang, Nanning Zheng, and Gang Hua, âJoint Multi-Object Detection and Segmentation from an Untrimmed Videoâ, in Proc. IFIP International Conference on Artificial Intelligence Applications and Innovations (AIAIâ2020), Halkidiki, Greece, 5-7 June, 2020. (Oral Presentation) [Link], [PDF], [BibTex]
[29] Fine-grained Giant Panda Identification
Rizhi Ding, Wang Le, Qilin Zhang, Zhenxing Niu, Nanning Zheng, Gang Hua, âFine-grained Giant Panda Identificationâ, In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2020), Barcelona, Spain. [Link], [PDF], [DataSet], [BibTex]
[28] Ladder Loss for Coherent Visual-Semantic Embedding
Mo Zhou, Zhenxing Niu, Le Wang, Zhanning Gao, Qilin Zhang, Gang Hua, âLadder Loss for Coherent Visual-Semantic Embeddingâ, In Proceedings of thirty-fourth AAAI Conference on Artificial Intelligence (AAAI-20), February 7-12, 2020, New York, New York, USA. [Link], [arXiv], [PDF], [BibTex]
[27] Action Co-Localization in an Untrimmed Video by Graph Neural Networks
Changbo Zhai, Le Wang, Qilin Zhang, Zhanning Gao, Zhenxing Niu, Nanning Zheng, Gang Hua, âAction Co-Localization in an Untrimmed Video by Graph Neural Networksâ, in Proc. 26th International Conference On Multimedia Modeling (MMM 2020), Daejeon, Korea, Jan. 5-8, 2020. [Link], [PDF], [BibTex]
[26] Weakly Supervised Temporal Action Localization through Contrast based Evaluation Networks
Ziyi Liu, Le Wang, Qilin Zhang, Zhanning Gao, Zhenxing Niu, Nanning Zheng, Gang Hua, âWeakly Supervised Temporal Action Localization through Contrast based Evaluation Networksâ, in Proc. IEEE International Conference on Computer Vision (ICCV 2019), Seoul, Korea, Oct. 27-Nov. 2, 2019. [Link], [PDF], [BibTex]
[25] Action Coherence Network for Weakly Supervised Temporal Action Localization
Yuanhao Zhai, Le Wang, Ziyi Liu, Qilin Zhang, Gang Hua, and Nanning Zheng, âAction Coherence Network for Weakly Supervised Temporal Action Localizationâ, in Proc. IEEE International Conference on Image Processing (ICIPâ2019), Taipei, September 22-25, 2019. [Link], [PDF], [BibTeX]
[24] Object Affordances Graph Network for Action Recognition
Haoliang Tan, Le Wang, Qilin Zhang, Zhanning Gao, Nanning Zheng, and Gang Hua, âObject Affordances Graph Network for Action Recognitionâ, in Proc. 30th British Machine Vision Conference, BMVC 2019, Cardiff University, Cardiff, UK, September 9-12, 2019. (Spotlight) [Link], [PDF], [BibTeX]
[23] Extracting Action Sensitive Features to Facilitate Weakly-supervised Action Localization
Zijian Kang, Le Wang, Ziyi Liu, Qilin Zhang, and Nanning Zheng, âExtracting Action Sensitive Features to Facilitate Weakly-supervised Action Localizationâ, In Proceedings of the 15th International Conference on Artificial Intelligence Applications and Innovations (AIAIâ2019), May 24-26, 2019, Crete, Greece. [Link], [PDF], [BibTeX]
[22] Video Imprint Segmentation for Temporal Action Detection in Untrimmed Videos
Zhanning Gao, Le Wang, Qilin Zhang, Zhenxing Niu, Nanning Zheng, and Gang Hua, âVideo Imprint Segmentation for Temporal Action Detection in Untrimmed Videosâ, In Proceedings of the 33rd AAAI Conference on Artificial Intelligence (AAAI-19), January 27 - February 1, 2019, Honolulu, Hawaii, USA. (oral presentation) [Link], [PDF], [BibTex]
[21] Convolutional Neural Networks with Generalized Attentional Pooling for Action Recognition
Yunfeng Wang, Wengang Zhou, Qilin Zhang, Houqiang Li, âConvolutional Neural Networks with Generalized Attentional Pooling for Action Recognitionâ, IEEE International Conference on Visual Communications and Image Processing (VCIP), December 2018. (oral presentation) [Link], [PDF], [BibTex]
[20] Video Object Co-segmentation from Noisy Videos by a Multi-level Hypergraph Model
Lv, Xin, Le Wang, Qilin Zhang, Nanning Zheng, and Gang Hua. âVideo Object Co-segmentation from Noisy Videos by a Multi-level Hypergraph Modelâ, in Proc. IEEE International Conference on Image Processing (ICIPâ2018), Athens, Greece, October, 2018. [Link], [PDF], [BibTeX]
[19] Joint Spatio-temporal Action Localization in Untrimmed Videos with Per-frame Segmentation
Duan, Xuhuan, Le Wang, Changbo Zhai, Nanning Zheng, Qilin Zhang, Zhenzing Niu, and Gang Hua. âJoint Spatio-temporal Action Localization in Untrimmed Videos with Per-frame Segmentationâ, in Proc. IEEE International Conference on Image Processing (ICIPâ2018), Athens, Greece, October, 2018. (oral presentation) [Link], [PDF], [BibTeX]
[18] Joint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networks
Ziyi Liu, Le Wang, Gang Hua, Qilin Zhang, Zhenxing Niu, Ying Wu, Nanning Zheng, âJoint Video Object Discovery and Segmentation by Coupled Dynamic Markov Networksâ, IEEE Transactions on Image Processing, vol. 27, no. 12, pp. 5840-5853, Dec. 2018. [Link], [PDF], [BibTeX]
[17] Weighted Multi-Region Convolutional Neural Network for Action Recognition with Low-Latency Online Prediction
Wang, Yunfeng, Wengang Zhou, Qilin Zhang, Xiaotian Zhu, and Houqiang Li. âWeighted Multi-Region Convolutional Neural Network for Action Recognition with Low-Latency Online Predictionâ, in Proc. IEEE International Conference on Multimedia and Expo (ICME), Workshops, San Diego, USA, July 2018. [Link], [PDF], [extended arXiv], [BibTeX]
[16] Enhanced Action Recognition with Visual Attribute-augmented 3D Convolutional Neural Network
Wang, Yunfeng, Wengang Zhou, Qilin Zhang, and Houqiang Li. âEnhanced Action Recognition with Visual Attribute-augmented 3D Convolutional Neural Networkâ, in Proc. IEEE International Conference on Multimedia and Expo (ICME), Industry Program, San Diego, USA, July 2018. [Link], [PDF], [extended arXiv], [BibTeX]
[15] Traffic Sensory Data Classification by Quantifying Scenario Complexity
Wang, Jiajie, Chi Zhang, Yuehu Liu, and Qilin Zhang. âTraffic Sensory Data Classification by Quantifying Scenario Complexityâ, in Proc. IEEE Intelligent Vehicles Symposium (IVâ2018), Changshu, China, June, 2018. [Link], [PDF], [BibTeX]
[14] Multi-model Traffic Scene Simulation with Road Image Sequences and GIS Information
Cui, Zhichao, Yuehu Liu, Fuji Ren, and Qilin Zhang. âMulti-model Traffic Scene Simulation with Road Image Sequences and GIS Informationâ, in Proc. IEEE Intelligent Vehicles Symposium (IVâ2018), Changshu, China, June, 2018. [Link], [PDF], [BibTeX]
[13] A Graded Offline Evaluation Framework for Intelligent Vehicleâs Cognitive Ability
Zhang, Chi, Yuehu Liu, Qilin Zhang, and Le Wang, âA Graded Offline Evaluation Framework for Intelligent Vehicleâs Cognitive Abilityâ, in Proc. IEEE Intelligent Vehicles Symposium (IVâ2018), Changshu, China, June, 2018. (oral presentation) [Link], [PDF], [BibTeX]
[12] Action Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Network
Wang, Le, Jinliang Zang, Qilin Zhang, Zhenxing Niu, Gang Hua, and Nanning Zheng, âAction Recognition by an Attention-Aware Temporal Weighted Convolutional Neural Networkâ, Sensors 18, no. 7 (2018): 1979. [Link], [PDF], [BibTeX]
[11] Attention-based Temporal Weighted Convolutional Neural Network for Action Recognition
Zang, Jinliang, Le Wang, Ziyi Liu, Qilin Zhang, Zhenxing Niu, Gang Hua, and Nanning Zheng. âAttention-based Temporal Weighted Convolutional Neural Network for Action Recognition.â In Proceedings of the 14th International Conference on Artificial Intelligence Applications and Innovations (AIAIâ2018), May 25-27, 2018, Rhodes, Greece. [Link], [PDF], [arXiv], [BibTeX]
[10] Segment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentation
Wang, Le, Xuhuan Duan, Qilin Zhang, Zhenxing Niu, Gang Hua, and Nanning Zheng, âSegment-Tube: Spatio-Temporal Action Localization in Untrimmed Videos with Per-Frame Segmentationâ, Sensors 18, no. 5 (2018): 1657. [Link], [PDF], [BibTeX]
[9] Video-based Sign Language Recognition without Temporal Segmentation
Huang, Jie, Wengang Zhou, Qilin Zhang, Houqiang Li, and Weiping Li âVideo-based Sign Language Recognition without Temporal Segmentation.â In Proceedings of the 32nd AAAI Conference on Artificial Intelligence (AAAI-18), Feb. 2-7, 2018, New Orleans, Louisiana, USA. [Link], [arXiv], [PDF], [BibTeX]
[8] A Hyperspectral Image Classification Framework with Spatial Pixel Pair Features
Ran, Lingyan, Yanning Zhang, Wei Wei, and Qilin Zhang. âA Hyperspectral Image Classification Framework with Spatial Pixel Pair Features.â Sensors 17, no. 10 (2017): 2421. [Link], [PDF], [BibTeX]
[7] Convolutional Neural Network-Based Robot Navigation Using Uncalibrated Spherical Images
Ran, Lingyan, Yanning Zhang, Qilin Zhang, and Tao Yang. âConvolutional Neural Network-Based Robot Navigation Using Uncalibrated Spherical Images.â Sensors 17, no. 6 (2017): 1341. [Link], [PDF], [YouTube demo], [Code Zip], [BibTeX]
[6] Auxiliary Training Information Assisted Visual Recognition
Zhang, Qilin, Gang Hua, Wei Liu, Zicheng Liu, and Zhengyou Zhang. âAuxiliary Training Information Assisted Visual Recognition.â IPSJ Transactions on Computer Vision and Applications 7 (2015): 138-150. [Link], [PDF], [BibTeX]
[5] Multi-View Visual Recognition of Imperfect Testing Data
Zhang, Qilin, and Gang Hua. âMulti-view visual recognition of imperfect testing data.â In Proceedings of the 23rd ACM international conference on Multimedia, pp. 561-570. ACM, 2015. (oral presentation) [Link], [PDF], [BibTeX]
[4] Can Visual Recognition Benefit from Auxiliary Information in Training?
Zhang, Qilin, Gang Hua, Wei Liu, Zicheng Liu, and Zhengyou Zhang. âCan visual recognition benefit from auxiliary information in training?.â In Asian Conference on Computer Vision, pp. 65-80. Springer, Cham, 2014. (oral presentation, oral acceptance rate 4%) [Link], [PDF], [Supplemental PDF], [BibTeX]
[3] Iterative Sparse Asymptotic Minimum Variance Based Approaches for Array Processing
Abeida, Habti, Qilin Zhang, Jian Li, and Nadjim Merabtine. âIterative sparse asymptotic minimum variance based approaches for array processing.â IEEE Transactions on Signal Processing 61, no. 4 (2013): 933-944. [Link], [arXiv], [PDF], [Code Zip], [BibTeX]
[2] Fast implementation of sparse iterative covariance-based estimation for source localization
Zhang, Qilin, Habti Abeida, Ming Xue, William Rowe, and Jian Li. âFast implementation of sparse iterative covariance-based estimation for source localization.â The Journal of the Acoustical Society of America 131, no. 2 (2012): 1249-1259. [Link], [PDF], [BixTeX], [Code Zip]
[1] Fast implementation of sparse iterative covariance-based estimation for array processing
Zhang, Qilin, Habti Abeida, Ming Xue, William Rowe, and Jian Li. âFast implementation of sparse iterative covariance-based estimation for array processing.â In Signals, Systems and Computers (ASILOMAR), 2011 Conference Record of the Forty Fifth Asilomar Conference on, pp. 2031-2035. IEEE, 2011. [Link], [PDF], [BibTeX], [Code Zip]