Projects

 

Representative publications Selected publications by area My google scholar citations

 

Jiangtao Xie*, Fei Long*, Jiaming Lv, Qilong Wang, Peihua Li. Joint Distribution Matters: Deep Brownian Distance Covariance for Few-Shot Classification. IEEE Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2022. *Equal contribution. (Oral presentation, acceptance rate ~5%) [Project][code][pdf][video]

(For the first time, we introduce Brownian distance covariance (BDC), a powerful yet overlooked similarity/distance measure, into deep learning.)

Qilong Wang, Mingze Gao, Zhaolin Zhang, Jiangtao Xie, Peihua Li, Qinghua Hu. DropCov: A Simple yet Effective Method for Improving Deep Architectures. Advances in Neural Information Processing Systems (NeurIPS), 2022. [Code][pdf]

(The power in Matrix Power Normalization (MPN) tradeoffs generalization and accuracy of 2nd-order pooling.)

 

 

Zilin Gao, Qilong Wang, Bingbing Zhang, Qinghua Hu and Peihua Li. Temporal-adaptive Covariance Pooling Networks for Video Recognition. Advances in Neural Information Processing Systems (NeurIPS), 2021. [Code][pdf]

(Action recogniton using temporal attention-based 2nd-pooling.)

 

Qilong Wang, Jiangtao Xie, Wangmeng Zuo, Lei Zhang and Peihua Li. Deep CNNs Meet Global Covariance Pooling: Better Representation and Generalization. IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 2021. [Project Page][pdf][bibtex]

(Methodology of Matrix Power Normalized Covariance Pooling Networks.)

Qilong Wang, Li Zhang, Banggu Wu, Dongwei Ren, Peihua Li, Wangmeng Zuo, Qinghua Hu. What Deep CNNs Benefit from Global Covariance Pooling: An Optimization Perspective. IEEE Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2020. [Code][pdf]

(Matrix Power Normalized Covariance Pooling (MPN-COV) makes optimization landscape more smooth and gradient prediction more stable than global average pooling.)

Qilong Wang, Banggu Wu, Pengfei Zhu, Peihua Li, Wangmeng Zuo, Qinghua Hu. ECA-Net: Efficient Channel Attention for Deep Convolutional Neural Networks. IEEE Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2020.[Code][pdf]

 

Zilin Gao, Jiangtao Xie, Qilong Wang and Peihua Li. Global Second-order Pooling Convolutional Networks. IEEE Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2019. [Code in PyTorch][pdf][bibtex]

(Second-order attention for CNN.)

 

Qilong Wang, Peihua Li, Qinghua Hu, Pengfei Zhu, Wangmeng Zuo. Deep Global Generalized Gaussian Networks. IEEE Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2019.[pdf][bibtex]

(Generalized Gaussian CNN.)

 

Peihua Li, Jiangtao Xie, Qilong Wang and Zilin Gao. Towards Faster Training of Global Covariance Pooling Networks by Iterative Matrix Square Root Normalization. IEEE Int. Conf. on Computer Vision and Pattern Recognition (CVPR), pp. 947-955, 2018. [Code in PyTorch][Code in TensorFlow][Code in MatConvNet][pdf][bibtex]

(A fast algorithm for Matrix Power (1/2) Normalized COVariance pooling (MPN-COV).)

 

Based on MPN-COV, we position 1st place in iNaturalist Challenge@FGVC5 CVPR2018 spanning 8000 species

Peihua Li, Jiangtao Xie, Qilong Wang and Wangmeng Zuo. Is Second-order Information Helpful for Large-scale Visual Recognition? IEEE Int. Conf. on Computer Vision (ICCV),  pp. 2070-2078, 2017. [Code in MatConvNet][pdf][bibtex]

(Disclose statistical and geometrical insights in Matrix Power Normalized COVariance pooling (MPN-COV), which outperforms gobal average pooling in large-scale recogniton)

 

Qilong Wang*, Zilin Gao*, Jiangtao Xie, Wangmeng Zuo and Peihua Li. Global Gated Mixture of Second-order Pooling for Improving Deep Convolutional Neural Networks. Advances in Neural Information Processing Systems (NIPS), 2018. *Equal contribution. [Code in PyTorch] [pdf][bibtex]

(Mixture of covariance pooling at the end of CNN)

 

 

Peihua Li, Qilong Wang, Hui Zeng, Lei Zhang. Local Log-Euclidean Multivariate Gaussian Descriptor and Its Application to Image Classification. IEEE Trans. on Pattern Analysis and Machine Intelligence (TPAMI), 39(4): 803-817, 2017. [pdf][code][bibtex]

(For the first time we equip the manifold of Gaussians a Lie group structure, and present two methods embedding Gaussians in vector spaces)

Qilong Wang, Peihua Li, Lei Zhang. G2DeNet: Global Gaussian Distribution Embedding Network and Its Application to Visual Recognition. Int. Conf. on Computer Vision and Pattern Recognition (CVPR), pp. 2730-2739, 2017. (Oral presentation, acceptance rate 2.7%) [pdf][code][bibtex]

(Plugging Gaussian distribution into deep CNN)

Qilong Wang, Peihua Li, Wangmeng Zuo, Lei Zhang. RAID-G: Robust Estimation of Approximate Infinite Dimensional Gaussian with Application to Materiel Recognition. Int. Conf. on Computer Vision and Pattern Recognition (CVPR), pp. 4433-4441, 2016. [pdf][code][bibtex][Slides][poster]

(Robust Gaussians descriptors for classification)

Peihua Li, Hui Zeng, Qilong Wang, Simon C. K. Shiu, Lei Zhang. High-order Local Pooling and Encoding Gaussians Over A Dictionary of Gaussians. IEEE Trans. on Image Processing (TIP), 26(7):3372-3384, 2017. [pdf] [code][bibtex]

(Encoding higher-order statistics)

Peihua Li, Xiaoxiao Lu, Qilong Wang. From Dictionary of Visual Words to Subspaces: Locality-constrained Affine Subspace Coding. Int. Conf. on Computer Vision and Pattern Recognition (CVPR), 2015.[pdf] [Project] [bibtex]

(A BoW model with a dictionay of affine subsapces)

Qilong Wang, Wangmeng Zuo, Lei Zhang, Peihua Li. Shrinkage Expansion Adaptive Metric Learning. European Conf. on Computer Vision, ECCV (7) 2014 : 456-471 [pdf, supplement, code]

Peihua Li, Qilong Wang, Lei Zhang. A Novel Earth Mover's Distance Methodology for Image Matching with Gaussian Mixture Models. IEEE Int. Conf. on Computer Vision (ICCV), 2013. [Project page]

Peihua Li, Qilong Wang, Wangmeng Zuo, Lei Zhang. Log-Euclidean Kernels for Sparse Representation and Dictionary Learning. IEEE Int. Conf. on Computer Vision (ICCV), 2013. [Project page]

Peihua Li. Tensor-SIFT based Earth Mover's Distance for Contour Tracking. Journal of Mathematical Imaging and Vision, 2013, 46(1): 44-65. Technical Report. [pdf].

Peihua Li, Qilong Wang. Local Log-Euclidean Covariance Matrix (L2ECM) for Image Representation and Its Applications. European Conf. on Computer Vision, ECCV (3) 2012 : 469-482. Preprint version and published version. [Matlab source code , Poster].