Publications

IEEE T-PAMI (CCF-A Journal)

Hanyu Xuan; Zhiliang Wu; Jian Yang; Bo Jiang; Lei Luo; Xavier Alameda-Pineda; Yan Yan. Robust Audio-Visual Contrastive Learning for Proposal-based Self-supervised Sound Source Localization in Videos[J]. IEEE Transactions on Pattern Analysis and Machine Intelligence, 2024.

AAAI’2024 (CCF-A Conference)

Zhiliang Wu; Changchang Sun; Hanyu Xuan*(corresponding author); Gaowen Liu; Yan Yan. WaveFormer: Wavelet Transformer for Noise-Robust Video Inpainting[C]//Proceedings of the AAAI Conference on Artificial Intelligence. 2024

CVPR’2023 (CCF-A Conference)

Zhiliang Wu; Hanyu Xuan*(corresponding author); Changchang Sun; Weili Guan; Kang Zhang; Yan Yan. Semi-Supervised Video Inpainting with Cycle Consistency Constraints[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2023: 22586-22595.

IJCAI’2022 (CCF-A Conference)

Hanyu Xuan; Yihong Xu; Shuo Chen; Zhiliang Wu; Jian Yang; Yan Yan; Xavier Alameda-Pineda. Robust Audio-Visual Instance Discrimination via Active Contrastive Set Mining[C]//Proceedings of the International Joint Conference on Artificial Intelligence. 2022: 3643-3649.

CVPR’2022 (CCF-A Conference)

Hanyu Xuan; Zhiliang Wu; Jian Yang; Yan Yan; Xavier Alameda-Pineda. A proposal-based paradigm for Self-supervised Sound Source Localization in Videos[C]//Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 2022: 1029-1038.

TIP (CCF-A Journal)

Hanyu Xuan; Lei Luo; Zhenyu Zhang; Jian Yang; Yan Yan. Discriminative Cross-Modality Attention Network for Temporal Inconsistent Audio-Visual Event Localization[J]. IEEE Transactions on Image Processing. 2021, 30: 7878-7888.

AAAI’2020 (CCF-A Conference)

Hanyu Xuan; Zhenyu Zhang; Shuo Chen; Jian Yang; Yan Yan. Cross-modal Attention Network for Temporal Inconsistent Audio-Visual Event Localization[C]//Proceedings of the AAAI Conference on Artificial Intelligence. 2020, 34(01): 279-286.