For more publications, please refer to the supervisors’ homepage: Prof. Dong Xu

Journals


Unsupervised Part Discovery via Dual Representation Alignment, J Xia, W Huang, M Xu, J Zhang, H Zhang, Z Sheng, D Xu, IEEE TPAMI, 2024 [PDF] [Code]


3D Reconstruction From a Single Sketch via View-Dependent Depth Sampling, C Gao, X Wang, Q Yu, L Sheng, J Zhang, X Han, YZ Song, D Xu, IEEE TPAMI, 2024 [PDF] [Code]


Cross-domain detection transformer based on spatial-aware and semantic-aware token alignment, J Deng, X Zhang, W Li, L Duan, D Xu, IEEE TMM, 2023 [PDF]


An End-to-End Learning Framework for Video Compression, G Lu, X Zhang, W Ouyang, L Chen, Z Gao, D Xu, IEEE TPAMI, 2023 [PDF]


Cross-Dataset Point Cloud Recognition Using Deep-Shallow Domain Adaptation Network, F Wang, W Li, D Xu, IEEE TIP, 2021 [PDF]


Improving Weakly Supervised Temporal Action Localization by Exploiting Multi-Resolution Information in Temporal Domain, R Su, D Xu, L Zhou, W Ouyang, IEEE TIP, 2021 [PDF]


Dense Video Captioning Using Graph-Based Sentence Summarization, Z Zhang, D Xu, W Ouyang, L Zhou, IEEE TMM, 2021 [PDF]


Model Compression Using Progressive Channel Pruning, J Guo, W Zhang, W Ouyang, D Xu, IEEE TCSVT, 2021 [PDF]


Progressive Modality Cooperation for Multi-Modality Domain Adaptation, W Zhang, D Xu, J Zhang, W Ouyang, IEEE TIP, 2021 [PDF]


Progressive Cross-Stream Cooperation in Spatial and Temporal Domain for Action Localization, R Su, D Xu, L Zhou, W Ouyang, IEEE TPAMI, 2021 [PDF]


Self-Paced Collaborative and Adversarial Network for Unsupervised Domain Adaptation, W Zhang, W Ouyang, W Li, D Xu, IEEE TPAMI, 2021 [PDF]


Deep Non-Local Kalman Network for Video Compression Artifact Reduction, G Lu, X Zhang, W Ouyang, D Xu, L Chen, Z Gao, IEEE TIP, 2020 [PDF]


Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization, Z Zhang, D Xu, W Ouyang, C Tan, IEEE TCSVT, 2020 [PDF]


Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition: A Problem-Oriented Perspective, J Zhang, W Li, P Ogunbona, D Xu, ACM Computing Surveys, 2019 [PDF]


Conferences


Improving Long-Text Alignment for Text-to-Image Diffusion Models, L Liu, C Du, T Pang, Z Wang, C Li, D Xu, ICLR, 2025 [PDF] [Code]


Group-aware Parameter-efficient Updating for Content-adaptive Neural Video Compression, Z Chen, L Zhou, Z Hu, D Xu, ACM MM, 2024 [PDF]


Multi-modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation, X Li, Q Xu, J Zhang, T Zhang, Q Yu, L Sheng, D Xu, AAAI, 2024 [PDF]


Data-free Generalized Zero-shot Learning, B Tang, J Zhang, L Yan, Q Yu, L Sheng, D Xu, AAAI, 2024 [PDF]


UFDA: Universal Federated Domain Adaptation with Practical Assumptions, X Liu, Z Chen, L Zhou, D Xu, W Xi, G Bai, Y Zhao, J Zhao, AAAI, 2024 [PDF]


A Video is Worth 256 Bases: Spatial-temporal Expectation-maximization Inversion for Zero-shot Video Editing, M Li, Y Li, T Yang, Y Liu, D Yue, Z Lin, D Xu, CVPR, 2024 [PDF]


SVGDreamer: Text Guided SVG Generation with Diffusion Model, X Xing, H Zhou, C Wang, J Zhang, D Xu, Q Yu, CVPR, 2024 [PDF]


DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models, X Xing, C Wang, H Zhou, J Zhang, Q Yu, D Xu, NeurIPS, 2023 [PDF]


Distortion-aware Transformer in 360° Salient Object Detection, Y Zhao, L Zhao, Q Yu, L Sheng, J Zhang, D Xu, ACM MM, 2023 [PDF]


Neural Video Compression with Spatio-temporal Cross-covariance Transformers, Z Chen, L Relic, R Azevedo, Y Zhang, M Gross, D Xu, L Zhou, C Schroers, ACM MM, 2023 [PDF]


ICMH-Net: Neural Image Compression Towards Both Machine Vision and Human Vision, L Liu, Z Hu, Z Chen, D Xu, ACM MM, 2023 [PDF]


SRDAN: Scale-aware and Range-aware Domain Adaptation Network for Cross-dataset 3D Object Detection, W Zhang, W Li, D Xu, CVPR, 2021 [PDF]


Channel Pruning Guided by Classification Loss and Feature Importance, J Guo, W Ouyang, D Xu, AAAI, 2020 [PDF]


Content Adaptive and Error Propagation Aware Deep Video Compression, G Lu, C Cai, X Zhang, L Chen, W Ouyang, D Xu, Z Gao, ECCV, 2020 [PDF]


Multi-Dimensional Pruning: A Unified Framework for Model Compression, J Guo, W Ouyang, D Xu, CVPR, 2020 [PDF]