Publication

For more publications, please refer to the supervisors’ homepage: Prof. Dong Xu

Journals

Unsupervised Part Discovery via Dual Representation Alignment, J Xia, W Huang, M Xu, J Zhang, H Zhang, Z Sheng, D Xu, IEEE TPAMI, 2024 [PDF] [Code]

3D Reconstruction From a Single Sketch via View-Dependent Depth Sampling, C Gao, X Wang, Q Yu, L Sheng, J Zhang, X Han, YZ Song, D Xu, IEEE TPAMI, 2024 [PDF] [Code]

Cross-domain detection transformer based on spatial-aware and semantic-aware token alignment, J Deng, X Zhang, W Li, L Duan, D Xu, IEEE TMM, 2023 [PDF]

An End-to-End Learning Framework for Video Compression, G Lu, X Zhang, W Ouyang, L Chen, Z Gao, D Xu, IEEE TPAMI, 2023 [PDF]

Cross-Dataset Point Cloud Recognition Using Deep-Shallow Domain Adaptation Network, F Wang, W Li, D Xu, IEEE TIP, 2021 [PDF]

Improving Weakly Supervised Temporal Action Localization by Exploiting Multi-Resolution Information in Temporal Domain, R Su, D Xu, L Zhou, W Ouyang, IEEE TIP, 2021 [PDF]

Dense Video Captioning Using Graph-Based Sentence Summarization, Z Zhang, D Xu, W Ouyang, L Zhou, IEEE TMM, 2021 [PDF]

Model Compression Using Progressive Channel Pruning, J Guo, W Zhang, W Ouyang, D Xu, IEEE TCSVT, 2021 [PDF]

Progressive Modality Cooperation for Multi-Modality Domain Adaptation, W Zhang, D Xu, J Zhang, W Ouyang, IEEE TIP, 2021 [PDF]

Progressive Cross-Stream Cooperation in Spatial and Temporal Domain for Action Localization, R Su, D Xu, L Zhou, W Ouyang, IEEE TPAMI, 2021 [PDF]

Self-Paced Collaborative and Adversarial Network for Unsupervised Domain Adaptation, W Zhang, W Ouyang, W Li, D Xu, IEEE TPAMI, 2021 [PDF]

Deep Non-Local Kalman Network for Video Compression Artifact Reduction, G Lu, X Zhang, W Ouyang, D Xu, L Chen, Z Gao, IEEE TIP, 2020 [PDF]

Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization, Z Zhang, D Xu, W Ouyang, C Tan, IEEE TCSVT, 2020 [PDF]

Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition: A Problem-Oriented Perspective, J Zhang, W Li, P Ogunbona, D Xu, ACM Computing Surveys, 2019 [PDF]

Conferences

Improving Long-Text Alignment for Text-to-Image Diffusion Models, L Liu, C Du, T Pang, Z Wang, C Li, D Xu, ICLR, 2025 [PDF] [Code]

Group-aware Parameter-efficient Updating for Content-adaptive Neural Video Compression, Z Chen, L Zhou, Z Hu, D Xu, ACM MM, 2024 [PDF]

Multi-modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation, X Li, Q Xu, J Zhang, T Zhang, Q Yu, L Sheng, D Xu, AAAI, 2024 [PDF]

Data-free Generalized Zero-shot Learning, B Tang, J Zhang, L Yan, Q Yu, L Sheng, D Xu, AAAI, 2024 [PDF]

UFDA: Universal Federated Domain Adaptation with Practical Assumptions, X Liu, Z Chen, L Zhou, D Xu, W Xi, G Bai, Y Zhao, J Zhao, AAAI, 2024 [PDF]

A Video is Worth 256 Bases: Spatial-temporal Expectation-maximization Inversion for Zero-shot Video Editing, M Li, Y Li, T Yang, Y Liu, D Yue, Z Lin, D Xu, CVPR, 2024 [PDF]

SVGDreamer: Text Guided SVG Generation with Diffusion Model, X Xing, H Zhou, C Wang, J Zhang, D Xu, Q Yu, CVPR, 2024 [PDF]

DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models, X Xing, C Wang, H Zhou, J Zhang, Q Yu, D Xu, NeurIPS, 2023 [PDF]

Distortion-aware Transformer in 360° Salient Object Detection, Y Zhao, L Zhao, Q Yu, L Sheng, J Zhang, D Xu, ACM MM, 2023 [PDF]

Neural Video Compression with Spatio-temporal Cross-covariance Transformers, Z Chen, L Relic, R Azevedo, Y Zhang, M Gross, D Xu, L Zhou, C Schroers, ACM MM, 2023 [PDF]

ICMH-Net: Neural Image Compression Towards Both Machine Vision and Human Vision, L Liu, Z Hu, Z Chen, D Xu, ACM MM, 2023 [PDF]

SRDAN: Scale-aware and Range-aware Domain Adaptation Network for Cross-dataset 3D Object Detection, W Zhang, W Li, D Xu, CVPR, 2021 [PDF]

Channel Pruning Guided by Classification Loss and Feature Importance, J Guo, W Ouyang, D Xu, AAAI, 2020 [PDF]

Content Adaptive and Error Propagation Aware Deep Video Compression, G Lu, C Cai, X Zhang, L Chen, W Ouyang, D Xu, Z Gao, ECCV, 2020 [PDF]

Multi-Dimensional Pruning: A Unified Framework for Model Compression, J Guo, W Ouyang, D Xu, CVPR, 2020 [PDF]