Cross-Domain Detection Transformer Based on Spatial-Aware and Semantic-Aware Token Alignment, J Deng, X Zhang, W Li, L Duan, D Xu, IEEE TMM, 2023 [PDF]
An End-to-End Learning Framework for Video Compression, G Lu, X Zhang, W Ouyang, L Chen, Z Gao, D Xu, IEEE TPAMI, 2023 [PDF]
Cross-Dataset Point Cloud Recognition Using Deep-Shallow Domain Adaptation Network, F Wang, W Li, D Xu, IEEE TIP, 2021 [PDF]
Improving Weakly Supervised Temporal Action Localization by Exploiting Multi-Resolution Information in Temporal Domain, R Su, D Xu, L Zhou, W Ouyang, IEEE TIP, 2021 [PDF]
Dense Video Captioning Using Graph-Based Sentence Summarization, Z Zhang, D Xu, W Ouyang, L Zhou, IEEE TMM, 2021 [PDF]
Model Compression Using Progressive Channel Pruning, J Guo, W Zhang, W Ouyang, D Xu, IEEE TCSVT, 2021 [PDF]
Progressive Modality Cooperation for Multi-Modality Domain Adaptation, W Zhang, D Xu, J Zhang, W Ouyang, IEEE TIP, 2021 [PDF]
Progressive Cross-Stream Cooperation in Spatial and Temporal Domain for Action Localization, R Su, D Xu, L Zhou, W Ouyang, IEEE TPAMI, 2021 [PDF]
Self-Paced Collaborative and Adversarial Network for Unsupervised Domain Adaptation, W Zhang, W Ouyang, W Li, D Xu, IEEE TPAMI, 2021 [PDF]
Deep Non-Local Kalman Network for Video Compression Artifact Reduction, G Lu, X Zhang, W Ouyang, D Xu, L Chen, Z Gao, IEEE TIP, 2020 [PDF]
Show, Tell and Summarize: Dense Video Captioning Using Visual Cue Aided Sentence Summarization, Z Zhang, D Xu, W Ouyang, C Tan, IEEE TCSVT, 2020 [PDF]
Recent Advances in Transfer Learning for Cross-Dataset Visual Recognition: A Problem-Oriented Perspective, J Zhang, W Li, P Ogunbona, D Xu, ACM Computing Surveys, 2019 [PDF]
Group-aware Parameter-efficient Updating for Content-adaptive Neural Video Compression, Z Chen, L Zhou, Z Hu, D Xu, ACM MM, 2024 [PDF]
Multi-modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation, X Li, Q Xu, J Zhang, T Zhang, Q Yu, L Sheng, D Xu, AAAI, 2024 [PDF]
Data-free Generalized Zero-shot Learning, B Tang, J Zhang, L Yan, Q Yu, L Sheng, D Xu, AAAI, 2024 [PDF]
UFDA: Universal Federated Domain Adaptation with Practical Assumptions, X Liu, Z Chen, L Zhou, D Xu, W Xi, G Bai, Y Zhao, J Zhao, AAAI, 2024 [PDF]
A Video is Worth 256 Bases: Spatial-temporal Expectation-maximization Inversion for Zero-shot Video Editing, M Li, Y Li, T Yang, Y Liu, D Yue, Z Lin, D Xu, CVPR, 2024 [PDF]
SVGDreamer: Text Guided SVG Generation with Diffusion Model, X Xing, H Zhou, C Wang, J Zhang, D Xu, Q Yu, CVPR, 2024 [PDF]
DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models, X Xing, C Wang, H Zhou, J Zhang, Q Yu, D Xu, NeurIPS, 2023 [PDF]
Distortion-aware Transformer in 360° Salient Object Detection, Y Zhao, L Zhao, Q Yu, L Sheng, J Zhang, D Xu, ACM MM, 2023 [PDF]
Neural Video Compression with Spatio-temporal Cross-covariance Transformers, Z Chen, L Relic, R Azevedo, Y Zhang, M Gross, D Xu, L Zhou, C Schroers, ACM MM, 2023 [PDF]
ICMH-Net: Neural Image Compression Towards Both Machine Vision and Human Vision, L Liu, Z Hu, Z Chen, D Xu, ACM MM, 2023 [PDF]
SRDAN: Scale-aware and Range-aware Domain Adaptation Network for Cross-dataset 3D Object Detection, W Zhang, W Li, D Xu, CVPR, 2021 [PDF]
Channel Pruning Guided by Classification Loss and Feature Importance, J Guo, W Ouyang, D Xu, AAAI, 2020 [PDF]
Content Adaptive and Error Propagation Aware Deep Video Compression, G Lu, C Cai, X Zhang, L Chen, W Ouyang, D Xu, Z Gao, ECCV, 2020 [PDF]
Multi-Dimensional Pruning: A Unified Framework for Model Compression, J Guo, W Ouyang, D Xu, CVPR, 2020 [PDF]