Cross-domain detection transformer based on spatial-aware and semantic-aware token alignment, J Deng, X Zhang, W Li, L Duan, D Xu, IEEE TMM, 2023 [PDF]
Group-aware Parameter-efficient Updating for Content-adaptive Neural Video Compression, Z Chen, L Zhou, Z Hu, D Xu, ACM MM, 2024 [PDF]
Multi-modality Affinity Inference for Weakly Supervised 3D Semantic Segmentation, X Li, Q Xu, J Zhang, T Zhang, Q Yu, L Sheng, D Xu, AAAI, 2024 [PDF]
Data-free Generalized Zero-shot Learning, B Tang, J Zhang, L Yan, Q Yu, L Sheng, D Xu, AAAI, 2024 [PDF]
UFDA: Universal Federated Domain Adaptation with Practical Assumptions, X Liu, Z Chen, L Zhou, D Xu, W Xi, G Bai, Y Zhao, J Zhao, AAAI, 2024 [PDF]
A Video is Worth 256 Bases: Spatial-temporal Expectation-maximization Inversion for Zero-shot Video Editing, M Li, Y Li, T Yang, Y Liu, D Yue, Z Lin, D Xu, CVPR, 2024 [PDF]
SVGDreamer: Text Guided SVG Generation with Diffusion Model, X Xing, H Zhou, C Wang, J Zhang, D Xu, Q Yu, CVPR, 2024 [PDF]
DiffSketcher: Text Guided Vector Sketch Synthesis through Latent Diffusion Models, X Xing, C Wang, H Zhou, J Zhang, Q Yu, D Xu, NeurIPS, 2023 [PDF]
Distortion-aware Transformer in 360° Salient Object Detection, Y Zhao, L Zhao, Q Yu, L Sheng, J Zhang, D Xu, ACM MM, 2023 [PDF]
Neural Video Compression with Spatio-temporal Cross-covariance Transformers, Z Chen, L Relic, R Azevedo, Y Zhang, M Gross, D Xu, L Zhou, C Schroers, ACM MM, 2023 [PDF]
ICMH-Net: Neural Image Compression Towards Both Machine Vision and Human Vision, L Liu, Z Hu, Z Chen, D Xu, ACM MM, 2023 [PDF]