publications

Research outputs by year, newest first.

This page is organized like an academic portfolio rather than a raw paper list. Full BibTeX is available here.

2026

ECCV 2026

RiO-DETR: DETR for Real-time Oriented Object Detection

Z. Hu, Y. Zhao, Y. Peng, W. Sun, et al.

paper
ICML 2026

Beyond Logits: Coherent Hallucination Mitigation via Attention Contrastive Decoding

Yujia Chen, Rui Sun, Wangkai Li, Huayu Mai, Bingzhou Wang, Zhangyu He, Aibing Li, Wenzhang Sun, Tianzhu Zhang.

ICML 2026

Beyond Blind Noising: Disentangled Visual Rectification for Hallucination Mitigation in MLLMs

Yujia Chen, Rui Sun, Zhaoyang Li, Wangkai Li, Huayu Mai, Bingzhou Wang, Aibing Li, Wenzhang Sun.

arXiv 2026

Preserve, Reveal, Expand: Faithful 4D Video Editing with Region-Aware Conditioning

Zhangchi Hu, Wenzhang Sun, Xiangchen Yin, Jiahui Yuan, Chunfeng Wang, Hao Li, Kun Zhan, Xiaoyan Sun. Project Leader.

paper · project
arXiv 2026

Stable Curves, Unstable Items: Item-Level Scaling Heterogeneity in Video LLMs

Wenzhang Sun, Chunfeng Wang, Xiangchen Yin, Yujia Chen, Hao Li, Kun Zhan.

code
ICASSP 2026

DrivingScene: A Multi-Task Online Feed-Forward 3D Gaussian Splatting Method for Dynamic Driving Scenes

Q. Hou, W. Sun, C. Zeng, C. Wang, H. Li, J. Cui.

paper
ICASSP 2026

PAGS: Priority-Adaptive Gaussian Splatting for Dynamic Driving Scenes

A. Ying, W. Sun, C. Zeng, C. Wang, H. Li, J. Cui.

paper
arXiv 2026

MUSE: A Multi-agent Framework for Unconstrained Story Envisioning via Closed-Loop Cognitive Orchestration

W. Sun, Z. Wang, Z. Hu, C. Wang, H. Li, W. Chen.

paper · project

2025

MMAsia 2025

UniCP: A Unified Caching and Pruning Framework for Efficient Video Generation

W. Sun, Q. Hou, D. Di, J. Yang, Y. Ma, J. Cui.

paper
arXiv 2025

DeCo-VAE: Learning Compact Latents for Video Reconstruction via Decoupled Representation

X. Yin, J. Yuan, Z. Hu, W. Sun, et al.

paper
TPAMI 2025

TV-3DG: Mastering Text-to-3D Customized Generation with Visual Prompt

Jiahui Yang, Donglin Di, Baorui Ma, Jianxun Cui, Xun Yang, Yongjia Ma, Wenzhang Sun, Wei Chen, Zhou Xue, Meng Wang, Yebin Liu.

paper
ICCV 2025

FaceVid-1K: A Large-Scale High-Quality Multiracial Human Face Video Dataset

D. Di, H. Feng, W. Sun, Y. Ma, et al.

project
arXiv 2025

Hi-VAE: Efficient Video Autoencoding with Global and Detailed Motion

H. Liu*, W. Sun*, Q. Zhang, D. Di, et al.

paper
arXiv 2025

ChronoTailor: Harnessing Attention Guidance for Fine-Grained Video Virtual Try-On

J. Wang*, W. Sun*, M. Li, Y. Zheng, et al.

paper
CVPR 2025

MoEE: Mixture of Emotion Experts for Audio-Driven Portrait Animation

H. Liu, W. Sun, D. Di, S. Sun, J. Yang, C. Zou, H. Bao.

paper
arXiv 2025

A Self-supervised Motion Representation for Portrait Video Generation

Q. Zhang, C. Wu, W. Sun, H. Liu, et al.

paper

2024 and earlier

arXiv 2024

UniAvatar: Taming Lifelike Audio-Driven Talking Head Generation with Comprehensive Motion and Lighting Control

W. Sun, X. Li, D. Di, Z. Liang, Q. Zhang, H. Li, W. Chen, J. Cui.

paper
ICCV 2023

Neural Reconstruction of Relightable Human Model from Monocular Video

W. Sun, Y. Che, H. Huang, Y. Guo.

paper
CVIU 2022

Estimating 3D Body Mesh without SMPL Annotations via Alternating Successive Convex Approximation

W. Sun, L. Wang, S. Ma, Q. Ma.