Hongchen Wei

I am currently a second-year Ph.D. student in School of Remote Sensing and Information Engineering from Wuhan University, under the supervision of Prof. Zhenzhong Chen.

I received my M.E. degree in School of Computer Science and Engineering from Nanjing University of Science and Technology, China, in 2023.

I received my B.Sc. degree in School of Materials Science and Engineering from Xi'an Shiyou University, China, in 2020.

Email / Google Scholar / Github / CV

Main Research Interests

Image/Video Captioning
Long Video Understanding
Spatial-Temporal Video Grounding
Large Multimodal Model

Pre-prints

Training-Free Reasoning and Reflection in MLLMs
Hongchen Wei, Zhenzhong Chen
arXiv Preprint, 2025

LongCaptioning: Unlocking the Power of Long Caption Generation in Large Multimodal Models
Hongchen Wei, Zhihong Tan, Yaosi Hu, Chang Wen Chen, Zhenzhong Chen
arXiv Preprint, 2025

LOP: Learning Optimal Pruning for Efficient On-Demand MLLMs Scaling
Zhihan Zhang, Xiang Pan, Hongchen Wei, Zhenzhong Chen
arXiv Preprint, 2025

RSFAKE-1M: A Large-Scale Dataset for Detecting Diffusion-Generated Remote Sensing Forgeries
Zhihong Tan, Jiayi Wang, Huiying Shi, Binyuan Huang, Hongchen Wei, Zhenzhong Chen
arXiv Preprint, 2025

Publications

Visual Context Window Extension: A New Perspective for Long Video Understanding
Hongchen Wei, Zhenzhong Chen
ACM MM (CCF-A 会议), 2025
Project page

RealVG: Unleashing MLLMs for Training-Free Spatio-Temporal Video Grounding in the Wild
Hongchen Wei, Zhenzhong Chen
ACM MM (CCF-A 会议), 2025

Remote Sensing Semantic Segmentation Quality Assessment based on Vision Language Model
Huiying Shi, Zhihong Tan, Zhihan Zhang, Hongchen Wei, Yaosi Hu, Yingxue Zhang, Zhenzhong Chen
TGRS (CCF-B 期刊), 2025

Improving Generalization of Image Captioning with Unsupervised Prompt Learning
Hongchen Wei, Zhenzhong Chen
TOMM (CCF-B 期刊), 2024

Exploiting Cross-Modal Prediction and Relation Consistency for Semisupervised Image Captioning
Yang Yang, Hongchen Wei , Hengshu Zhu, Dianhai Yu, Hui Xiong, Jian Yang
TCYB (CCF-B 期刊), 2022 (学生一作)
Code

S2OSC: A Holistic Semi-Supervised Approach for Open Set Classification
Yang Yang, Hongchen Wei , Zhenqiang Sun, Guangyu Li, Yuanchun Zhou, Hui Xiong, Jian Yang
TKDD (CCF-B 期刊), 2021 (学生一作)

Activities

Reviewer: ICLR25, CVPR25, TNNLS

Last updated in Jun. 2025.

Homepage credits: Jon Barron.