Hongchen Wei
I am currently a second-year Ph.D. student in School of Remote Sensing and Information Engineering from Wuhan University, under the supervision of Prof. Zhenzhong Chen.
I received my M.E. degree in School of Computer Science and Engineering from Nanjing University of Science and Technology, China, in 2023.
I received my B.Sc. degree in School of Materials Science and Engineering from Xi'an Shiyou University, China, in 2020.
Email  / 
Google Scholar  / 
Github
|
|
- Image/Video Captioning
- Long Video Understanding
- Spatial-Temporal Video Grounding
- Large Multimodal Model
-
[2024-10] We propose the visual context window extension for long video understanding that enables the direct and easy scaling of pre-trained LMMs to 1024 frames, and significantly reducing memory usage.
|