CV
Summary
Master's student at the Institute of Computing Technology, Chinese Academy of Sciences. Research focuses on computer vision and vision-language models.
Education
- Computer Science (Present), Institute of Computing Technology, Chinese Academy of Sciences
- Cyber Science and Engineering (2024.6), Huazhong University of Science and Technology
Work Experience
- Research Intern (2025.4 - 2025.9), Qwen Team, Alibaba Cloud. Core contributor to Qwen3-VL. Contributed to multimodal positional encoding research, inference infrastructure, and model release.
Publications
- Revisiting Multimodal Positional Encoding in Vision-Language Models (2026). Comprehensive analysis of multimodal RoPE in VLMs. Accepted by ICLR 2026.
- Qwen3-VL Technical Report (2025). The most capable vision-language model in the Qwen series.
- RefHCM: A Unified Model for Referring Perceptions in Human-Centric Scenarios (2025). A unified framework for human-centric referring perception tasks. Accepted by TMM 2025.
- Stealthy and Effective Physical Adversarial Attacks in Autonomous Driving (2024). Physical adversarial attack methods targeting perception systems in autonomous driving. Accepted by TIFS 2024.