Zhejiang University & Shanghai Innovation Institute
Ph.D. in Artificial Intelligence, College of Computer Science and Technology
Ph.D. Student
Zhejiang University & Shanghai Innovation Institute
lichenglin@sii.edu.cn
12421234@zju.edu.cn
I work on large language models, long-video understanding, multimodal evaluation, and agentic llm systems.
I am a Ph.D. student jointly affiliated with Zhejiang University and Shanghai Innovation Institute. My research focuses on multimodal large language models, long-video understanding, and agentic reasoning.
I am advised by Jiaqi Wang and Yin Zhang.
Ph.D. in Artificial Intelligence, College of Computer Science and Technology
M.S. in Artificial Intelligence, School of Software Technology
Research areas: large language models, video understanding, and agents
B.S. in Computer Science and Technology
GPA: 4.05/5.00, Rank: 13/221, CET-4: 561, CET-6: 479
Developed VideoThinker, a VideoLLM framework that turns long-video understanding into an agentic retrieval-and-zoom reasoning problem.
Studied how VideoLLMs can combine direct multimodal reasoning with program-based tool use for complex video queries.
Built a controllable benchmark for evaluating symbolic, abstract, and high-level cognitive reasoning in video understanding.
Studied instruction data synthesis with tree search to improve data quality for low-resource alignment.
Proposed a mixed distillation framework for transferring reasoning supervision from strong LLMs to smaller, deployable language models.
Worked on the full research pipeline for multimodal foundation models, including data, training, and evaluation.
Worked on video understanding systems for user intent modeling in video creation scenarios.
Worked on experience-aware content modeling for search scenarios requiring subjective or experiential evidence.
Worked on post-training and evaluation for enterprise dialogue models, with a focus on data quality and efficient deployment.