I am a Master’s degree student in Wuhan University of Technology.
I’m interested in research in computer vision, particularly video understanding. My research includes video Q&A, video-text retrieval, and video captioning. Meanwhile, I also have some research and practice in large language models, prompt engineering, basic operator development and optimization, knowledge graphs, and Q&A systems.
I hope to make a generalized multimodal video model that is affordable, secure and trustworthy for everyone.