章勇,现任深圳大学人工智能学院助理教授,硕士生导师,主要研究方向为视觉-语言多模态学习、计算机视觉、大模型及智能体。他本科毕业于西安交通大学自动化系(导师:沈超教授),在香港中文大学(深圳)取得博士学位(导师:黄锐教授、梅涛博士和陈长汶教授),曾在大疆担任软件工程师,在中国科学院与深信服兼任博士后和高级算法工程师。他在人工智能领域国际顶级会议与期刊(CVPR、TMM、AAAI、TIFS等)发表多篇论文,深度参与构建中国首个网络安全大模型—深信服安全GPT、面向严肃开发场景的CoStrict AI编程智能体。曾荣获日内瓦国际发明展金奖,具备丰富的学术研究和工程实战经验。更多请参见 英文学术主页。
>> 欢迎学术合作,欢迎本科同学进组学习,感兴趣者请发送简历到我邮箱!
研究兴趣
其他
🔬 个人特点 - 长期扎根在AI前沿,“科研+工程”双栖选手
💎 学生培养
🌟 招生要求
靠谱:能力范围的事能干好,不掉链子;能力范围外的,我们一起想办法解决;
自驱:对科研有发自内心的热情,享受自我进步的快乐;
野心:想干成点事儿,想在AI浪潮中留下自己的足迹。
代表性成果
会议论文:
[CCF-B] Qiyou Liu, Yong Zhang, Jianjie Luo, Zhengguo Yang, Yu Yi. "Boosting Knowledge-based Visual Question Answering with Structured Context Reasoning.'' In IEEE International Conference on Multimedia and Expo (ICME), 2026.
[CCF-A] Meng Meng, Zichang Tan, Yong Zhang, and Xu Zhou, "Appearance-Motion Decomposed Alignment for Text-Video Retrieval." In Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2026.
[CCF-C] Yong Zhang, Rui Zhu, Shifeng Zhang, Xu Zhou, Shifeng Chen, and Xiaofan Chen. "Feature Augmentation for Self-supervised Contrastive Learning: A Closer Look." In IEEE International Joint Conference on Neural Networks (IJCNN), 2024.
[CCF-A] Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, and Chang-Wen Chen. "Learning to Generate Language-supervised and Open-vocabulary Scene Graph using Pre-trained Visual-Semantic Space." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023. [Code]
[CCF-A] Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, and Chang-Wen Chen. "Exploring Structure-aware Transformer over Interaction Proposals for Human-Object Interaction Detection." In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022. [Code]
[CCF-C] Chao Shen, Yong Zhang, Zhongmin Cai, Tianwen Yu, and Xiaohong Guan. "Touch-interaction behavior for continuous user authentication on smartphones." In IEEE International Conference on Biometrics (ICB), 2015.
期刊论文:
[CCF-A] Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, and Chang-Wen Chen. "End-to-End Video Scene Graph Generation with Temporal Propagation Transformer." IEEE Transactions on Multimedia (TMM), 2023. [Code]
[CCF-B] Yong Zhang, Yingwei Pan, Ting Yao, Rui Huang, Tao Mei, and Chang-Wen Chen. "Boosting Scene Graph Generation with Visual Relation Saliency." ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), 2023.
[CCF-A] Chao Shen, Yong Zhang, Xiaohong Guan, and Roy A. Maxion. "Performance analysis of touch-interaction behavior for active smartphone authentication." IEEE Transactions on Information Forensics and Security (TIFS) 11, no. 3 (2015): 498-513.