Dai Dawei | College of Artificial Intelligence, Chongqing University of Posts and Telecommunications

About Me

A Research group at Chongqing University of Posts and Telecommunications, China. Currently, we are focusing on the applications of vision (language)-to-language (vision). Part of the data and checkpoints are availabe at: https://huggingface.co/OpenFace-CQUPT.

Research Interests

Deep Learning
Computer Vision
Vision-Language Model

OpenSource (Data & Checkpoints)

Part of Publications

Journals

Dw Dai, Long Xu, Yutang Li, et al. Humanvlm: Foundation for human-scene vision-language model. Journal of Information Fusion, 2025 (IF=15.5, CAAI-A/中科院一区) code ~ checkpoint
Y Liu, Dw Dai(通讯), Shuyin Xia et al. FDSRM: A Feature-driven Style-agnostic Foundation Model for Sketch-less Facial Image Retrieval. IEEE Transactions on Neural Networks and Learning Systems, 2025 (IF=8.9, CCF-B/中科院一区Top)
Dw Dai, Yuanhui Zhang, Qianlan Yang, et al. PathologyVLM: A Large Vision-Language Model for Pathology Image Understanding. Artificial Intelligence Review, 2025 (IF=13.9, CAAI-B/中科院一区) code ~ checkpoint
Dw Dai, Fan Chen, Shuyin Xia, et al. An Adaptive Multi-Granularity Graph Representation of Image via Granular-Ball Computing. IEEE Transactions on Image Processing, 2025 (IF=13.7, CCF-A/CAAI-A/中科院一区) code
Y Liu, Dw Dai(通讯), Guoyin Wang, et al. Multivariate Feedback-based Image-Text JointLearning for Sketch-less Facial Image Retrieva. IEEE Transactions on Circuits and Systems for Video Technology, 2025 (IF=11.1, CCF-B/CAAI-B/中科院一区)
Dw Dai, YT Li, YG Liu, et al. 15m multimodal facial image-text dataset. arXiv preprint arXiv:2407.08515, 2024. code ~ data
Dw Dai, S Fu, Y Liu and G Wang. Vision-Language Joint Representation Learning for Sketch Less Facial Image Retrieval. Journal of Information Fusion, 2024 (IF=15.5, CAAI-A/中科院一区) code
Wang Y, Dw Dai(通讯), Liu D, et al. BTSC: Binary tree structure convolution layers for building interpretable decision‐making deep CNN. CAAI Transactions on Intelligence Technology, 2024 (IF=7.3, CAAI-B/中科院一区)
Dw Dai, Liu Y, Li Y, et al. LGRL: Local-Global Representation Learning for On-the-Fly FG-SBIR. IEEE Transactions on Big Data, 2024 (IF=5.7/中科院二区) code
Liu Y, Dw Dai(通讯), Zou K, et al. Prior semantic-embedding representation learning for on-the-fly FG-SBIR[J]. Expert Systems with Applications, 2024 (IF=7.5/中科院一区).
Li D, Dw Dai(通讯), Chen J, et al. Ensemble learning framework for image retrieval via deep hash ranking. Knowledge-Based Systems, 2023 (IF=7.6/中科院一区)
C Wang, Dw Dai(通讯), S Xia, et al. One-stage deep edge detection based on dense-scale feature fusion and pixel-level imbalance learning. IEEE Transactions on Artificial Intelligence, 2022 (CAAI-B) code
Dw Dai, Li Y, Wang Y, et al. Rethinking the image feature biases exhibited by deep convolutional neural network models in image recognition. CAAI Transactions on Intelligence Technology, 2022 (IF=7.3, CAAI-B/中科院一区)
Dw Dai, Xiaoyu Tang, Yingge Liu, et al. Multi-granularity Association Learning for On-the-fly Fine-grained Sketch-based Image Retrieval. Knowledge-Based Systems, 2022 (IF=7.6/中科院一区)
Y Liu, Dw Dai (通讯), X Tang, et al. Bi-LSTM sequence modeling for on-the-fly fine-grained sketch-based image retrieval. IEEE Transactions on Artificial Intelligence, 2022 (CAAI-B)
Dw Dai, Z Zhuang, J Wei, et al. Random sharing parameters in the global region of convolutional neural network. IEEE Transactions on Artificial Intelligence, 2021 (CAAI-B)
Dw Dai, Chengfu Tang, GuoyinWang, and ShuyinXia. Building Partially Understandable Convolutional Neural Networks by Differentiating Class-Related Neural Nodes. Neurocomputing, 2021 (IF=6.5)

Conferences

Dw Dai, Mingming Jia, Yinxiu Zhou, et al. Face-MakeUp: Multimodal Facial Prompts for Text-to-Image Generation. 28th European Conference on Artificial Intelligence (ECAI), 2025 (CCF-B) code
Liu Y, Dw Dai(通讯), Hou X, et al. From Sparse to Complete: Semantic Understanding Based on Stroke Evolution in On-the-fly Sketch-based Image Retrieval. 34th International Joint Conference on Artificial Intelligence (IJCAI), 2025 (CCF-A/CAAI-A) [code]
Dw Dai, Liu Y, Fu S, et al. Multimodal Image-Text Representation Learning for Sketch-Less Facial Image Retrieval. 2024 IEEE International Conference on Multimedia and Expo (ICME), 2024 (CCF-B, oral)
Dw Dai, Zhang Y, Xu L, et al. Pa-llava: A large language-vision assistant for human pathology image understanding. 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024 (CCF-B) code
Tang P, Dw Dai(通讯), Zou K, et al. GraphConvNet: A Dual Network Utilizing Local Features Coupled with Structural Information for Predicting Knee Osteoarthritis. 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024 (CCF-B) code
Dw Dai, Li Y, Wang L, et al. Sketch less face image retrieval: A new challenge. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023 (CCF-B) code