About Me
A Research group at Chongqing University of Posts and Telecommunications, China. Currently, we are focusing on the applications of vision (language)-to-language (vision). Part of the data and checkpoints are availabe at: https://huggingface.co/OpenFace-CQUPT.
Research Interests
- Deep Learning
- Computer Vision
- Vision-Language Model
OpenSource (Data & Checkpoints)
Part of Publications
Journals
- Dw Dai, Long Xu, Yutang Li, et al. Humanvlm: Foundation for human-scene vision-language model. Journal of Information Fusion, 2025 (IF=15.5, CAAI-A/中科院一区) code ~ checkpoint
- Dw Dai, Yuanhui Zhang, Qianlan Yang, et al. PathologyVLM: A Large Vision-Language Model for Pathology Image Understanding. Artificial Intelligence Review, 2025 (IF=13.9, CAAI-B/中科院一区) code ~ checkpoint
- Dw Dai, Fan Chen, Shuyin Xia, et al. An Adaptive Multi-Granularity Graph Representation of Image via Granular-Ball Computing. IEEE Transactions on Image Processing, 2025 (IF=13.7, CCF-A/CAAI-A/中科院一区) code
- Y Liu, Dw Dai(通讯), Guoyin Wang, et al. Multivariate Feedback-based Image-Text JointLearning for Sketch-less Facial Image Retrieva. IEEE Transactions on Circuits and Systems for Video Technology, 2025 (IF=11.1, CCF-B/CAAI-B/中科院一区)
- Dw Dai, YT Li, YG Liu, et al. 15m multimodal facial image-text dataset. arXiv preprint arXiv:2407.08515, 2024. code ~ data
- Dw Dai, S Fu, Y Liu and G Wang. Vision-Language Joint Representation Learning for Sketch Less Facial Image Retrieval. Journal of Information Fusion, 2024 (IF=15.5, CAAI-A/中科院一区) code
- Wang Y, Dw Dai(通讯), Liu D, et al. BTSC: Binary tree structure convolution layers for building interpretable decision‐making deep CNN. CAAI Transactions on Intelligence Technology, 2024 (IF=7.3, CAAI-B/中科院一区)
- Dw Dai, Liu Y, Li Y, et al. LGRL: Local-Global Representation Learning for On-the-Fly FG-SBIR. IEEE Transactions on Big Data, 2024 (IF=5.7/中科院二区) code
- Liu Y, Dw Dai(通讯), Zou K, et al. Prior semantic-embedding representation learning for on-the-fly FG-SBIR[J]. Expert Systems with Applications, 2024 (IF=7.5/中科院一区).
- Li D, Dw Dai(通讯), Chen J, et al. Ensemble learning framework for image retrieval via deep hash ranking. Knowledge-Based Systems, 2023 (IF=7.6/中科院一区)
- C Wang, Dw Dai(通讯), S Xia, et al. One-stage deep edge detection based on dense-scale feature fusion and pixel-level imbalance learning. IEEE Transactions on Artificial Intelligence, 2022 (CAAI-B) code
- Dw Dai, Li Y, Wang Y, et al. Rethinking the image feature biases exhibited by deep convolutional neural network models in image recognition. CAAI Transactions on Intelligence Technology, 2022 (IF=7.3, CAAI-B/中科院一区)
- Dw Dai, Xiaoyu Tang, Yingge Liu, et al. Multi-granularity Association Learning for On-the-fly Fine-grained Sketch-based Image Retrieval. Knowledge-Based Systems, 2022 (IF=7.6/中科院一区)
- Y Liu, Dw Dai (通讯), X Tang, et al. Bi-LSTM sequence modeling for on-the-fly fine-grained sketch-based image retrieval. IEEE Transactions on Artificial Intelligence, 2022 (CAAI-B)
- Dw Dai, Z Zhuang, J Wei, et al. Random sharing parameters in the global region of convolutional neural network. IEEE Transactions on Artificial Intelligence, 2021 (CAAI-B)
- Dw Dai, Chengfu Tang, GuoyinWang, and ShuyinXia. Building Partially Understandable Convolutional Neural Networks by Differentiating Class-Related Neural Nodes. Neurocomputing, 2021 (IF=6.5)
Conferences
- Dw Dai, Mingming Jia, Yinxiu Zhou, et al. Face-MakeUp: Multimodal Facial Prompts for Text-to-Image Generation. 28th European Conference on Artificial Intelligence (ECAI), 2025 (CCF-B) code
- Liu Y, Dw Dai(通讯), Hou X, et al. From Sparse to Complete: Semantic Understanding Based on Stroke Evolution in On-the-fly Sketch-based Image Retrieval. 34th International Joint Conference on Artificial Intelligence (IJCAI), 2025 (CCF-A/CAAI-A) [code]
- Dw Dai, Liu Y, Fu S, et al. Multimodal Image-Text Representation Learning for Sketch-Less Facial Image Retrieval. 2024 IEEE International Conference on Multimedia and Expo (ICME), 2024 (CCF-B, oral)
- Dw Dai, Zhang Y, Xu L, et al. Pa-llava: A large language-vision assistant for human pathology image understanding. 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024 (CCF-B) code
- Tang P, Dw Dai(通讯), Zou K, et al. GraphConvNet: A Dual Network Utilizing Local Features Coupled with Structural Information for Predicting Knee Osteoarthritis. 2024 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), 2024 (CCF-B) code
- Dw Dai, Li Y, Wang L, et al. Sketch less face image retrieval: A new challenge. 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2023 (CCF-B) code
Powered by Jekyll and Minimal Light theme.