LSECA: Local Semantic Enhancement and Cross Aggregation for Video-Text Retrieval
INTERNATIONAL JOURNAL OF MULTIMEDIA INFORMATION RETRIEVAL(2024)
Key words
Video-text retrieval,Semantic enhancement,Cross aggregation,Multi-grained contrast
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined