Enhanced Video BERT for Fast Video Advertisement Retrieval.

Yi Yang, Tan Yu, Jie Liu, Zhipeng Jin, Xuewu Jiao, Yi Li, Shuanglong Li, Ping Li

Big Data (2022)

Abstract

Recently, video BERT based on cross-modal attention has achieved excellent performance on many cross-modal tasks in academia. Nevertheless, the expensive computation cost of cross-modal attention makes video BERT impractical for large-scale search in industrial applications. Inspired by the success of the tree-based deep model (TDM) in recommendation systems, we present an enhanced video BERT (EVB). It provides a practical solution for deploying the heavy video BERT in large-scale query-to-video search. The proposed EVB overcomes TDM's reliance on global features and makes a tree structure built on a global feature compatible with video BERT, which uses a set of local features. Moreover, we propose a similarity-based dynamic construction to integrate the optimization of model efficiency. The proposed EVB has been deployed in our video advertising platform and brings a considerable boost in CVR and CTR for advertisers.
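
The general idea of tree-based retrieval that the abstract builds on can be illustrated with a minimal sketch. This is not the authors' implementation: the names `Node`, `build_tree`, `beam_search`, and `max_leaf_score`, the toy relevance values, and the max-over-leaves node score are all illustrative assumptions. The scorer stands in for an expensive cross-modal model that is applied only to the nodes kept in the beam, rather than to every candidate video.

```python
import heapq
from dataclasses import dataclass, field
from typing import Callable, List, Optional


@dataclass
class Node:
    """Tree node; item_id is set only on leaves (one leaf per video ad)."""
    item_id: Optional[int] = None
    children: List["Node"] = field(default_factory=list)


def build_tree(item_ids: List[int], branching: int = 2) -> Node:
    """Build a balanced tree over a non-empty list of items.

    Assumption: item_ids is already ordered so that similar items are adjacent.
    """
    if len(item_ids) == 1:
        return Node(item_id=item_ids[0])
    size = -(-len(item_ids) // branching)  # ceiling division
    chunks = [item_ids[i:i + size] for i in range(0, len(item_ids), size)]
    return Node(children=[build_tree(chunk, branching) for chunk in chunks])


def beam_search(root: Node, score: Callable[[Node], float],
                beam_width: int, top_k: int) -> List[int]:
    """Descend level by level, keeping only the best beam_width nodes."""
    frontier, found = [root], []
    while frontier:
        next_level = []
        for node in frontier:
            if node.item_id is not None:
                found.append(node)          # reached a leaf
            else:
                next_level.extend(node.children)
        # The (expensive) scorer is called only on the children of kept nodes.
        frontier = heapq.nlargest(beam_width, next_level, key=score)
    return [n.item_id for n in heapq.nlargest(top_k, found, key=score)]


# Toy relevance of each video to a single query (made-up numbers).
relevance = [0.1, 0.9, 0.3, 0.2, 0.8, 0.05, 0.4, 0.6]


def max_leaf_score(node: Node) -> float:
    """Hypothetical node scorer: the best relevance among the node's leaves."""
    if node.item_id is not None:
        return relevance[node.item_id]
    return max(max_leaf_score(child) for child in node.children)


tree = build_tree(list(range(len(relevance))))
result = beam_search(tree, max_leaf_score, beam_width=2, top_k=2)
print(result)
```

In a real system the node score would come from a learned model rather than from known leaf relevances, so the beam search would be approximate and the beam width would trade recall against the number of scorer calls.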

Keywords

cross-modal attention, cross-modal tasks, enhanced video BERT, EVB, expensive computation cost, fast video advertisement retrieval, global feature, heavy video BERT, large-scale query-to-video search, large-scale search, similarity-based dynamic construction, tree-based deep model, video BERT, video advertising platform, video BERT impractical
