Enhanced Video BERT for Fast Video Advertisement Retrieval.

Yi Yang,Tan Yu,Jie Liu,Zhipeng Jin, Xuewu Jiao,Yi Li, Shuanglong Li,Ping Li

Big Data(2022)

引用 0|浏览9
暂无评分
摘要
Recently, video BERT based on cross-modal attention has achieved excellent performance in many cross-modal tasks in academia. Nevertheless, the expensive computation cost of cross-modal attention makes video BERT impractical for large-scale search in industrial applications. Inspired by the success of the tree-based deep model (TDM) in the recommendation system, we present a enhanced video BERT (EVB). It provides a practical solution to deploy the heavy video BERT for the large-scale query-to-video search. The proposed EVB overcomes the limitation of TDM relying on global features, and makes the tree structure based on a global feature compatible with version BERT using a set of local features. What’s more, we proposes a similarity-based dynamic construction to integrate the optimization of model efficiency. The proposed EVB has been deployed in our video advertising platform and brings a considerable boost in CVR and CTR for advertisers.
更多
查看译文
关键词
cross-modal attention,cross-modal tasks,enhanced video BERT,EVB,expensive computation cost,fast video advertisement retrieval,global feature,heavy video BERT,large-scale query-to-video search,large-scale search,similarity-based dynamic construction,tree-based deep model,version BERT,video advertising platform,video BERT impractical
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要