A Flexible Architecture Using Temporal, Spatial and Semantic Correlation-Based Algorithms for Story Segmentation of Broadcast News.

IEEE ACM Trans. Audio Speech Lang. Process.(2023)

引用 0|浏览5
暂无评分
摘要
In this article, we propose a novel flexible architecture, with different algorithmic procedures, for effective story segmentation of broadcast news from subtitle files. The proposed system exploits spatial and temporal distance, as well as sentence similarity, to classify different stories in news broadcasts. The computational algorithms which form the architecture mainly focus on each sentence's features (temporal distance, spatial distance, and semantic similarity), and are combined to build an overall classifier. The first algorithm in the architecture focuses on the segmentation task, detecting boundaries between news. The second and third algorithms identify high semantic correlation between pieces of text, whether they are consecutive in space or not. Video Text Track (VTT) subtitle files are used to evaluate the performance of the proposed approach, although any file format that includes temporal information could also be considered. These VTT files may contain text errors and inaccuracies, and the proposed algorithms have been designed to deal with noisy content.
更多
查看译文
关键词
Natural language processing,correlation matrix,BERT,video text track
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要