ActionNet-VE Dataset: A Dataset for Describing Visual Events by Extending VIRAT Ground 2.0

2015 8th International Conference on Signal Processing, Image Processing and Pattern Recognition (SIP), 2015

Abstract
This paper introduces a dataset for recognizing and describing interactive events between objects of interest, including persons, cars, bikes, and carried objects. Although there are many video datasets for human activity recognition, most focus on persons and their actions and often omit specific information about the related objects, such as their object types and minimum bounding boxes, from the annotations. The ActionNet-VE dataset was designed to include full annotations for all objects and events of interest that occur in a video clip, so that the semantics of each event can be described. The dataset adopts 75 video clips from VIRAT Ground 2.0 and extends their annotations with the events and their related objects. In addition, the dataset describes the semantics of each event using sentence elements such as verb, subject, and objects.
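The abstract does not specify the concrete annotation format, only that each event carries object types, minimum bounding boxes, and sentence-element semantics. The following is a minimal sketch of what one such per-event record could look like, assuming a simple dictionary layout; all field names, the clip identifier, and the coordinate values are illustrative assumptions, not the dataset's actual schema.

```python
# Hypothetical sketch of a single ActionNet-VE-style event annotation.
# Field names and values are assumptions for illustration only.

event_annotation = {
    "clip": "VIRAT_S_040103_00",             # hypothetical VIRAT Ground 2.0 clip id
    "event_type": "person_loading_object",   # interactive event of interest
    "frames": (1520, 1710),                  # assumed start/end frames of the event
    "objects": [
        # every participating object gets a type and a minimum bounding box (x, y, w, h)
        {"id": 1, "type": "person",         "bbox": (412, 188, 56, 130)},
        {"id": 2, "type": "car",            "bbox": (470, 200, 180, 95)},
        {"id": 3, "type": "carried_object", "bbox": (455, 230, 25, 20)},
    ],
    # event semantics expressed with sentence elements (verb, subject, objects)
    "semantics": {"subject": 1, "verb": "load", "object": 3, "indirect_object": 2},
}

# A natural-language description could then be rendered from the sentence elements:
types = {o["id"]: o["type"] for o in event_annotation["objects"]}
sem = event_annotation["semantics"]
print(f'{types[sem["subject"]]} {sem["verb"]}s {types[sem["object"]]} '
      f'into {types[sem["indirect_object"]]}')
# -> "person loads carried_object into car"
```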
Keywords
Video dataset, video interpretation, visual events, interactive events, VIRAT