Large-Scale Topical Analysis of Multiple Online News Sources with Media Cloud

semanticscholar(2014)

Abstract
Identifying topics in news, tracking their temporal dynamics, and understanding how different media sources cover them have important theoretical and practical implications for journalism researchers, producers, and consumers. The explosive growth of online news sources, however, suggests that scalable approaches to topical analysis are needed. We introduce our ongoing efforts to enable large-scale topical analysis of the Media Cloud corpus, a repository of over 200 million online news articles. Our initial experiments with 90 days of articles from 21 top media sources suggest that statistical topic modeling can identify reasonable news-related topics and produce interesting early insights into the online media ecosystem. We are currently examining mixed-initiative approaches to automate the process of topic extraction and increase the quality of the extracted topics. Finally, we discuss our further research directions on large-scale news monitoring and measurement as well as analysis tools for news consumers and producers.