谷歌浏览器插件
订阅小程序
在清言上使用

Magnifier: Online Detection of Performance Problems in Large-Scale Cloud Computing Systems

Services Computing(2011)

引用 27|浏览3
暂无评分
摘要
In large-scale cloud computing systems, even a simple user request may go through numerous of services that are deployed on different physical machines. As a result, it is a great challenge to online localize the prime causes of performance degradation in such systems. Existing end-to-end request tracing approaches are not suitable for online anomaly detection because their time complexity is exponential in the size of the trace logs. In this paper, we propose an approach, namely Magnifier, to rapidly diagnose the source of performance degradation in large-scale non-stop cloud systems. In Magnifier, the execution path graph of a user request is modeled by a hierarchical structure including component layer, module layer and function layer, and anomalies are detected from higher layer to lower layer separately. In each layer every node is assigned a newly created identifier in addition to the global identifier of the request, which significantly decreases the size of parsing trace logs and accelerates the anomaly detection process. We conduct extensive experiments over a real-world enterprise system (the Alibaba cloud computing platform) providing services for the public. The results show that Magnifier can locate the prime causes of performance degradation more accurately and efficiently.
更多
查看译文
关键词
real-world enterprise system,simple user request,prime cause,magnifier,large-scale cloud computing systems,function layer,execution path graph,component layer,hierarchical structure,end-to-end request,online anomaly detection,user request,performance degradation,large-scale systems,module layer,performance problems,graph theory,online operation,online detection,cloud computing,lower layer,real-time systems,security of data,higher layer,degradation,real time systems,principal component analysis,servers,time complexity,indexation,enterprise system,anomaly detection,fluctuations,indexes
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要