A modular software architecture for processing of big geospatial data in the cloud

Michel Krämer, Ivo Senner

Computers & Graphics(2015)

引用 78|浏览71
暂无评分
摘要
In this paper we propose a software architecture that allows for processing of large geospatial data sets in the cloud. Our system is modular and flexible and supports multiple algorithm design paradigms such as MapReduce, in-memory computing or agent-based programming. It contains a web-based user interface where domain experts (e.g. GIS analysts or urban planners) can define high-level processing workflows using a domain-specific language (DSL). The workflows are passed through a number of components including a parser, interpreter, and a service called job manager. These components use declarative and procedural knowledge encoded in rules to generate a processing chain specifying the execution of the workflows on a given cloud infrastructure according to the constraints defined by the user. The job manager evaluates this chain, spawns processing services in the cloud and monitors them. The services communicate with each other through a distributed file system that is scalable and fault-tolerant. Compared to previous work describing cloud infrastructures and architectures we focus on the processing of big heterogeneous geospatial data. In addition to that, we do not rely on only one specific programming model or a certain cloud infrastructure but support several ones. Combined with the possibility to control the processing through DSL-based workflows, this makes our architecture very flexible and configurable. We do not only see the cloud as a means to store and distribute large data sets but also as a way to harness the processing power of distributed computing environments for large-volume geospatial data sets. The proposed architecture design has been developed for the IQmulus research project funded by the European Commission. The paper concludes with the evaluation results from applying our solution to two example workflows from this project. Graphical abstractDisplay Omitted HighlightsWe present a cloud-based architecture for the processing of large-volume geospatial data.Our architecture is modular and supports a large number of geoprocessing algorithms.Compared to other work it is also configurable and can be deployed to multiple platforms.Geoprocesses can be controlled through a flexible domain-specific language.We make use of a rule-based system for process mapping and node configuration.
更多
查看译文
关键词
Cloud computing,Big Data,Geoprocessing,Distributed systems,Software architectures,Domain-specific languages
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要