Evaluation of distributed stream processing frameworks for IoT applications in smart cities

Nasiri, H ; Sharif University of Technology | 2019

178 Viewed
  1. Type of Document: Article
  2. DOI: 10.1186/s40537-019-0215-2
  3. Publisher: SpringerOpen , 2019
  4. Abstract:
  5. The widespread growth of Big Data and the evolution of Internet of Things (IoT) technologies enable cities to obtain valuable intelligence from a large amount of real-time produced data. In a Smart City, various IoT devices generate streams of data continuously which need to be analyzed within a short period of time; using some Big Data technique. Distributed stream processing frameworks (DSPFs) have the capacity to handle real-time data processing for Smart Cities. In this paper, we examine the applicability of employing distributed stream processing frameworks at the data processing layer of Smart City and appraising the current state of their adoption and maturity among the IoT applications. Our experiments focus on evaluating the performance of three DSPFs, namely Apache Storm, Apache Spark Streaming, and Apache Flink. According to our obtained results, choosing a proper framework at the data analytics layer of a Smart City requires enough knowledge about the characteristics of target applications. Finally, we conclude each of the frameworks studied here have their advantages and disadvantages. Our experiments show Storm and Flink have very similar performance, and Spark Streaming, has much higher latency, while it provides higher throughput. © 2019, The Author(s)
  6. Keywords:
  7. Distributed stream processing ; IoT applications ; Latency ; Smart City ; Throughput
  8. Source: Journal of Big Data ; Volume 6, Issue 1 , 2019 ; 21961115 (ISSN)
  9. URL: https://journalofbigdata.springeropen.com/articles/10.1186/s40537-019-0215-2