An Infrastructure for Data Analysis Extraction in Distributed Systems, M.Sc. Thesis Sharif University of Technology ; Habibi, Jafar (Supervisor) ; Mirian Hosseinabadi, Hassan (Supervisor)
Abstract
In distributed systems, a huge amount of data is dispersed among different nodes; centralization of this data is infeasible due to communication and storage costs. In addition, Databases with high dimensional data objects are becoming more prevalent is many areas. When the dimensionality increases, the volume of the space increases so fast that the available data becomes sparse. This sparsity is problematic from many aspects. In order to obtain a statistically sound and reliable result, the amount of data needed to support the result often grows exponentially with the dimensionality. Also organizing and searching data often relies on detecting areas where objects form groups with similar...
Cataloging briefAn Infrastructure for Data Analysis Extraction in Distributed Systems, M.Sc. Thesis Sharif University of Technology ; Habibi, Jafar (Supervisor) ; Mirian Hosseinabadi, Hassan (Supervisor)
Abstract
In distributed systems, a huge amount of data is dispersed among different nodes; centralization of this data is infeasible due to communication and storage costs. In addition, Databases with high dimensional data objects are becoming more prevalent is many areas. When the dimensionality increases, the volume of the space increases so fast that the available data becomes sparse. This sparsity is problematic from many aspects. In order to obtain a statistically sound and reliable result, the amount of data needed to support the result often grows exponentially with the dimensionality. Also organizing and searching data often relies on detecting areas where objects form groups with similar...
Find in contentBookmark |
|