Semi-supervised ensemble learning of data streams in the presence of concept drift

Ahmadi, Z ; Sharif University of Technology

  1. Type of Document: Article
  2. DOI: 10.1007/978-3-642-28931-6_50
  3. Abstract:
  4. Growing access to very large, non-stationary datasets in many real-world problems has made classical data mining algorithms impractical and created a need for new online classification algorithms. Online learning from data streams has several distinguishing features: sequential access to the data, limits on time and space complexity, and the occurrence of concept drift. Because data streams are effectively unbounded, it is hard to label every observed instance, so semi-supervised approaches are a better fit for the problem. In this paper we present a new semi-supervised ensemble learning algorithm for data streams. The algorithm uses the majority vote of its learners to label unlabeled instances. An empirical study demonstrates that the proposed algorithm is comparable with state-of-the-art semi-supervised online algorithms.
  5. Keywords:
  6. Concept Drift ; Semi-Supervised Learning ; Concept drifts ; Data mining algorithm ; Data sets ; Data stream ; Empirical studies ; Ensemble learning ; Ensemble learning algorithm ; Majority vote ; Nonstationary ; On-line algorithms ; On-line classification ; Online learning ; Real problems ; Semi-supervised ; Sequential access ; Space complexity ; Stream mining ; Classification (of information) ; Data communication systems ; Intelligent systems ; Learning algorithms ; Data mining
  7. Source: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) ; Volume 7209 LNAI, Issue PART 2 , 2012 , Pages 526-537 ; 03029743 (ISSN) ; 9783642289309 (ISBN)
  8. URL: http://link.springer.com/chapter/10.1007%2F978-3-642-28931-6_50
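
The abstract's central idea, labeling unlabeled stream instances by the majority vote of an ensemble, can be illustrated with a minimal sketch. This is not the authors' algorithm: the record does not describe the base learners, the update rule, or how concept drift is handled. Everything named below (`RunningCentroidLearner`, `majority_vote`, `process_stream`, and the round-robin assignment of labeled instances used to keep the learners diverse) is a hypothetical choice made only for illustration, and drift handling is omitted entirely.

```python
from collections import Counter


class RunningCentroidLearner:
    """Hypothetical online base learner: one running mean per class."""

    def __init__(self):
        self.sums = {}    # class label -> per-feature running sum
        self.counts = {}  # class label -> number of instances seen

    def update(self, x, y):
        """Incorporate one instance x with label y in a single pass."""
        if y not in self.sums:
            self.sums[y] = [0.0] * len(x)
            self.counts[y] = 0
        self.sums[y] = [s + v for s, v in zip(self.sums[y], x)]
        self.counts[y] += 1

    def predict(self, x):
        """Return the class with the nearest centroid, or None if untrained."""
        if not self.counts:
            return None

        def squared_distance(y):
            n = self.counts[y]
            return sum((v - s / n) ** 2 for v, s in zip(x, self.sums[y]))

        return min(self.counts, key=squared_distance)


def majority_vote(learners, x):
    """Most common prediction among trained learners, or None if none voted."""
    votes = Counter(
        p for p in (learner.predict(x) for learner in learners) if p is not None
    )
    return votes.most_common(1)[0][0] if votes else None


def process_stream(learners, stream):
    """Consume (x, y) pairs sequentially; y is None for unlabeled instances.

    Assumption for this sketch: each labeled instance trains one learner
    (round-robin), while each pseudo-labeled instance trains all learners.
    Returns the pseudo-labels assigned, in arrival order.
    """
    pseudo_labels = []
    labeled_seen = 0
    for x, y in stream:
        if y is not None:
            learners[labeled_seen % len(learners)].update(x, y)
            labeled_seen += 1
            continue
        voted = majority_vote(learners, x)
        if voted is None:
            continue  # no learner is trained yet; skip the instance
        pseudo_labels.append(voted)
        for learner in learners:
            learner.update(x, voted)
    return pseudo_labels
```

Each instance is visited once and only per-class sums and counts are stored, which reflects the sequential-access and bounded-memory constraints the abstract mentions. A practical system would also need some mechanism for reacting to concept drift, such as replacing or reweighting learners, which this sketch leaves out.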