Loading...

Data quality improvement using fuzzy association rules

Ghorbanpour Alizamini, F ; Sharif University of Technology | 2010

505 Viewed
  1. Type of Document: Article
  2. DOI: 10.1109/ICEIE.2010.5559676
  3. Publisher: 2010
  4. Abstract:
  5. The activities and decisions of organizations and companies are based on data and the information obtained from data analysis. Data quality plays a crucial role in data analysis, because the incorrect data leads to wrong decisions. Nowadays, improving the data quality manually is very difficult and in many cases is impossible as data quality is one of the complicated and non-structured concepts and data refinement process can not be done without the help of professional domain experts, and detection and correction of errors require a thorough knowledge in the related domain of the data. Thus, the necessity of using (semi-)automatic methods is discussed to find data defects and errors and solve them. Because data mining methods are designed to discover interesting patterns in datasets, we can use them efficiently to improve different dimensions of data quality. In this paper, a new method is presented to measure the accuracy dimension of data quality using fuzzy association rules. Finally, Experimental results of the proposed method show the effectiveness of the proposed method to find incorrect values in datasets
  6. Keywords:
  7. Automatic method ; Correction of errors ; Data analysis ; Data defects ; Data mining methods ; Data quality ; Data refinements ; Data sets ; Domain experts ; Fuzzy association rule ; Association rules ; Associative processing ; Data mining ; Data reduction ; Errors ; Fuzzy rules ; Quality control
  8. Source: ICEIE 2010 - 2010 International Conference on Electronics and Information Engineering, Proceedings, 1 August 2010 through 3 August 2010 ; Volume 1 , August , 2010 , Pages V1468-V1472 ; 9781424476800 (ISBN)
  9. URL: http://ieeexplore.ieee.org/document/5559676