Loading...
Search for: clustering-algorithms
0.009 seconds
Total 136 records

    Web page clustering using harmony search optimization

    , Article IEEE Canadian Conference on Electrical and Computer Engineering, CCECE 2008, Niagara Falls, ON, 4 May 2008 through 7 May 2008 ; 2008 , Pages 1601-1604 ; 08407789 (ISSN) ; 9781424416431 (ISBN) Forsati, R ; Mahdavi, M ; Kangavari, M ; Safarkhani, B ; Sharif University of Technology
    2008
    Abstract
    Clustering has become an increasingly important task in modern application domains. Targeting useful and relevant information on the World Wide Web is a topical and highly complicated research area. Clustering techniques have been applied to categorize documents on web and extracting knowledge from the web. In this paper we propose novel clustering algorithms based on Harmony Search (HS) optimization method that deals with web document clustering. By modeling clustering as an optimization problem, first, we propose a pure HS based clustering algorithm that finds near global optimal clusters within a reasonable time. Then we hybridize K-means and harmony clustering to achieve better... 

    A novel pre-processing method to reduce noise effects in a prototype-based clustering algorithm

    , Article 2008 International Conference on Information and Knowledge Engineering, IKE 2008, Las Vegas, NV, 14 July 2008 through 17 July 2008 ; July , 2008 , Pages 587-593 ; 1601320752 (ISBN); 9781601320759 (ISBN) Taghikhaki, Z ; Minaei, B ; Masoum, A ; Sharif University of Technology
    2008
    Abstract
    In this paper we introduce a preprocessing method to reduce noise effects in noise prone environments. Prototype based clustering algorithms are sensitive to noise because the effect of noisy data are as same as effect of true data and this affects on calculation of clusters center and then reduces accuracy. Therefore, these algorithms can not be applied in noise-prone environments and if this is applied there, we can not trust to the results. To overcome such problems we reduce and in some cases eliminate the noisy data. Also a part of our method is applied on the source of generated data in a network. Then noisy data that the number of them is high in noisy environments are eliminated and... 

    A novel semi-supervised clustering algorithm for finding clusters of arbitrary shapes

    , Article 13th International Computer Society of Iran Computer Conference on Advances in Computer Science and Engineering, CSICC 2008, Kish Island, 9 March 2008 through 11 March 2008 ; Volume 6 CCIS , 2008 , Pages 876-879 ; 18650929 (ISSN); 3540899847 (ISBN); 9783540899846 (ISBN) Soleymani Baghshah, M ; Bagheri Shouraki, S ; Sharif University of Technology
    2008
    Abstract
    Recently, several algorithms have been introduced for enhancing clustering quality by using supervision in the form of constraints. These algorithms typically utilize the pair wise constraints to either modify the clustering objective function or to learn the clustering distance measure. Very few of these algorithms show the ability of discovering clusters of different shapes along with satisfying the provided constraints. In this paper, a novel semi-supervised clustering algorithm is introduced that uses the side information and finds clusters of arbitrary shapes. This algorithm uses a two-stage clustering approach satisfying the pair wise constraints. In the first stage, the data points... 

    A novel clustering algorithm based on circlusters to find arbitrary shaped clusters

    , Article 2008 International Conference on Computer and Electrical Engineering, ICCEE 2008, Phuket, 20 December 2008 through 22 December 2008 ; January , 2008 , Pages 619-624 ; 9780769535043 (ISBN) Hassas Yeganeh, S ; Habibi, J ; Abolhassani, H ; Shirali Shahreza, S ; Sharif University of Technology
    2008
    Abstract
    Clustering is the problem of partitioning a (large) set of data using unsupervised techniques. Today, there exist many clustering techniques. The most important characteristic of a clustering technique is the shape of the cluster it can find. In this paper, we propose a method that is capable to find arbitrary shaped clusters and uses simple geometric constructs, Circlusters. Circlusters are different radius sectored circles. Circlusters can be used to create many hybrid approaches in mixture with density based or partitioning based methods. We also proposed two new clustering methods that are capable to find complex clusters in O(n), where n is the size of the data set. Both of the methods... 

    Energy adaptive cluster-head selection for wireless sensor networks using center of energy mass

    , Article 13th International Computer Society of Iran Computer Conference on Advances in Computer Science and Engineering, CSICC 2008, Kish Island, 9 March 2008 through 11 March 2008 ; Volume 6 CCIS , 2008 , Pages 130-137 ; 18650929 (ISSN); 3540899847 (ISBN); 9783540899846 (ISBN) Akhtarkavan, E ; Manzuri Shalmani, M. T ; Sharif University of Technology
    2008
    Abstract
    A set of small battery-operated sensors with low-power transceivers that can automatically form a network and collect some desired physical characteristics of the environment is called a wireless sensor network. The communications must be designed to conserve the limited energy resources of the sensors [14].By clustering sensors we can save energy. In this paper, we introduce a new concept called "Center of Energy Mass" which is a combination of both energy level and location of the nodes which is used to form the new factor of "distance of the nodes to the CEM ".Distance of the nodes to the CEM is used together with Probability Density Function of the normal distribution in optimizing... 

    Hybridization of k-means and harmony search methods for web page clustering

    , Article Proceedings - 2008 IEEE/WIC/ACM International Conference on Web Intelligence, WI 2008, 9 December 2008 through 12 December 2008, Sydney, NSW ; 2008 , Pages 329-335 ; 9780769534961 (ISBN) Forsati, R ; Meybodi, M. R ; Mahdavi, M ; Ghari Neiat, A ; Sharif University of Technology
    2008
    Abstract
    Clustering is currently one of the most crucial techniques for dealing with massive amount of heterogeneous information on the web, which is beyond human being's capacity to digest. Recent studies have shown that the most commonly used partitioning-based clustering algorithm, the K-means algorithm, is more suitable for large datasets. However, the K-means algorithm can generate a local optimal solution. In this paper we present novel harmony search clustering algorithms that deal with documents clustering based on harmony search optimization method. By modeling clustering as an optimization problem, first, we propose a pure harmony search based clustering algorithm that finds near global... 

    Improving density-based methods for hierarchical clustering of web pages

    , Article Data and Knowledge Engineering ; Volume 67, Issue 1 , 2008 , Pages 30-50 ; 0169023X (ISSN) Haghir Chehreghani, M ; Abolhassani, H ; Haghir Chehreghani, M ; Sharif University of Technology
    2008
    Abstract
    The rapid increase of information on the web makes it necessary to improve information management techniques. One of the most important techniques is clustering web data. In this paper, we propose a new 3-phase clustering method that finds dense units in a data set using density-based algorithms. The distances in the dense units are stored in order in structures such as a min heap. In the extraction stage, these distances are extracted one by one, and their effects on the clustering process are examined. Finally, in the combination stage, clustering is completed using improved versions of well-known single and average linkage methods. All steps of the methods are performed in O(n log n) time... 

    Finding arbitrary shaped clusters and color image segmentation

    , Article 1st International Congress on Image and Signal Processing, CISP 2008, Sanya, Hainan, 27 May 2008 through 30 May 2008 ; Volume 1 , 2008 , Pages 593-597 ; 9780769531199 (ISBN) Soleymani Baghshah, M ; Bagheri Shouraki, S ; Sharif University of Technology
    2008
    Abstract
    One of the most famous approaches for the segmentation of color images is finding clusters in the color space. Shapes of these clusters are often complex and the time complexity of the existing algorithms for finding clusters of different shapes is usually high. In this paper, a novel clustering algorithm is proposed and used for the image segmentation purpose. This algorithm distinguishes clusters of different shapes using a two-stage clustering approach in a reasonable time. In the first stage, the mean-shift clustering algorithm is used and the data points are grouped into some sub-clusters. In the second stage, connections between sub-clusters are established according to a dissimilarity... 

    A POS-based fuzzy word clustering algorithm for continuous speech recognition systems

    , Article 2007 9th International Symposium on Signal Processing and its Applications, ISSPA 2007, Sharjah, 12 February 2007 through 15 February 2007 ; 2007 ; 1424407796 (ISBN); 9781424407798 (ISBN) Momtazi, S ; Sameti, H ; Bahrani, M ; Hafezi, N ; Sharif University of Technology
    2007
    Abstract
    Using word base n-gram language models in continuous speech recognition systems is so prevalent. For using this type of language models, we should extract them from large corpora. Since Persian corpora are not rich, therefore the extracted language models are not credible. For this reason, most researchers extract class n-grams instead of finding word n-grams. In this research a new idea for fuzzy word clustering is represented that each word can be assigned to more that one class. The Fuzzy c-mean algorithm is used for our clustering method and we have examined its various parameters of it. Finally, this algorithm was applied on 20000 most frequent Persian words extracted from "Persian Text... 

    Neuro-fuzzy surface EMG pattern recognition for multifunctional hand prosthesis control

    , Article 2007 IEEE International Symposium on Industrial Electronics, ISIE 2007, Caixanova - Vigo, 4 June 2007 through 7 June 2007 ; 2007 , Pages 269-274 ; 1424407559 (ISBN); 9781424407552 (ISBN) Khezri, M ; Jahed, M ; Sadati, N ; Sharif University of Technology
    2007
    Abstract
    Electromyogram (EMG) signal is an electrical manifestation of muscle contractions. EMG signal collected from surface of the skin, a non-invasive bioelectric signal, can be used in different rehabilitation applications and artificial extremities control. This study has proposed to utilize the surface EMG (SEMG) signal to recognize patterns of hand prosthesis movements. It suggests using an adaptive neuro-fuzzy inference system (ANFIS) to identify motion commands for the control of a prosthetic hand. In this work a hybrid method for training fuzzy system, consisting of back-propagation (BP) and least mean square (LMS) is utilized. Also in order to optimize the number of fuzzy rules, a... 

    Real-time intelligent pattern recognition algorithm for surface EMG signals

    , Article BioMedical Engineering Online ; Volume 6 , 3 December , 2007 ; 1475925X (ISSN) Khezri, M ; Jahed, M ; Sharif University of Technology
    2007
    Abstract
    Background: Electromyography (EMG) is the study of muscle function through the inquiry of electrical signals that the muscles emanate. EMG signals collected from the surface of the skin (Surface Electromyogram: sEMG) can be used in different applications such as recognizing musculoskeletal neural based patterns intercepted for hand prosthesis movements. Current systems designed for controlling the prosthetic hands either have limited functions or can only be used to perform simple movements or use excessive amount of electrodes in order to achieve acceptable results. In an attempt to overcome these problems we have proposed an intelligent system to recognize hand movements and have provided... 

    Solving haplotype reconstruction problem in MEC model with hybrid information fusion

    , Article EMS 2008, European Modelling Symposium, 2nd UKSim European Symposium on Computer Modelling and Simulation, Liverpool, 8 September 2008 through 10 September 2008 ; 2008 , Pages 214-218 ; 9780769533254 (ISBN) Asgarian, E ; Moeinzadeh, M. H ; Habibi, J ; Sharifian-R, S ; Rasooli-V, A ; Najafi-A, A ; Sharif University of Technology
    2008
    Abstract
    Single Nucleotide Polymorphisms (SNPs), a single DNA base varying from one individual to another, are believed to be the most frequent form responsible for genetic differences. Genotype is the conflated information of a pair of haplotypes on homologous chromosomes. Although haplotypes have more information for disease associating than individual SNPs and genotype, it is substantially more difficult to determine haplotypes through experiments. Hence, computational methods which can reduce the cost of determining haplotypes become attractive alternatives. MEC, as a standard model for haplotype reconstruction, is fed by fragments as input to infer the best pair of haplotypes with minimum error... 

    Estimating the mixing matrix in Sparse Component Analysis (SCA) based on multidimensional subspace clustering

    , Article 2007 IEEE International Conference on Telecommunications and Malaysia International Conference on Communications, ICT-MICC 2007, Penang, 14 May 2007 through 17 May 2007 ; February , 2007 , Pages 670-675 ; 1424410940 (ISBN); 9781424410941 (ISBN) Movahedi Naini, F ; Mohimani, G. H ; Babaiezadeh, M ; Jutten, C ; Sharif University of Technology
    2007
    Abstract
    In this paper we propose a new method for estimating the mixing matrix, A, in the linear model X = AS, for the problem of underdetermined Sparse Component Analysis (SCA). Contrary to most existing algorithms, in the proposed algorithm there may be more than one active source at each instant (i.e. in each column of the source matrix S), and the number of sources is not required to be known in advance. Since in the cases where more than one source is active at each instant, data samples concentrate around multidimensional subspaces, the idea of our method is to first estimate these subspaces and then estimate the mixing matrix from these estimated subspaces. ©2007 IEEE  

    Distance-based segmentation: An energy-efficient clustering hierarchy for wireless microsensor networks

    , Article CNSR 2007: 5th Annual Conference on Communication Networks and Services Research, Fredericton, NB, 14 May 2007 through 17 May 2007 ; 2007 , Pages 18-25 ; 076952835X (ISBN); 9780769528359 (ISBN) Amini, N ; Fazeli, M ; Miremadi, S. G ; Manzuri, M. T ; Sharif University of Technology
    2007
    Abstract
    This paper presents an energy-efficient communication protocol which distributes a uniform energy load to the sensors in a wireless microsensor network. This protocol, called Distance-Based Segmentation (DBS), is a clusterbased protocol that divides the entire network into equalarea segments and applies different clustering policies to each segment to reduce total energy dissipation and hence prolong the lifetime of the network. To evaluate the DBS protocol, a simulator was implemented using the MATLAB software. Simulation results show that the DBS can achieve as much as 16% reduction in total dissipated energy as compared with conventional protocols. In addition this protocol is able to... 

    Fuzzy Adaptive Resonance Theory for content-based data retrieval

    , Article 2006 Innovations in Information Technology, IIT, Dubai, 19 November 2006 through 21 November 2006 ; 2006 ; 1424406749 (ISBN); 9781424406746 (ISBN) Milani Fard, A ; Akbari, H ; Akbarzadeh-T., M. R ; Sharif University of Technology
    2006
    Abstract
    In this paper we propose a content-based text and image retrieval architecture using Fuzzy Adaptive Resonance Theory neural network. This method is equipped with an unsupervised mechanism for dynamic data clustering to deal with incremental information without metadata such as in web environment. Results show noticeable average precision and recall over search results. © 2006 IEEE  

    Sparse ICA via cluster-wise PCA

    , Article Neurocomputing ; Volume 69, Issue 13-15 , 2006 , Pages 1458-1466 ; 09252312 (ISSN) Babaie Zadeh, M ; Jutten, C ; Mansour, A ; Sharif University of Technology
    2006
    Abstract
    In this paper, it is shown that independent component analysis (ICA) of sparse signals (sparse ICA) can be seen as a cluster-wise principal component analysis (PCA). Consequently, Sparse ICA may be done by a combination of a clustering algorithm and PCA. For the clustering part, we use, in this paper, an algorithm inspired from K-means. The final algorithm is easy to implement for any number of sources. Experimental results points out the good performance of the method, whose the main restriction is to request an exponential growing of the sample number as the number of sources increases. © 2006 Elsevier B.V. All rights reserved