Loading...
Search for: clustering-algorithms
0.006 seconds
Total 136 records

    A sensitivity study of FILTERSIM algorithm when applied to DFN modeling

    , Article Journal of Petroleum Exploration and Production Technology ; Vol. 4, issue. 2 , June , 2014 , p. 153-174 ; ISSN: 21900558 Ahmadi, R ; Masihi, M ; Rasaei, M. R ; Eskandaridalvand, K ; Shahalipour, R ; Sharif University of Technology
    Abstract
    Realistic description of fractured reservoirs demands primarily for a comprehensive understanding of fracture networks and their geometry including various individual fracture parameters as well as network connectivities. Newly developed multiple-point geostatistical simulation methods like SIMPAT and FILTERSIM are able to model connectivity and complexity of fracture networks more effectively than traditional variogrambased methods. This approach is therefore adopted to be used in this paper. Among the multiple-point statistics algorithms, FILTERSIM has the priority of less computational effort than does SIMPAT by applying filters and modern dimensionality reduction techniques to the... 

    Scalable semi-supervised clustering by spectral kernel learning

    , Article Pattern Recognition Letters ; Vol. 45, issue. 1 , August , 2014 , p. 161-171 ; ISSN: 01678655 Soleymani Baghshah, M ; Afsari, F ; Bagheri Shouraki, S ; Eslami, E ; Sharif University of Technology
    Abstract
    Kernel learning is one of the most important and recent approaches to constrained clustering. Until now many kernel learning methods have been introduced for clustering when side information in the form of pairwise constraints is available. However, almost all of the existing methods either learn a whole kernel matrix or learn a limited number of parameters. Although the non-parametric methods that learn whole kernel matrix can provide capability of finding clusters of arbitrary structures, they are very computationally expensive and these methods are feasible only on small data sets. In this paper, we propose a kernel learning method that shows flexibility in the number of variables between... 

    Development of a new workflow for pseudo-component generation of reservoir fluid detailed analysis: A gas condensate case study

    , Article International Journal of Oil, Gas and Coal Technology ; Vol. 7, Issue. 3 , 2014 , pp. 275-297 ; ISSN: 1753-3317 Assareh, M ; Pishvaie, M. R ; Ghotbi, C ; Mittermeir, G. M ; Sharif University of Technology
    Abstract
    In this work, a new automatic workflow for accurate optimal pseudo-component generation from gas condensate mixtures with a large number of components is presented. This workflow has a good insight into thermo-physical and critical properties and introduces only a small amount of loss of information and EOS flexibility. In this regard, the fuzzy clustering is used to classify the components in the mixture based on the similarities in the critical properties. The mixing rules are then applied to find group properties. Two different approaches for components association in clustering process are investigated with several numbers of groups. The mathematical validity of the groups is controlled... 

    A study on clustering-based image denoising: from global clustering to local grouping

    , Article European Signal Processing Conference ; 10 November , 2014 , pp. 1657-1661 ; ISSN: 22195491 ; ISBN: 9780992862619 Joneidi, M ; Sadeghi, M ; Sahraee-Ardakan, M ; Babaie-Zadeh, M ; Jutten, C ; Sharif University of Technology
    Abstract
    This paper studies denoising of images contaminated with additive white Gaussian noise (AWGN). In recent years, clustering-based methods have shown promising performances. In this paper we show that low-rank subspace clustering provides a suitable clustering problem that minimizes the lower bound on the MSE of the denoising, which is optimum for Gaussian noise. Solving the corresponding clustering problem is not easy. We study some global and local sub-optimal solutions already presented in the literature and show that those that solve a better approximation of our problem result in better performances. A simple image denoising method based on dictionary learning using the idea of... 

    Spiking neuro-fuzzy clustering system and its memristor crossbar based implementation

    , Article Microelectronics Journal ; Vol. 45, issue. 11 , 2014 , pp. 1450-1462 ; ISSN: 00262692 Bavandpour, M ; Bagheri-Shouraki, S ; Soleimani, H ; Ahmadi, A ; Linares-Barranco, B ; Sharif University of Technology
    Abstract
    This study proposes a spiking neuro-fuzzy clustering system based on a novel spike encoding scheme and a compatible learning algorithm. In this system, we utilize an analog to binary encoding scheme that properly maps the concept of "distance" in multi-dimensional analog spaces to the concept of "dissimilarity " of binary bits in the equivalent binary spaces. When this scheme is combined with a novel binary to spike encoding scheme and a proper learning algorithm is applied, a powerful clustering algorithm is produced. This algorithm creates flexible fuzzy clusters in its analog input space and modifies their shapes to different convex shapes during the learning process. This system has... 

    Clustering and outlier detection using isoperimetric number of trees

    , Article Pattern Recognition ; Volume 46, Issue 12 , December , 2013 , Pages 3371-3382 ; 00313203 (ISSN) Daneshgar, A ; Javadi, R ; Shariat Razavi, S. B ; Sharif University of Technology
    2013
    Abstract
    We propose a graph-based data clustering algorithm which is based on exact clustering of a minimum spanning tree in terms of a minimum isoperimetry criteria. We show that our basic clustering algorithm runs in O(nlogn) and with post-processing in almost O(nlogn) (average case) and O(n2) (worst case) time where n is the size of the data-set. It is also shown that our generalized graph model, which also allows the use of potentials at vertices, can be used to extract an extra piece of information related to anomalous data patterns and outliers. In this regard, we propose an algorithm that extracts outliers in parallel to data clustering. We also provide a comparative performance analysis of... 

    Unsupervised induction of persian semantic verb classes based on syntactic information

    , Article Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), Warsaw ; Volume 7912 LNCS , June , 2013 , Pages 112-124 ; 03029743 (ISSN) ; 9783642386336 (ISBN) Aminian, M ; Rasooli, M. S ; Sameti, H ; Sharif University of Technology
    2013
    Abstract
    Automatic induction of semantic verb classes is one of the most challenging tasks in computational lexical semantics with a wide variety of applications in natural language processing. The large number of Persian speakers and the lack of such semantic classes for Persian verbs have motivated us to use unsupervised algorithms for Persian verb clustering. In this paper, we have done experiments on inducing the semantic classes of Persian verbs based on Levin's theory for verb classes. Syntactic information extracted from dependency trees is used as base features for clustering the verbs. Since there has been no manual classification of Persian verbs prior to this paper, we have prepared a... 

    Learning low-dimensional subspaces via sequential subspace fitting

    , Article 2013 21st Iranian Conference on Electrical Engineering, ICEE 2013 ; 2013 , 14 May-16 May 2 ; 9781467356343 (ISBN) Sadeghi, M ; Joneidi, M ; Golestani, H. B ; Sharif University of Technology
    2013
    Abstract
    In this paper we address the problem of learning low-dimensional subspaces using a given set of training data. To this aim, we propose an algorithm that performs by sequentially fitting a number of low-dimensional subspaces to the training data. Once we found a subset of the training data that is sufficiently near a fitted subspace, we omit these signals from the set of training signals and repeat the same procedure for the remaining signals until all training signals are assigned to a subspace. We then propose a robust version of the algorithm to address the situation in which the training signals are contaminated by additive white Gaussian noise (AWGN). Experimental results on both... 

    Probabilistic non-linear distance metric learning for constrained clustering

    , Article MultiClust 2013 - 4th Workshop on Multiple Clusterings, Multi-View Data, and Multi-Source Knowledge-Driven Clustering, in Conj. with the 19th ACM SIGKDD Int. Conf. on KDD 2013 ; 2013 ; 9781450323345 (ISBN) Babagholami Mohamadabadi, B ; Zarghami, A ; Pourhaghighi, H. A ; Manzuri Shalmani, M. T ; Sharif University of Technology
    2013
    Abstract
    Distance metric learning is a powerful approach to deal with the clustering problem with side information. For semi-supervised clustering, usually a set of pairwise similarity and dissimilarity constraints is provided as supervisory information. Although some of the existing methods can use both equivalence (similarity) and inequivalence (dissimilarity) constraints, they are usually limited to learning a global Mahalanobis metric (i.e., finding a linear transformation). Moreover, they find metrics only according to the data points appearing in constraints, and cannot utilize information of other data points. In this paper, we propose a probabilistic metric learning algorithm which uses... 

    GoSCAN: Decentralized scalable data clustering

    , Article Computing ; Volume 95, Issue 9 , 2013 , Pages 759-784 ; 0010485X (ISSN) Mashayekhi, H ; Habibi, J ; Voulgaris, S ; Van Steen, M ; Sharif University of Technology
    2013
    Abstract
    Identifying clusters is an important aspect of analyzing large datasets. Clustering algorithms classically require access to the complete dataset. However, as huge amounts of data are increasingly originating from multiple, dispersed sources in distributed systems, alternative solutions are required. Furthermore, data and network dynamicity in a distributed setting demand adaptable clustering solutions that offer accurate clustering models at a reasonable pace. In this paper, we propose GoScan, a fully decentralized density-based clustering algorithm which is capable of clustering dynamic and distributed datasets without requiring central control or message flooding. We identify two major... 

    An algorithm for discovering clusters of different densities or shapes in noisy data sets

    , Article Proceedings of the ACM Symposium on Applied Computing ; March , 2013 , Pages 144-149 ; 9781450316569 (ISBN) Khani, F ; Hosseini, M. J ; Abin, A. A ; Beigy, H ; Sharif University of Technology
    2013
    Abstract
    In clustering spatial data, we are given a set of points in Rn and the objective is to find the clusters (representing spatial objects) in the set of points. Finding clusters with different shapes, sizes, and densities in data with noise and potentially outliers is a challenging task. This problem is especially studied in machine learning community and has lots of applications. We present a novel clustering technique, which can solve mentioned issues considerably. In the proposed algorithm, we let the structure of the data set itself find the clusters, this is done by having points actively send and receive feedbacks to each other. The idea of the proposed method is to transform the input... 

    Using diagenetic processes in facies modeling of a carbonate reservoir

    , Article Energy Sources, Part A: Recovery, Utilization and Environmental Effects ; Volume 35, Issue 6 , Jan , 2013 , Pages 516-528 ; 15567036 (ISSN) Farzaneh, S. A ; Dehghan, A. A ; Lakzaie, A ; Sharif University of Technology
    2013
    Abstract
    The construction of a facies model could be employed as a conditional data for any property simulation that results in a more reliable reservoir characterization in further steps. In this study, an Iranian gas reservoir with six wells was studied to determine the 3D reservoir facies model. Fifteen reservoir facies were first detected along one of the wells with detailed core and thin section descriptions. Due to the significant difference between the core and log data resolution, facies were clustered into four major groups regarding the digenetic processes and petrophysical lithofacies properties (permeability and porosity). The lithofacies specification effect on petrophysical properties... 

    An analytical delumping methodology for PC-SAFT with application to reservoir fluids

    , Article Fluid Phase Equilibria ; Volume 339 , 2013 , Pages 40-51 ; 03783812 (ISSN) Assareh, M ; Ghotbi, C ; Pishvaie, M. R ; Mittermeir, G. M ; Sharif University of Technology
    2013
    Abstract
    The strong bases statistical associated fluid theory (SAFT) equations of state allow modeling for a wide range of scales and applications. The equilibrium calculations are very time-consuming in SAFT-based family of equations of state; therefore the number of components used in describing a fluid mixture must be reduced by grouping. On the other hand, in some applications it is required to retrieve the detailed fluid description from equilibrium calculation performed on the lumped fluid description. The purpose of this paper is to develop a systematic approach for lumping and delumping with equilibrium calculations using the Perturbed Chain (PC)-SAFT equation of state. The methodology... 

    Efficient stochastic algorithms for document clustering

    , Article Information Sciences ; Volume 220 , 2013 , Pages 269-291 ; 00200255 (ISSN) Forsati, R ; Mahdavi, M ; Shamsfard, M ; Meybodi, M. R ; Sharif University of Technology
    2013
    Abstract
    Clustering has become an increasingly important and highly complicated research area for targeting useful and relevant information in modern application domains such as the World Wide Web. Recent studies have shown that the most commonly used partitioning-based clustering algorithm, the K-means algorithm, is more suitable for large datasets. However, the K-means algorithm may generate a local optimal clustering. In this paper, we present novel document clustering algorithms based on the Harmony Search (HS) optimization method. By modeling clustering as an optimization problem, we first propose a pure HS based clustering algorithm that finds near-optimal clusters within a reasonable time.... 

    Visual tracking using D2-clustering and particle filter

    , Article 2012 IEEE International Symposium on Signal Processing and Information Technology, ISSPIT 2012 ; 2012 , Pages 230-235 ; 9781467356060 (ISBN) Raziperchikolaei, R ; Jamzad, M ; Sharif University of Technology
    2012
    Abstract
    Since tracking algorithms should be robust with respect to appearance changes, online algorithms has been investigated recently instead of offline ones which has shown an acceptable performance in controlled environments. The most challenging issue in online algorithms is updating of the model causing tracking failure because of introducing small errors in each update and disturbing the appearance model (drift). in this paper, we propose an online generative tracking algorithm in order to overcome the challenges such as occlusion, object shape changes, and illumination variations. In each frame, color distribution of target candidates is obtained and the candidate having the lowest distance... 

    DSCLU: A new data stream CLUstring algorithm for multi density environments

    , Article Proceedings - 13th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, SNPD 2012 ; 2012 , Pages 83-88 ; 9780769547619 (ISBN) Namadchian, A ; Esfandani, G ; Sharif University of Technology
    2012
    Abstract
    Recently, data stream has become popular in many contexts of data mining. Due to the high amount of incoming data, traditional clustering algorithms are not suitable for this family of problems. Many data stream clustering algorithms proposed in recent years considered the scalability of data, but most of them did not attend the following issues: (1) The quality of clustering can be dramatically low over the time. (2) Some of the algorithms cannot handle arbitrary shapes of data stream and consequently the results are limited to specific regions. (3) Most of the algorithms have not been evaluated in multi-density environments. Identifying appropriate clusters for data stream by handling the... 

    GDCLU: A new grid-density based clustring algorithm

    , Article Proceedings - 13th ACIS International Conference on Software Engineering, Artificial Intelligence, Networking, and Parallel/Distributed Computing, SNPD 2012, 8 August 2012 through 10 August 2012 ; August , 2012 , Pages 102-107 ; 9780769547619 (ISBN) Esfandani, G ; Sayyadi, M ; Namadchian, A ; Sharif University of Technology
    2012
    Abstract
    This paper addresses the density based clustering problem in data mining where clusters are established based on density of regions. The most well-known algorithm proposed in this area is DBSCAN [1] which employs two parameters influencing the shape of resulted clusters. Therefore, one of the major weaknesses of this algorithm is lack of ability to handle clusters in multi-density environments. In this paper, a new density based grid clustering algorithm, GDCLU, is proposed which uses a new definition for dense regions. It determines dense grids based on densities of their neighbors. This new definition enables GDCLU to handle different shaped clusters in multi-density environments. Also... 

    Probabilistic heuristics for hierarchical web data clustering

    , Article Computational Intelligence ; Volume 28, Issue 2 , 2012 , Pages 209-233 ; 08247935 (ISSN) Haghir Chehreghani, M ; Haghir Chehreghani, M ; Abolhassani, H ; Sharif University of Technology
    Abstract
    Clustering Web data is one important technique for extracting knowledge from the Web. In this paper, a novel method is presented to facilitate the clustering. The method determines the appropriate number of clusters and provides suitable representatives for each cluster by inference from a Bayesian network. Furthermore, by means of the Bayesian network, the contents of the Web pages are converted into vectors of lower dimensions. The method is also extended for hierarchical clustering, and a useful heuristic is developed to select a good hierarchy. The experimental results show that the clusters produced benefit from high quality  

    Developing a hybrid artificial intelligence model for outpatient visits forecasting in hospitals

    , Article Applied Soft Computing Journal ; Volume 12, Issue 2 , 2012 , Pages 700-711 ; 15684946 (ISSN) Hadavandi, E ; Shavandi, H ; Ghanbari, A ; Abbasian Naghneh, S ; Sharif University of Technology
    Abstract
    Accurate forecasting of outpatient visits aids in decision-making and planning for the future and is the foundation for greater and better utilization of resources and increased levels of outpatient care. It provides the ability to better manage the ways in which outpatient's needs and aspirations are planned and delivered. This study presents a hybrid artificial intelligence (AI) model to develop a Mamdani type fuzzy rule based system to forecast outpatient visits with high accuracy. The hybrid model uses genetic algorithm for evolving knowledge base of fuzzy system. Actually it extracts useful patterns of information with a descriptive rule induction approach based on Genetic Fuzzy Systems... 

    Automatic identification of overlapping/touching chromosomes in microscopic images using morphological operators

    , Article 2011 7th Iranian Conference on Machine Vision and Image Processing, MVIP 2011 - Proceedings, 16 November 2011 through 17 November 2011 ; November , 2011 , Page(s): 1 - 4 ; 9781457715358 (ISBN) Jahani, S ; Setarehdan, S. K ; Fatemizadeh, E ; Sharif University of Technology
    2011
    Abstract
    Karyotyping, is the process of classification of human chromosomes within the microscopic images. This is a common task for diagnosing many genetic disorders and abnormalities. Automatic Karyotyping algorithms usually suffer the poor quality of the images due to the non rigid nature of the chromosomes which makes them to have unpredictable shapes and sizes in various images. One of the main problems that usually need operator's interaction is the identification and separation of the overlapping/touching chromosomes. This paper presents an effective algorithm for identification of any cluster of the overlapping/touching chromosomes together with the number of chromosomes in the cluster, which...