Loading...
Search for: clustering-algorithms
0.009 seconds
Total 136 records

    Stratification of admixture population:A bayesian approach

    , Article 7th Iranian Joint Congress on Fuzzy and Intelligent Systems, CFIS 2019, 29 January 2019 through 31 January 2019 ; 2019 ; 9781728106731 (ISBN) Tamiji, M ; Taheri, S. M ; Motahari, S. A ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    A statistical algorithm is introduced to improve the false inference of active loci, in the population in which members are admixture. The algorithm uses an advanced clustering algorithm based on a Bayesian approach. The proposed algorithm simultaneously infers the hidden structure of the population. In this regard, the Monte Carlo Markov Chain (MCMC) algorithm has been used to evaluate the posterior probability distribution of the model parameters. The proposed algorithm is implemented in a bundle, and then its performance is widely evaluated in a number of artificial databases. The accuracy of the clustering algorithm is compared with the STRUCTURE method based on certain criterion. © 2019... 

    Partial discharges pattern recognition of transformer defect model by LBP & HOG features

    , Article IEEE Transactions on Power Delivery ; Volume 34, Issue 2 , 2019 , Pages 542-550 ; 08858977 (ISSN) Firuzi, K ; Vakilian, M ; Phung, B. T ; Blackburn, T. R ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    Partial discharge (PD) measurement and identification have great importance to condition monitoring of power transformers. In this paper, a new method for recognition of single and multi-source of PD based on extraction of high level image features has been introduced. A database, involving 365 samples of phase-resolved PD (PRPD) data, is developed by measurement carried out on transformer artificial defect models (having different sizes of defect) under a specific applied voltage, to be used for proposed algorithm validation. In the first step, each set of PRPD data is converted into grayscale images to represent different PD defects. Two 'image feature extraction' methods, the Local Binary... 

    Statistical association mapping of population-structured genetic data

    , Article IEEE/ACM Transactions on Computational Biology and Bioinformatics ; Volume 16, Issue 2 , 2019 , Pages 636-649 ; 15455963 (ISSN) Najafi, A ; Janghorbani, S ; Motahari, A ; Fatemizadeh, E ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    Association mapping of genetic diseases has attracted extensive research interest during the recent years. However, most of the methodologies introduced so far suffer from spurious inference of the associated sites due to population inhomogeneities. In this paper, we introduce a statistical framework to compensate for this shortcoming by equipping the current methodologies with a state-of-the-art clustering algorithm being widely used in population genetics applications. The proposed framework jointly infers the disease-associated factors and the hidden population structures. In this regard, a Markov Chain-Monte Carlo (MCMC) procedure has been employed to assess the posterior probability... 

    EATSDCD: A green energy-aware scheduling algorithm for parallel task-based application using clustering, duplication and DVFS technique in cloud datacenters

    , Article Journal of Intelligent and Fuzzy Systems ; Volume 36, Issue 6 , 2019 , Pages 5135-5152 ; 10641246 (ISSN) Barzegar, B ; Motameni, H ; Movaghar, A ; Sharif University of Technology
    IOS Press  2019
    Abstract
    Energy consumption and performance metrics have become critical issues for scheduling parallel task-based applications in high-performance computing systems such as cloud datacenters. The duplication and clustering strategy, as well as Dynamic Voltage Frequency Scaling (DVFS) technique, have separately been concentrated on reducing energy consumption and optimizing performance parameters such as throughput and makespan. In this paper, a dual-phase algorithm called EATSDCD which is an energy efficient time aware has been proposed. The algorithm uses the combination of duplication and clustering strategies to schedule the precedence-constrained task graph on datacenter processors through DVFS.... 

    EATSDCD: A green energy-aware scheduling algorithm for parallel task-based application using clustering, duplication and DVFS technique in cloud datacenters

    , Article Journal of Intelligent and Fuzzy Systems ; Volume 36, Issue 6 , 2019 , Pages 5135-5152 ; 10641246 (ISSN) Barzegar, B ; Motameni, H ; Movaghar, A ; Sharif University of Technology
    IOS Press  2019
    Abstract
    Energy consumption and performance metrics have become critical issues for scheduling parallel task-based applications in high-performance computing systems such as cloud datacenters. The duplication and clustering strategy, as well as Dynamic Voltage Frequency Scaling (DVFS) technique, have separately been concentrated on reducing energy consumption and optimizing performance parameters such as throughput and makespan. In this paper, a dual-phase algorithm called EATSDCD which is an energy efficient time aware has been proposed. The algorithm uses the combination of duplication and clustering strategies to schedule the precedence-constrained task graph on datacenter processors through DVFS.... 

    Learning a metric when clustering data points in the presence of constraints

    , Article Advances in Data Analysis and Classification ; Volume 14, Issue 1 , 2020 , Pages 29-56 Abin, A. A ; Bashiri, M. A ; Beigy, H ; Sharif University of Technology
    Springer  2020
    Abstract
    Learning an appropriate distance measure under supervision of side information has become a topic of significant interest within machine learning community. In this paper, we address the problem of metric learning for constrained clustering by considering three important issues: (1) considering importance degree for constraints, (2) preserving the topological structure of data, and (3) preserving some natural distribution properties in the data. This work provides a unified way to handle different issues in constrained clustering by learning an appropriate distance measure. It has modeled the first issue by injecting the importance degree of constraints directly into an objective function.... 

    A new approach for sensitivity analysis in network flow problems

    , Article International Journal of Industrial Engineering : Theory Applications and Practice ; Volume 27, Issue 1 , 2020 , Pages 72-87 Aini, A ; Eshghi, K ; Salehipour, A ; Sharif University of Technology
    University of Cincinnati  2020
    Abstract
    This paper proposes a new approach to study the sensitivity analysis in the network flow problems, in particular, the minimum spanning tree and shortest path problems. In a sensitivity analysis, one looks for the amount of changes in the edges’ weights, number of edges or number of vertices such that the optimal solution, i.e., the minimum spanning tree or shortest path does not change. We introduce a novel approach, and develop associated equations and mathematics. We discuss two illustrative examples to show the applicability of the proposed approach. © International Journal of Industrial Engineering  

    Complexity of computing the anti-ramsey numbers for paths

    , Article 45th International Symposium on Mathematical Foundations of Computer Science, MFCS 2020, 25 August 2020 through 26 August 2020 ; Volume 170 , 2020 Amiri, S. A ; Popa, A ; Roghani, M ; Shahkarami, G ; Soltani, R ; Vahidi, H ; Sharif University of Technology
    Schloss Dagstuhl- Leibniz-Zentrum fur Informatik GmbH, Dagstuhl Publishing  2020
    Abstract
    The anti-Ramsey numbers are a fundamental notion in graph theory, introduced in 1978, by Erdös, Simonovits and Sós. For given graphs G and H the anti-Ramsey number ar(G, H) is defined to be the maximum number k such that there exists an assignment of k colors to the edges of G in which every copy of H in G has at least two edges with the same color. Usually, combinatorists study extremal values of anti-Ramsey numbers for various classes of graphs. There are works on the computational complexity of the problem when H is a star. Along this line of research, we study the complexity of computing the anti-Ramsey number ar(G, Pk), where Pk is a path of length k. First, we observe that when k is... 

    Predicting scientific research trends based on link prediction in keyword networks

    , Article Journal of Informetrics ; Volume 14, Issue 4 , 2020 Behrouzi, S ; Shafaeipour Sarmoor, Z ; Hajsadeghi, K ; Kavousi, K ; Sharif University of Technology
    Elsevier Ltd  2020
    Abstract
    The rapid development of scientific fields in this modern era has raised the concern for prospective scholars to find a proper research field to conduct their future studies. Thus, having a vision of future could be helpful to pick the right path for doing research and ensuring that it is worth investing in. In this study, we use article keywords of computer science journals and conferences, assigned by INSPEC controlled indexing, to construct a temporal scientific knowledge network. By observing keyword networks snapshots over time, we can utilize the link prediction methods to foresee the future structures of these networks. We use two different approaches for this link prediction problem.... 

    A three-dimensional statistical volume element for histology informed micromechanical modeling of brain white matter

    , Article Annals of Biomedical Engineering ; Volume 48, Issue 4 , 2020 , Pages 1337-1353 Hoursan, H ; Farahmand, F ; Ahmadian, M. T ; Sharif University of Technology
    Springer  2020
    Abstract
    This study presents a novel statistical volume element (SVE) for micromechanical modeling of the white matter structures, with histology-informed randomized distribution of axonal tracts within the extracellular matrix. The model was constructed based on the probability distribution functions obtained from the results of diffusion tensor imaging as well as the histological observations of scanning electron micrograph, at two structures of white matter susceptible to traumatic brain injury, i.e. corpus callosum and corona radiata. A simplistic representative volume element (RVE) with symmetrical arrangement of fully alligned axonal fibers was also created as a reference for comparison. A... 

    High-dimensional sparse recovery using modified generalised SL0 and its application in 3D ISAR imaging

    , Article IET Radar, Sonar and Navigation ; Volume 14, Issue 8 , 6 July , 2020 , Pages 1267-1278 Nazari, M ; Mehrpooya, A ; Bastani, M. H ; Nayebi, M ; Abbasi, Z ; Sharif University of Technology
    Institution of Engineering and Technology  2020
    Abstract
    Sparse representation can be extended to high dimensions and can be used in many applications, including three-dimensional (3D) Inverse synthetic aperture radar (ISAR) imaging. In this study, the high-dimensional sparse representation problem and a recovery method called high-dimensional smoothed least zero-norm (HDSL0) are formulated. In this method, the theory and computation of tensors and approximating L0 norm using Gaussian functions are used for sparse recovery of high-dimensional data. To enhance the performance of HDSL0, modified regularised high-dimensional SL0 (MRe-HDSL0) algorithm, which benefits from the regularised form of SL0 and an additional hard thresholding step, is... 

    A clustering fuzzification algorithm based on ALM

    , Article Fuzzy Sets and Systems ; Volume 389 , 2020 , Pages 93-113 Javadian, M ; Malekzadeh, A ; Heydari, G ; Bagheri Shouraki, S ; Sharif University of Technology
    Elsevier B.V  2020
    Abstract
    In this paper, we propose a fuzzification method for clusters produced from a clustering process, based on Active Learning Method (ALM). ALM is a soft computing methodology which is based on a hypothesis claiming that human brain interprets information in pattern-like images. The proposed fuzzification method is applicable to all non-fuzzy clustering algorithms as a post process. The most outstanding advantage of this method is the ability to determine the membership degrees of each data to all clusters based on the density and shape of the clusters. It is worth mentioning that for existing fuzzy clustering algorithms such as FCM the membership degree is usually determined as a function of... 

    Density peaks clustering based on density backbone and fuzzy neighborhood

    , Article Pattern Recognition ; Volume 107 , November , 2020 Lotfi, A ; Moradi, P ; Beigy, H ; Sharif University of Technology
    Elsevier Ltd  2020
    Abstract
    Density peaks clustering (DPC) is as an efficient clustering algorithm due for using a non-iterative process. However, DPC and most of its improvements suffer from the following shortcomings: (1) highly sensitive to its cutoff distance parameter, (2) ignoring the local structure of data in computing local densities, (3) using a crisp kernel to calculate local densities, and (4) suffering from the cause of chain reaction. To address these issues, in this paper a new method called DPC-DBFN is proposed. The proposed method uses a fuzzy kernel for improving separability of clusters and reducing the impact of outliers. DPC-DBFN uses a density-based kNN graph for labeling backbones. This strategy... 

    Improved algorithms for distributed balanced clustering

    , Article 3rd IFIP WG 1.8 International Conference on Topics in Theoretical Computer Science, TTCS 2020, 1 July 2020 through 2 July 2020 ; Volume 12281 LNCS , 2020 , Pages 72-84 Mirjalali, K ; Zarrabizadeh, H ; Sharif University of Technology
    Springer Science and Business Media Deutschland GmbH  2020
    Abstract
    We study a weighted balanced version of the k-center problem, where each center has a fixed capacity, and each element has an arbitrary demand. The objective is to assign demands of the elements to the centers, so as the total demand assigned to each center does not exceed its capacity, while the maximum distance between centers and their assigned elements is minimized. We present a deterministic O(1)-approximation algorithm for this generalized version of the k-center problem in the distributed setting, where data is partitioned among a number of machines. Our algorithm substantially improves the approximation factor of the current best randomized algorithm available for the problem. We... 

    Joint, partially-joint, and individual independent component analysis in multi-subject fMRI data

    , Article IEEE Transactions on Biomedical Engineering ; Volume 67, Issue 7 , 2020 , Pages 1969-1981 Pakravan, M ; Shamsollahi, M. B ; Sharif University of Technology
    IEEE Computer Society  2020
    Abstract
    Objective: Joint analysis of multi-subject brain imaging datasets has wide applications in biomedical engineering. In these datasets, some sources belong to all subjects (joint), a subset of subjects (partially-joint), or a single subject (individual). In this paper, this source model is referred to as joint/partially-joint/individual multiple datasets unidimensional (JpJI-MDU), and accordingly, a source extraction method is developed. Method: We present a deflation-based algorithm utilizing higher order cumulants to analyze the JpJI-MDU source model. The algorithm maximizes a cost function which leads to an eigenvalue problem solved with thin-SVD (singular value decomposition)... 

    Notice of violation of IEEE publication principles: a new automatic clustering algorithm via deadline timer for wireless ad-hoc sensor networks

    , Article 2008 3rd International Conference on Information and Communication Technologies: From Theory to Applications, ICTTA, 7 April 2008 through 11 April 2008, Damascus ; 2008 ; 9781424417513 (ISBN) Bazregar, A ; Movaghar, A ; Barati, A ; Eslami Nejhad, M. R ; Barati, H ; Sharif University of Technology
    2008
    Abstract
    This paper proposes a decentralized algorithm to organize an ad-hoc sensor network into clusters. Each sensor operates independently, monitoring communication among others. Those sensors which have many neighbors that are not already part of a cluster are likely candidates to creating a new cluster by declaring themselves to be a new cluster-head. The Clustering Algorithm via Deadline Timer (CADT) provides a protocol whereby this can be achieved and the process continues until all sensors are part of a cluster. Because of the difficulty of analyses simplified models are used to study and abstract its performance. A simple formula to estimate the number of clusters which will be formed in an... 

    A fuzzy clustering algorithm for finding arbitrary shaped clusters

    , Article 6th IEEE/ACS International Conference on Computer Systems and Applications, AICCSA 2008, Doha, 31 March 2008 through 4 April 2008 ; 2008 , Pages 559-566 ; 9781424419685 (ISBN) Soleymani Baghshah, M ; Bagheri Shouraki, S ; Sharif University of Technology
    2008
    Abstract
    Until now, many algorithms have been introduced for finding arbitrary shaped clusters, but none of these algorithms is able to identify all sorts of cluster shapes and structures that are encountered in practice. Furthermore, the time complexity of the existing algorithms is usually high and applying them on large dataseis is time-consuming. In this paper, a novel fast clustering algorithm is proposed. This algorithm distinguishes clusters of different shapes using a twostage clustering approach. In the first stage, the data points are grouped into a relatively large number of fuzzy ellipsoidal sub-clusters. Then, connections between sub-clusters are established according to the Bhatiacharya... 

    An agent-based clustering algorithm using potential fields

    , Article 6th IEEE/ACS International Conference on Computer Systems and Applications, AICCSA 2008, Doha, 31 March 2008 through 4 April 2008 ; 2008 , Pages 551-558 ; 9781424419685 (ISBN) Soleymani Baghshah, M ; Bagheri Shouraki, S ; Lucas, C ; Sharif University of Technology
    2008
    Abstract
    In this paper, a novel clustering algorithm using an agent-based architecture along with a force-based clustering algorithm is proposed. To this end, a set of simple mobile agents thai have limited processing power is used. These agents communicate in a pairwise manner to exchange their position information. As opposed to the bio-inspired clustering algorithms that need a set of local rules to specify the agent movements, in this paper the agent motions are driven from attractive and repulsive potential fields that are created by the data points and the other agents respectively. Each agent moves according to the resulted force from applying the potential fields and announces its next... 

    Adaptive neuro-fuzzy inference system for classification of ACL-ruptured knees using arthrometric data

    , Article Annals of Biomedical Engineering ; Volume 36, Issue 9 , 9 July , 2008 , Pages 1449-1457 ; 00906964 (ISSN) Heydari, Z ; Farahmand, F ; Arabalibeik, H ; Parnianpour, M ; Sharif University of Technology
    2008
    Abstract
    A new approach, based on Adaptive-Network-based Fuzzy Inference System (ANFIS), is presented for the classification of arthrometric data of normal/ACL-ruptured knees, considering the insufficiency of existing criteria. An ANFIS classifier was developed and tested on a total of 4800 arthrometric data points collected from 40 normal and 40 injured subjects. The system consisted of 5 layers and 8 rules, based on the results of subtractive data clustering, and trained using the hybrid algorithm method. The performance of the system was evaluated in four runs, in the framework of a 4-fold cross validation algorithm. The results indicated a definite correct diagnosis for typical injured and normal... 

    Web page clustering using harmony search optimization

    , Article IEEE Canadian Conference on Electrical and Computer Engineering, CCECE 2008, Niagara Falls, ON, 4 May 2008 through 7 May 2008 ; 2008 , Pages 1601-1604 ; 08407789 (ISSN) ; 9781424416431 (ISBN) Forsati, R ; Mahdavi, M ; Kangavari, M ; Safarkhani, B ; Sharif University of Technology
    2008
    Abstract
    Clustering has become an increasingly important task in modern application domains. Targeting useful and relevant information on the World Wide Web is a topical and highly complicated research area. Clustering techniques have been applied to categorize documents on web and extracting knowledge from the web. In this paper we propose novel clustering algorithms based on Harmony Search (HS) optimization method that deals with web document clustering. By modeling clustering as an optimization problem, first, we propose a pure HS based clustering algorithm that finds near global optimal clusters within a reasonable time. Then we hybridize K-means and harmony clustering to achieve better...