Loading...
Search for: classification
0.009 seconds
Total 562 records

    Detection of evolving concepts in non-stationary data streams: A multiple kernel learning approach

    , Article Expert Systems with Applications ; Volume 91 , 2018 , Pages 187-197 ; 09574174 (ISSN) Kamali Siahroudi, S ; Zare Moodi, P ; Beigy, H ; Sharif University of Technology
    Elsevier Ltd  2018
    Abstract
    Due to the unprecedented speed and volume of generated raw data in most of applications, data stream mining has attracted a lot of attention recently. Methods for solving these problems should address challenges in this area such as infinite length, concept-drift, recurring concepts, and concept-evolution. Moreover, due to the speedy intrinsic of data streams, the time and space complexity of the methods are extremely important. This paper proposes a novel method based on multiple-kernels for classifying non-stationary data streams, which addresses the mentioned challenges with special attention to the space complexity. By learning multiple kernels and specifying the boundaries of classes in... 

    MGCN: Semi-supervised classification in multi-layer graphs with graph convolutional networks

    , Article 11th IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining, ASONAM 2019, 27 August 2019 through 30 August 2019 ; 2019 , Pages 208-211 ; 9781450368681 (ISBN) Ghorbani, M ; Baghshah, M. S ; Rabiee, H. R ; Sharif University of Technology
    Association for Computing Machinery, Inc  2019
    Abstract
    Graph embedding is an important approach for graph analysis tasks such as node classification and link prediction. The goal of graph embedding is to find a low dimensional representation of graph nodes that preserves the graph information. Recent methods like Graph Convolutional Network (GCN) try to consider node attributes (if available) besides node relations and learn node embeddings for unsupervised and semi-supervised tasks on graphs. On the other hand, multi-layer graph analysis has been received attention recently. However, the existing methods for multi-layer graph embedding cannot incorporate all available information (like node attributes). Moreover, most of them consider either... 

    Binary classification of imbalanced datasets: The case of CoIL challenge 2000

    , Article Expert Systems with Applications ; Volume 128 , 2019 , Pages 169-186 ; 09574174 (ISSN) Khalilpour Darzi, M. R ; Akhavan Niaki, S. T ; Khedmati, M ; Sharif University of Technology
    Elsevier Ltd  2019
    Abstract
    This paper presents some approaches based on data mining techniques to solve the prediction task of Computational Intelligence and Learning (CoIL) Challenge 2000. The prediction task of the contest is a direct mailing problem and the goal is to improve its response rate. The main issue in this competition is the incompatibility of the dataset in which the distribution of the classes of the target attribute is highly unbalanced. This in turn causes high error rate in identifying the minority class samples. Three different level methods including data-level, algorithm-level, and hybrid method are used to overcome this issue. The specificity, sensitivity, precision-recall, and ROC criteria are... 

    An Efficient semi-supervised multi-label classifier capable of handling missing labels

    , Article IEEE Transactions on Knowledge and Data Engineering ; Volume 31, Issue 2 , 2019 , Pages 229-242 ; 10414347 (ISSN) Hosseini Akbarnejad, A ; Soleymani Baghshah, M ; Sharif University of Technology
    IEEE Computer Society  2019
    Abstract
    Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues including: handling large-scale datasets with many instances and a large set of labels, compensating missing label assignments in the training set, considering correlations between labels, as well as exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low-dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these... 

    Scale equivariant CNNs with scale steerable filters

    , Article Iranian Conference on Machine Vision and Image Processing, MVIP, 19 February 2020 through 20 February 2020 ; Volume 2020-February , 2020 ; ISSN: 21666776 ; ISBN: 9781728168326 Naderi, H ; Goli, L ; Kasaei, S ; Sharif University of Technology
    IEEE Computer Society  2020
    Abstract
    Convolution Neural Networks (CNNs), despite being one of the most successful image classification methods, are not robust to most geometric transformations (rotation, isotropic scaling) because of their structural constraints. Recently, scale steerable filters have been proposed to allow scale invariance in CNNs. Although these filters enhance the network performance in scaled image classification tasks, they cannot maintain the scale information across the network. In this paper, this problem is addressed. First, a CNN is built with the usage of scale steerable filters. Then, a scale equivariat network is acquired by adding a feature map to each layer so that the scale-related features are... 

    A distributed density estimation algorithm and its application to naive Bayes classification

    , Article Applied Soft Computing ; Volume 98 , 2021 ; 15684946 (ISSN) Khajenezhad, A ; Bashiri, M. A ; Beigy, H ; Sharif University of Technology
    Elsevier Ltd  2021
    Abstract
    We consider the problem of learning a density function from observations of an unknown underlying model in a distributed setting, where the observations are partitioned into different sites. Applying commonly used density estimation methods such as Gaussian Mixture Model (GMM) or Kernel Density Estimation (KDE) to distributed data leads to an extensive amount of communication. A familiar approach to address this issue is to sample a small subset of data and collect them into a central node to run the density estimation algorithms on them. In this paper, we follow an alternative to the sub-sampling approach by proposing the nested Log-Poly model. This model provides an accurate density... 

    G2D: Generate to detect anomaly

    , Article 2021 IEEE Winter Conference on Applications of Computer Vision, WACV 2021, 5 January 2021 through 9 January 2021 ; 2021 , Pages 2002-2011 ; 9780738142661 (ISBN) Pourreza, M ; Mohammadi, B ; Khaki, M ; Bouindour, S ; Snoussi, H ; Sabokrou, M ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2021
    Abstract
    In this paper, we propose a novel method for irregularity detection. Previous researches solve this problem as a One-Class Classification (OCC) task where they train a reference model on all of the available samples. Then, they consider a test sample as an anomaly if it has a diversion from the reference model. Generative Adversarial Networks (GANs) have achieved the most promising results for OCC while implementing and training such networks, especially for the OCC task, is a cumbersome and computationally expensive procedure. To cope with the mentioned challenges, we present a simple but effective method to solve the irregularity detection as a binary classification task in order to make... 

    Multi-cass Semi-srvised Classification of Data Streams

    , M.Sc. Thesis Sharif University of Technology Sepehr, Arman (Author) ; Beigy, Hamid (Supervisor)
    Abstract
    Recent advances in storage and processing have provided the ability of automatic gathering of information which in turn leads to fast and contineous flow of data. The data which are produced and stored in this way are named data streams. It has many applications such as processing financial transactions, the recorded data of various sensors or the collected data by web sevices. Data streams are produced with high speed, large size and much dynamism and have some unique properties which make them applicable in precise modeling of many real data mining applications. The main challenge of data streams is the occurrence of concept drift which can be in four types: sudden, gradual, incremental or... 

    Seizure Detection in Generalized and Focal Seizure from EEG Signals

    , M.Sc. Thesis Sharif University of Technology Mozafari, Mohsen (Author) ; Hajipour, Sepideh (Supervisor)
    Abstract
    Epilepsy is one of the diseases that affects the quality of life of epileptic patients. Epileptic patients lose control during epileptic seizures and are more likely to face problems. Designing and creating a seizure detection system can reduce casualties from epileptic attacks. In this study, we present an automatic method that reduces the artifact from the raw signals, and then classifies the seizure and non-seizure epochs. At all stages, it is assumed that no information is available about the patient and this detection is made only based on the information of other patients. The data from this study were recorded in Temple Hospital and the recording conditions were not controlled, so... 

    Design and Efficient Implementation of Deep Learning Algorithm for ECG Classification

    , M.Sc. Thesis Sharif University of Technology Oveisi, Mohammad Hossein (Author) ; Hashemi, Matin (Supervisor)
    Abstract
    Cardiovascular diseases are the leading cause of death globally so early diagnosis of them is important. Many researchers focused on this field. First signs of cardiac diseases appear in the electrocardiogram signal. This signal represents the electrical activity of the heart so it’s primarily used for the detection and classification of cardiac arrhythmias. Permanent monitoring of this signal is not possible for specialists so we should do this by means of Artificial Intelligence. In this thesis, we use recurrent neural networks to classify electrocardiogram’s arrhythmias. This deep learning method, use two sources of data to learn from. The first part of data is global for everyone and the... 

    From local similarity to global coding: An application to image classification

    , Article Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Portland, OR ; 2013 , Pages 2794-2801 ; 10636919 (ISSN) Shaban, A ; Rabiee, H. R ; Farajtabar, M ; Ghazvininejad, M ; Sharif University of Technology
    2013
    Abstract
    Bag of words models for feature extraction have demonstrated top-notch performance in image classification. These representations are usually accompanied by a coding method. Recently, methods that code a descriptor giving regard to its nearby bases have proved efficacious. These methods take into account the nonlinear structure of descriptors, since local similarities are a good approximation of global similarities. However, they confine their usage of the global similarities to nearby bases. In this paper, we propose a coding scheme that brings into focus the manifold structure of descriptors, and devise a method to compute the global similarities of descriptors to the bases. Given a local... 

    ECG based human identification using wavelet distance measurement

    , Article Proceedings - 2011 4th International Conference on Biomedical Engineering and Informatics, BMEI 2011, 15 October 2011 through 17 October 2011 ; Volume 2 , October , 2011 , Pages 717-720 ; 9781424493524 (ISBN) Naraghi, M. E ; Shamsollahi, M. B ; Sharif University of Technology
    2011
    Abstract
    In this Paper a new approach is proposed for electrocardiogram (ECG) based human identification using wavelet distance measurement. The main advantage of this method is that it guarantees high accuracy even in abnormal cases. Furthermore, it possesses low sensitivity to noise. The algorithm was applied on 11 normal subjects and 10 abnormal subjects of MIT-BIH Database using single lead data, and a 100% human identification rate was on both normal and abnormal subjects. Adding artificial white noise to signals shows that the method is nearly accurate in SNR level above 5dB in normal subjects and 20dB in abnormal subjects  

    Structured features in naive bayes classification

    , Article 30th AAAI Conference on Artificial Intelligence, AAAI 2016, 12 February 2016 through 17 February 2016 ; 2016 , Pages 3233-3240 ; 9781577357605 (ISBN) Choi, A ; Tavabi, N ; Darwiche, A ; Artificial Intelligence; Baidu; et al.; IBM; Infosys; NSF ; Sharif University of Technology
    AAAI press  2016
    Abstract
    We propose the structured naive Bayes (SNB) classifier, which augments the ubiquitous naive Bayes classifier with structured features. SNB classifiers facilitate the use of complex features, such as combinatorial objects (e.g., graphs, paths and orders) in a general but systematic way. Underlying the SNB classifier is the recently proposed Probabilistic Sentential Decision Diagram (PSDD), which is a tractable representation of probability distributions over structured spaces. We illustrate the utility and generality of the SNB classifier via case studies. First, we show how we can distinguish players of simple games in terms of play style and skill level based purely on observing the games... 

    Incremental evolving domain adaptation

    , Article IEEE Transactions on Knowledge and Data Engineering ; Volume 28, Issue 8 , 2016 , Pages 2128-2141 ; 10414347 (ISSN) Bitarafan, A ; Soleymani Baghshah, M ; Gheisari, M ; Sharif University of Technology
    IEEE Computer Society 
    Abstract
    Almost all of the existing domain adaptation methods assume that all test data belong to a single stationary target distribution. However, in many real world applications, data arrive sequentially and the data distribution is continuously evolving. In this paper, we tackle the problem of adaptation to a continuously evolving target domain that has been recently introduced. We assume that the available data for the source domain are labeled but the examples of the target domain can be unlabeled and arrive sequentially. Moreover, the distribution of the target domain can evolve continuously over time. We propose the Evolving Domain Adaptation (EDA) method that first finds a new feature space... 

    Equimatchable regular graphs

    , Article Journal of Graph Theory ; Volume 87, Issue 1 , 2018 , Pages 35-45 ; 03649024 (ISSN) Akbari, S ; Ghodrati, A. H ; Hosseinzadeh, M. A ; Iranmanesh, A ; Sharif University of Technology
    Wiley-Liss Inc  2018
    Abstract
    A graph is called equimatchable if all of its maximal matchings have the same size. Kawarabayashi, Plummer, and Saito showed that the only connected equimatchable 3-regular graphs are K4 and K3, 3. We extend this result by showing that for an odd positive integer r, if G is a connected equimatchable r-regular graph, then G ϵ {Kr+1, Kr,r}. Also it is proved that for an even r, a connected triangle-free equimatchable r-regular graph is isomorphic to one of the graphs C5, C7, and Kr,r. © 2017 Wiley Periodicals, Inc  

    Configurable ultrasonic flaw classification of oil pipelines

    , Article Nondestructive Testing and Evaluation ; Volume 23, Issue 2 , 2008 , Pages 77-88 ; 10589759 (ISSN) Ravanbod, H ; Jalali, A ; Sharif University of Technology
    2008
    Abstract
    These two papers present an innovative method of configurable flaw classification and volume estimation in oil pipelines. In part I, the ultrasonic image acquisition system is introduced and surface and volume of the flaw are estimated with fuzzy image processing. A number of real figures illustrate the system performance. The flops calculation reveals that this fuzzy estimator could be integrated in a real time flaw detection system. In part II, at first, the dynamic detection of interesting points, i.e. as feature points at different levels of images, is proposed using wavelet transform. Furthermore, a guided searching strategy is used for the best matching from the coarse level to a fine... 

    A novel ensemble strategy for classification of prostate cancer protein mass spectra

    , Article 29th Annual International Conference of IEEE-EMBS, Engineering in Medicine and Biology Society, EMBC'07, Lyon, 23 August 2007 through 26 August 2007 ; 2007 , Pages 5987-5990 ; 05891019 (ISSN) ; 1424407885 (ISBN); 9781424407880 (ISBN) Assareh, A ; Moradi, M. H ; Esmaeili, V ; Sharif University of Technology
    2007
    Abstract
    Protein mass spectra pattern recognition is a new forum in which many machine learning algorithms have been conducted to enhance the chance of early cancer diagnosis. The high-dimensionality-small-sample (HDSS) problem of cancer proteomic datasets still requires more sophisticated approaches to improve the classification accuracy. In this study we present a simple ensemble strategy based on measuring the generalizing capability of different subsets of training data and apply it in making final decision. Using a limited number of biomarkers along with 5 classification algorithms, the proposed method achieved a promising performance over a well-known prostate cancer mass spectroscopy dataset.... 

    Topic recommendation for software repositories using multi-label classification algorithms

    , Article Empirical Software Engineering ; Volume 26, Issue 5 , 2021 ; 13823256 (ISSN) Izadi, M ; Heydarnoori, A ; Gousios, G ; Sharif University of Technology
    Springer  2021
    Abstract
    Many platforms exploit collaborative tagging to provide their users with faster and more accurate results while searching or navigating. Tags can communicate different concepts such as the main features, technologies, functionality, and the goal of a software repository. Recently, GitHub has enabled users to annotate repositories with topic tags. It has also provided a set of featured topics, and their possible aliases, carefully curated with the help of the community. This creates the opportunity to use this initial seed of topics to automatically annotate all remaining repositories, by training models that recommend high-quality topic tags to developers. In this work, we study the... 

    Optimal feature selection for SAR image classification using biogeography-based optimization (BBO), artificial bee colony (ABC) and support vector machine (SVM): a combined approach of optimization and machine learning

    , Article Computational Geosciences ; Volume 25, Issue 3 , 2021 , Pages 911-930 ; 14200597 (ISSN) Rostami, O ; Kaveh, M ; Sharif University of Technology
    Springer Science and Business Media Deutschland GmbH  2021
    Abstract
    Land cover classification is one of the most important applications of POLSAR images. In this paper, a hybrid biogeography-based optimization support vector machine (HBBOSVM) has been introduced to classify POLSAR images of RADARSAT 2 in band C acquired from San Francisco, USA. The main purpose of this classification is to minimize the number of features and maximize classification accuracy. The proposed method consists of three main steps: preprocessing, feature selection and classification. As preprocessing, radiometric calibration, speckle reduction and feature extraction have been performed. In the proposed HBBO, the combination of onlooker bee of artificial bee colony (ABC) and... 

    Semi-supervised Learning and its Application to Image Categorization

    , M.Sc. Thesis Sharif University of Technology Farajtabar, Mehrdad (Author) ; Rabiee, Hamid Reza (Supervisor)
    Abstract
    Traditional methods for data classification only make use of the labeled data. However, in most of the applications, labeling the unlabeled data is expensive, time consuming and requires expert knowledge. To overcome these problems, Semi-supervised Learning (SSL) methods have become an area of recent research that aim to effectively addressing the problem of limited labeled data.One of the recently introduced SSL methods is the classification based on geometric structure of the data, namely the data manifold. In this approach unlabeled data is utilized to recover the underlying structure of the data. The common assumption is that despite of being represented in a high dimensional space, data...