Loading...
Search for: soleymani-baghshah--m
0.012 seconds

    Multi-label classification with feature-aware implicit encoding and generalized cross-entropy loss

    , Article 24th Iranian Conference on Electrical Engineering, 10 May 2016 through 12 May 2016 ; 2016 , Pages 1574-1579 ; 9781467387897 (ISBN) Farahnak Ghazani, F ; Soleymani Baghshah, M ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc 
    Abstract
    In multi-label classification problems, each instance can simultaneously have multiple labels. Since the whole number of available labels in real-world applications tends to be (very) large, multi-label classification becomes an important challenge and recently label space dimension reduction (LSDR) methods have received attention. These methods first encode the output space to a low-dimensional latent space. Afterwards, they predict the latent space from the feature space and reconstruct the original output space using a suitable decoding method. The encoding method can be implicit which learns a code matrix in the latent space by solving an optimization problem or explicit which learns a... 

    A probabilistic multi-label classifier with missing and noisy labels handling capability

    , Article Pattern Recognition Letters ; Volume 89 , 2017 , Pages 18-24 ; 01678655 (ISSN) Akbarnejad, A ; Soleymani Baghshah, M ; Sharif University of Technology
    Elsevier B.V  2017
    Abstract
    Multi-label classification with a large set of labels is a challenging task. Label-Space Dimension Reduction (LSDR) is the most popular approach that addresses this problem. LSDR methods project the high-dimensional label vectors onto a low-dimensional space that can be predicted from the feature space. Many LSDR methods assume that the training data provide complete label vector for all training samples while this assumption is usually violated particularly when label vectors are high dimensional. In this paper, we propose a probabilistic model that has an effective mechanism to handle missing and noisy labels. In the proposed Bayesian network model, a set of auxiliary random variables,... 

    Joint predictive model and representation learning for visual domain adaptation

    , Article Engineering Applications of Artificial Intelligence ; Volume 58 , 2017 , Pages 157-170 ; 09521976 (ISSN) Gheisari, M ; Soleymani Baghshah, M ; Sharif University of Technology
    Elsevier Ltd  2017
    Abstract
    Traditional learning algorithms cannot perform well in scenarios where training data (source domain data) that are used to learn the model have a different distribution with test data (target domain data). The domain adaptation that intends to compensate this problem is an important capability for an intelligent agent. This paper presents a domain adaptation method which learns to adapt the data distribution of the source domain to that of the target domain where no labeled data of the target domain is available (and just unlabeled data are available for the target domain). Our method jointly learns a low dimensional representation space and an adaptive classifier. In fact, we try to find a... 

    An efficient semi-supervised multi-label classifier capable of handling missing labels

    , Article IEEE Transactions on Knowledge and Data Engineering ; 2018 ; 10414347 (ISSN) Hosseini Akbarnejad, A ; Soleymani Baghshah, M ; Sharif University of Technology
    IEEE Computer Society  2018
    Abstract
    Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues including: handling large-scale datasets with many instances and a large set of labels, compensating missing label assignments in the training set, considering correlations between labels, as well as exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these... 

    An Efficient semi-supervised multi-label classifier capable of handling missing labels

    , Article IEEE Transactions on Knowledge and Data Engineering ; Volume 31, Issue 2 , 2019 , Pages 229-242 ; 10414347 (ISSN) Hosseini Akbarnejad, A ; Soleymani Baghshah, M ; Sharif University of Technology
    IEEE Computer Society  2019
    Abstract
    Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues including: handling large-scale datasets with many instances and a large set of labels, compensating missing label assignments in the training set, considering correlations between labels, as well as exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low-dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these... 

    Active distance-based clustering using k-medoids

    , Article Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 19 April 2016 through 22 April 2016 ; Volume 9651 , 2016 , Pages 253-264 ; 03029743 (ISSN) ; 9783319317526 (ISBN) Aghaee, A ; Ghadiri, M ; Soleymani Baghshah, M ; Sharif University of Technology
    Springer Verlag  2016
    Abstract
    k-medoids algorithm is a partitional, centroid-based clustering algorithm which uses pairwise distances of data points and tries to directly decompose the dataset with n points into a set of k disjoint clusters. However, k-medoids itself requires all distances between data points that are not so easy to get in many applications. In this paper, we introduce a new method which requires only a small proportion of the whole set of distances and makes an effort to estimate an upperbound for unknown distances using the inquired ones. This algorithm makes use of the triangle inequality to calculate an upper-bound estimation of the unknown distances. Our method is built upon a recursive approach to... 

    Incremental evolving domain adaptation

    , Article IEEE Transactions on Knowledge and Data Engineering ; Volume 28, Issue 8 , 2016 , Pages 2128-2141 ; 10414347 (ISSN) Bitarafan, A ; Soleymani Baghshah, M ; Gheisari, M ; Sharif University of Technology
    IEEE Computer Society 
    Abstract
    Almost all of the existing domain adaptation methods assume that all test data belong to a single stationary target distribution. However, in many real world applications, data arrive sequentially and the data distribution is continuously evolving. In this paper, we tackle the problem of adaptation to a continuously evolving target domain that has been recently introduced. We assume that the available data for the source domain are labeled but the examples of the target domain can be unlabeled and arrive sequentially. Moreover, the distribution of the target domain can evolve continuously over time. We propose the Evolving Domain Adaptation (EDA) method that first finds a new feature space... 

    An attribute learning method for zero-shot recognition

    , Article 2017 25th Iranian Conference on Electrical Engineering, ICEE 2017, 2 May 2017 through 4 May 2017 ; 2017 , Pages 2235-2240 ; 9781509059638 (ISBN) Yazdanian, R ; Shojaee, S. M ; Soleymani Baghshah, M ; Sharif University of Technology
    Abstract
    Recently, the problem of integrating side information about classes has emerged in the learning settings like zero-shot learning. Although using multiple sources of information about the input space has been investigated in the last decade and many multi-view and multi-modal learning methods have already been introduced, the attribute learning for classes (output space) is a new problem that has been attended in the last few years. In this paper, we propose an attribute learning method that can use different sources of descriptions for classes to find new attributes that are more proper to be used as class signatures. Experimental results show that the learned attributes by the proposed... 

    Multi-modal deep distance metric learning

    , Article Intelligent Data Analysis ; Volume 21, Issue 6 , 2017 , Pages 1351-1369 ; 1088467X (ISSN) Roostaiyan, S. M ; Imani, E ; Soleymani Baghshah, M ; Sharif University of Technology
    IOS Press  2017
    Abstract
    In many real-world applications, data contain heterogeneous input modalities (e.g., web pages include images, text, etc.). Moreover, data such as images are usually described using different views (i.e. different sets of features). Learning a distance metric or similarity measure that originates from all input modalities or views is essential for many tasks such as content-based retrieval ones. In these cases, similar and dissimilar pairs of data can be used to find a better representation of data in which similarity and dissimilarity constraints are better satisfied. In this paper, we incorporate supervision in the form of pairwise similarity and/or dissimilarity constraints into... 

    Sample complexity of classification with compressed input

    , Article Neurocomputing ; Volume 415 , 2020 , Pages 286-294 Hafez Kolahi, H ; Kasaei, S ; Soleymani Baghshah, M ; Sharif University of Technology
    Elsevier B.V  2020
    Abstract
    One of the most studied problems in machine learning is finding reasonable constraints that guarantee the generalization of a learning algorithm. These constraints are usually expressed as some simplicity assumptions on the target. For instance, in the Vapnik–Chervonenkis (VC) theory the space of possible hypotheses is considered to have a limited VC dimension One way to formulate the simplicity assumption is via information theoretic concepts. In this paper, the constraint on the entropy H(X) of the input variable X is studied as a simplicity assumption. It is proven that the sample complexity to achieve an ∊-δ Probably Approximately Correct (PAC) hypothesis is bounded by [Formula... 

    DGSAN: Discrete generative self-adversarial network

    , Article Neurocomputing ; Volume 448 , 2021 , Pages 364-379 ; 09252312 (ISSN) Montahaei, E ; Alihosseini, D ; Soleymani Baghshah, M ; Sharif University of Technology
    Elsevier B.V  2021
    Abstract
    Although GAN-based methods have received many achievements in the last few years, they have not been entirely successful in generating discrete data. The most crucial challenge of these methods is the difficulty of passing the gradient from the discriminator to the generator when the generator outputs are discrete. Despite the fact that several attempts have been made to alleviate this problem, none of the existing GAN-based methods have improved the performance of text generation compared with the maximum likelihood approach in terms of both the quality and the diversity. In this paper, we proposed a new framework for generating discrete data by an adversarial approach in which there is no... 

    Transformer-based deep neural network language models for Alzheimer’s disease risk assessment from targeted speech

    , Article BMC Medical Informatics and Decision Making ; Volume 21, Issue 1 , 2021 ; 14726947 (ISSN) Roshanzamir, A ; Aghajan, H ; Soleymani Baghshah, M ; Sharif University of Technology
    BioMed Central Ltd  2021
    Abstract
    Background: We developed transformer-based deep learning models based on natural language processing for early risk assessment of Alzheimer’s disease from the picture description test. Methods: The lack of large datasets poses the most important limitation for using complex models that do not require feature engineering. Transformer-based pre-trained deep language models have recently made a large leap in NLP research and application. These models are pre-trained on available large datasets to understand natural language texts appropriately, and are shown to subsequently perform well on classification tasks with small training sets. The overall classification model is a simple classifier on... 

    MDL-CW: A multimodal deep learning framework with cross weights

    , Article 2016 IEEE Conference on Computer Vision and Pattern Recognition, 26 June 2016 through 1 July 2016 ; Volume 2016-January , 2016 , Pages 2601-2609 ; 10636919 (ISSN) ; 9781467388511 (ISBN) Rastegar, S ; Soleymani Baghshah, M ; Rabiee, H. R ; Shojaee, S. M ; Sharif University of Technology
    IEEE Computer Society 
    Abstract
    Deep learning has received much attention as of the most powerful approaches for multimodal representation learning in recent years. An ideal model for multimodal data can reason about missing modalities using the available ones, and usually provides more information when multiple modalities are being considered. All the previous deep models contain separate modality-specific networks and find a shared representation on top of those networks. Therefore, they only consider high level interactions between modalities to find a joint representation for them. In this paper, we propose a multimodal deep learning framework (MDLCW) that exploits the cross weights between representation of... 

    Deep graph generators: A survey

    , Article IEEE Access ; Volume 9 , 2021 , Pages 106675-106702 ; 21693536 (ISSN) Faez, F ; Ommi, Y ; Soleymani Baghshah, M ; Rabiee, H. R ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2021
    Abstract
    Deep generative models have achieved great success in areas such as image, speech, and natural language processing in the past few years. Thanks to the advances in graph-based deep learning, and in particular graph representation learning, deep graph generation methods have recently emerged with new applications ranging from discovering novel molecular structures to modeling social networks. This paper conducts a comprehensive survey on deep learning-based graph generation approaches and classifies them into five broad categories, namely, autoregressive, autoencoder-based, reinforcement learning-based, adversarial, and flow-based graph generators, providing the readers a detailed description... 

    SCGG: A deep structure-conditioned graph generative model

    , Article PLoS ONE ; Volume 17, Issue 11 November , 2022 ; 19326203 (ISSN) Faez, F ; Hashemi Dijujin, N ; Soleymani-Baghshah, M ; Rabiee, H. R ; Sharif University of Technology
    Public Library of Science  2022
    Abstract
    Deep learning-based graph generation approaches have remarkable capacities for graph data modeling, allowing them to solve a wide range of real-world problems. Making these methods able to consider different conditions during the generation procedure even increases their effectiveness by empowering them to generate new graph samples that meet the desired criteria. This paper presents a conditional deep graph generation method called SCGG that considers a particular type of structural conditions. Specifically, our proposed SCGG model takes an initial subgraph and autoregressively generates new nodes and their corresponding edges on top of the given conditioning substructure. The architecture... 

    RA-GCN: Graph convolutional network for disease prediction problems with imbalanced data

    , Article Medical Image Analysis ; Volume 75 , 2022 ; 13618415 (ISSN) Ghorbani, M ; Kazi, A ; Soleymani Baghshah, M ; Rabiee, H. R ; Navab, N ; Sharif University of Technology
    Elsevier B.V  2022
    Abstract
    Disease prediction is a well-known classification problem in medical applications. Graph Convolutional Networks (GCNs) provide a powerful tool for analyzing the patients’ features relative to each other. This can be achieved by modeling the problem as a graph node classification task, where each node is a patient. Due to the nature of such medical datasets, class imbalance is a prevalent issue in the field of disease prediction, where the distribution of classes is skewed. When the class imbalance is present in the data, the existing graph-based classifiers tend to be biased towards the major class(es) and neglect the samples in the minor class(es). On the other hand, the correct diagnosis... 

    GKD: Semi-supervised graph knowledge distillation for graph-independent inference

    , Article 24th International Conference on Medical Image Computing and Computer Assisted Intervention, MICCAI 2021, 27 September 2021 through 1 October 2021 ; Volume 12905 LNCS , 2021 , Pages 709-718 ; 03029743 (ISSN) ; 9783030872397 (ISBN) Ghorbani, M ; Bahrami, M ; Kazi, A ; Soleymani Baghshah, M ; Rabiee, H. R ; Navab, N ; Sharif University of Technology
    Springer Science and Business Media Deutschland GmbH  2021
    Abstract
    The increased amount of multi-modal medical data has opened the opportunities to simultaneously process various modalities such as imaging and non-imaging data to gain a comprehensive insight into the disease prediction domain. Recent studies using Graph Convolutional Networks (GCNs) provide novel semi-supervised approaches for integrating heterogeneous modalities while investigating the patients’ associations for disease prediction. However, when the meta-data used for graph construction is not available at inference time (e.g., coming from a distinct population), the conventional methods exhibit poor performance. To address this issue, we propose a novel semi-supervised approach named GKD... 

    CCGG: A deep autoregressive model for class-conditional graph generation

    , Article 31st ACM Web Conference, WWW 2022, 25 April 2022 ; 2022 , Pages 1092-1098 ; 9781450391306 (ISBN) Ommi, Y ; Yousefabadi, M ; Faez, F ; Sabour, A ; Soleymani Baghshah, M ; Rabiee, H. R ; ACM SIGWEB ; Sharif University of Technology
    Association for Computing Machinery, Inc  2022
    Abstract
    Graph data structures are fundamental for studying connected entities. With an increase in the number of applications where data is represented as graphs, the problem of graph generation has recently become a hot topic. However, despite its significance, conditional graph generation that creates graphs with desired features is relatively less explored in previous studies. This paper addresses the problem of class-conditional graph generation that uses class labels as generation constraints by introducing the Class Conditioned Graph Generator (CCGG). We built CCGG by injecting the class information as an additional input into a graph generator model and including a classification loss in its... 

    Personalized computational human phantoms via a hybrid model-based deep learning method

    , Article 15th IEEE International Symposium on Medical Measurements and Applications, MeMeA 2020, 1 June 2020 through 3 June 2020 ; July , 2020 Khodajou Chokami, H ; Bitarafan, A ; Dylov, D. V ; Soleymani Baghshah, M ; Hosseini, S. A ; IEEE; IEEE Instrumentation and Measurement Society; IEEE Sensors Council Italy Chapter; Politecnica di Bari; Politecnico di Torino; Societa Italiana di Analisi del Movimento in Clinica ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2020
    Abstract
    Computed tomography (CT) simulators are versatile tools for scanning protocol evaluation, optimization of geometrical design parameters, assessment of image reconstruction algorithms, and evaluation of the impact of future innovations attempting to improve the performance of CT scanners. Computational human phantoms (CHPs) play a key role in simulators for the radiation dosimetry and assessment of image quality tasks in the medical x-ray systems. Since the construction of patient-specific CHPs can be both difficult and time-consuming, nominal standard/reference CHPs have been established, yielding significant discrepancies in the special design and optimization demands of patient dose and...