Loading...
Search for: dimensionality-reduction
0.004 seconds
Total 31 records

    Robust fuzzy rough set based dimensionality reduction for big multimedia data hashing and unsupervised generative learning

    , Article Multimedia Tools and Applications ; Volume 80, Issue 12 , 2021 , Pages 17745-17772 ; 13807501 (ISSN) Khanzadi, P ; Majidi, B ; Adabi, S ; Patra, J. C ; Movaghar, A ; Sharif University of Technology
    Springer  2021
    Abstract
    The amount of high dimensional data produced by visual sensors in the smart environments and by autonomous vehicles is increasing exponentially. In order to search and model this data for real-time applications, the dimensionality of the data should be reduced. In this paper, a novel dimensionality reduction algorithm based on fuzzy rough set theory, called Centralized Binary Mapping (CBM), is proposed. The fuzzy CBM kernel is used for extracting the central elements and the memory cells from the blocks of high dimensional data. The proposed applications of CBM in this paper include hashing and generative modelling of multimedia big data. The robustness of the proposed CBM based hashing... 

    Various reduced-order surrogate models for fluid flow and mass transfer in human bronchial tree

    , Article Biomechanics and Modeling in Mechanobiology ; Volume 20, Issue 6 , 2021 , Pages 2203-2226 ; 16177959 (ISSN) Abbasi, Z ; Bozorgmehry Boozarjomehry, R ; Sharif University of Technology
    Springer Science and Business Media Deutschland GmbH  2021
    Abstract
    The bronchial tree plays a main role in the human respiratory system because the air distribution throughout the lungs and gas exchange with blood occur in the airways whose dimensions vary from several centimeters to micrometers. Organization of about 60,000 conducting airways and 33 million respiratory airways in a limited space results in a complex structure. Due to this inherent complexity and a high number of airways, using target-oriented dimensional reduction is inevitable. In addition, there is no general reduced-order model for various types of problems. This necessitates coming up with an appropriate model from a variety of different reduced-order models to solve the desired... 

    Various reduced-order surrogate models for fluid flow and mass transfer in human bronchial tree

    , Article Biomechanics and Modeling in Mechanobiology ; Volume 20, Issue 6 , 2021 , Pages 2203-2226 ; 16177959 (ISSN) Abbasi, Z ; Bozorgmehry Boozarjomehry, R ; Sharif University of Technology
    Springer Science and Business Media Deutschland GmbH  2021
    Abstract
    The bronchial tree plays a main role in the human respiratory system because the air distribution throughout the lungs and gas exchange with blood occur in the airways whose dimensions vary from several centimeters to micrometers. Organization of about 60,000 conducting airways and 33 million respiratory airways in a limited space results in a complex structure. Due to this inherent complexity and a high number of airways, using target-oriented dimensional reduction is inevitable. In addition, there is no general reduced-order model for various types of problems. This necessitates coming up with an appropriate model from a variety of different reduced-order models to solve the desired... 

    Multiclass classification of patients during different stages of Alzheimer's disease using fMRI time-series

    , Article Biomedical Physics and Engineering Express ; Volume 6, Issue 5 , 2020 Ahmadi, H ; Fatemizadeh, E ; Motie Nasrabadi, A ; Sharif University of Technology
    IOP Publishing Ltd  2020
    Abstract
    Alzheimer's Disease (AD) begins several years before the symptoms develop. It starts with Mild Cognitive Impairment (MCI) which can be separated into Early MCI and Late MCI (EMCI and LMCI). Functional connectivity analysis and classification are done among the different stages of illness with Functional Magnetic Resonance Imaging (fMRI). In this study, in addition to the four stages including healthy, EMCI, LMCI, and AD, the patients have been tracked for a year. Indeed, the classification has been done among 7 groups to analyze the functional connectivity changes in one year in different stages. After generating the functional connectivity graphs for eliminating the weak links, three... 

    Variants of vector space reductions for predicting the compositionality of English noun compounds

    , Article 12th International Conference on Language Resources and Evaluation, LREC 2020, 11 May 2020 through 16 May 2020 ; 2020 , Pages 4379-4387 Alipoor, P ; Schulte im Walde, S ; Sharif University of Technology
    European Language Resources Association (ELRA)  2020
    Abstract
    Predicting the degree of compositionality of noun compounds such as snowball and butterfly is a crucial ingredient for lexicography and Natural Language Processing applications, to know whether the compound should be treated as a whole, or through its constituents, and what it means. Computational approaches for an automatic prediction typically represent and compare compounds and their constituents within a vector space and use distributional similarity as a proxy to predict the semantic relatedness between the compounds and their constituents as the compound's degree of compositionality. This paper provides a systematic evaluation of vector-space reduction variants across kinds, exploring... 

    Private Inner product retrieval for distributed machine learning

    , Article 2019 IEEE International Symposium on Information Theory, ISIT 2019, 7 July 2019 through 12 July 2019 ; Volume 2019-July , 2019 , Pages 355-359 ; 21578095 (ISSN); 9781538692912 (ISBN) Mousavi, M. H ; Maddah Ali, M. A ; Mirmohseni, M ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    In this paper, we argue that in many basic algorithms for machine learning, including support vector machine (SVM) for classification, principal component analysis (PCA) for dimensionality reduction, and regression for dependency estimation, we need the inner products of the data samples, rather than the data samples themselves.Motivated by the above observation, we introduce the problem of private inner product retrieval for distributed machine learning, where we have a system including a database of some files, duplicated across some non-colluding servers. A user intends to retrieve a subset of specific size of the set of the inner product of every pair of data items in the database with... 

    An Efficient semi-supervised multi-label classifier capable of handling missing labels

    , Article IEEE Transactions on Knowledge and Data Engineering ; Volume 31, Issue 2 , 2019 , Pages 229-242 ; 10414347 (ISSN) Hosseini Akbarnejad, A ; Soleymani Baghshah, M ; Sharif University of Technology
    IEEE Computer Society  2019
    Abstract
    Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues including: handling large-scale datasets with many instances and a large set of labels, compensating missing label assignments in the training set, considering correlations between labels, as well as exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low-dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these... 

    An efficient semi-supervised multi-label classifier capable of handling missing labels

    , Article IEEE Transactions on Knowledge and Data Engineering ; 2018 ; 10414347 (ISSN) Hosseini Akbarnejad, A ; Soleymani Baghshah, M ; Sharif University of Technology
    IEEE Computer Society  2018
    Abstract
    Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues including: handling large-scale datasets with many instances and a large set of labels, compensating missing label assignments in the training set, considering correlations between labels, as well as exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these... 

    A novel forensic image analysis tool for discovering double JPEG compression clues

    , Article Multimedia Tools and Applications ; Volume 76, Issue 6 , 2017 , Pages 7749-7783 ; 13807501 (ISSN) Taimori, A ; Razzazi, F ; Behrad, A ; Ahmadi, A ; Babaie Zadeh, M ; Sharif University of Technology
    Springer New York LLC  2017
    Abstract
    This paper presents a novel technique to discover double JPEG compression traces. Existing detectors only operate in a scenario that the image under investigation is explicitly available in JPEG format. Consequently, if quantization information of JPEG files is unknown, their performance dramatically degrades. Our method addresses both forensic scenarios which results in a fresh perceptual detection pipeline. We suggest a dimensionality reduction algorithm to visualize behaviors of a big database including various single and double compressed images. Based on intuitions of visualization, three bottom-up, top-down and combined top-down/bottom-up learning strategies are proposed. Our tool... 

    Optimal supervised feature extraction in internet traffic classification

    , Article IEEE Pacific RIM Conference on Communications, Computers, and Signal Processing - Proceedings ; 2013 , Pages 102-107 ; 1555-5798 (ISSN) ; 9781479915019 (ISBN) Aliakbarian, M. S ; Fanian, A ; Saleh, F. S ; Gulliver, T. A ; Sharif University of Technology
    2013
    Abstract
    Internet traffic classification is important in many aspects of network management such as data exploitation detection, malicious user identification, and restricting application traffic. Previously, features such as port and protocol numbers have been used to classify traffic, but these features can now be changed easily, making their use in traffic classification inadequate. Consequently, traffic classification based on machine learning (ML) is now employed. The number of features used in an ML algorithm has a significant impact on performance, in particular accuracy. In this paper, a minimum best feature set is chosen using a supervised method to obtain uncorrelated features. Outlier... 

    A new ensemble method for feature ranking in text mining

    , Article International Journal on Artificial Intelligence Tools ; Volume 22, Issue 3 , June , 2013 ; 02182130 (ISSN) Sadeghi, S ; Beigy, H ; Sharif University of Technology
    2013
    Abstract
    Dimensionality reduction is a necessary task in data mining when working with high dimensional data. A type of dimensionality reduction is feature selection. Feature selection based on feature ranking has received much attention by researchers. The major reasons are its scalability, ease of use, and fast computation. Feature ranking methods can be divided into different categories and may use different measures for ranking features. Recently, ensemble methods have entered in the field of ranking and achieved more accuracy among others. Accordingly, in this paper a Heterogeneous ensemble based algorithm for feature ranking is proposed. The base ranking methods in this ensemble structure are... 

    Comparison of classification and dimensionality reduction methods used in fMRI decoding

    , Article Iranian Conference on Machine Vision and Image Processing, MVIP ; 2013 , Pages 175-179 ; 21666776 (ISSN) ; 9781467361842 (ISBN) Alamdari, N. T ; Fatemizadeh, E ; Sharif University of Technology
    2013
    Abstract
    In the last few years there has been growing interest in the use of functional Magnetic Resonance Imaging (fMRI) for brain mapping. To decode brain patterns in fMRI data, we need reliable and accurate classifiers. Towards this goal, we compared performance of eleven popular pattern recognition methods. Before performing pattern recognition, applying the dimensionality reduction methods can improve the classification performance; therefore, seven methods in region of interest (RDI) have been compared to answer the following question: which dimensionality reduction procedure performs best? In both tasks, in addition to measuring prediction accuracy, we estimated standard deviation of... 

    Manifold learning for ECG arrhythmia recognition

    , Article 2013 20th Iranian Conference on Biomedical Engineering, ICBME 2013 ; 2013 , Pages 126-131 Lashgari, E ; Jahed, M ; Khalaj, B ; Sharif University of Technology
    IEEE Computer Society  2013
    Abstract
    Heart is a complex system and we can find its function in electrocardiogram (ECG) signal. The records show high mortality rate of heart diseases. So it is essential to detect and recognize ECG arrhythmias. The problem with ECG analysis is the vast variations among morphologies of ECG signals. Premature Ventricular Contractions (PVC) is a common type of arrhythmia which may lead to critical situations and contains risk. This study, proposes a novel approach for detecting PVC and visualizing data with respect to ECG morphologies by using manifold learning. To this end, the Laplacian Eigenmaps - One of the reduction method and it is in the nonlinear category - is used to extract important... 

    Two-dimensional random projection

    , Article Signal Processing ; Volume 91, Issue 7 , 2011 , Pages 1589-1603 ; 01651684 (ISSN) Eftekhari, A ; Babaie-Zadeh, M ; Abrishami Moghaddam, H ; Sharif University of Technology
    2011
    Abstract
    As an alternative to adaptive nonlinear schemes for dimensionality reduction, linear random projection has recently proved to be a reliable means for high-dimensional data processing. Widespread application of conventional random projection in the context of image analysis is, however, mainly impeded by excessive computational and memory requirements. In this paper, a two-dimensional random projection scheme is considered as a remedy to this problem, and the associated key notion of concentration of measure is closely studied. It is then applied in the contexts of image classification and sparse image reconstruction. Finally, theoretical results are validated within a comprehensive set of... 

    An exploratory study to design a novel hand movement identification system

    , Article Computers in Biology and Medicine ; Volume 39, Issue 5 , 2009 , Pages 433-442 ; 00104825 (ISSN) Khezri, M ; Jahed, M ; Sharif University of Technology
    2009
    Abstract
    Electromyogram signal (EMG) is an electrical manifestation of contractions of muscles. Surface EMG (sEMG) signal collected from the surface of skin has been used in diverse applications. One of its usages is in pattern recognition of hand prosthesis movements. The ability of current prosthesis devices has been generally limited to simple opening and closing tasks, minimizing their efficacy compared to natural hand capabilities. In order to extend the abilities and accuracy of prosthesis arm movements and performance, a novel sEMG pattern recognizing system is proposed. To extract more pertinent information we extracted sEMGs for selected hand movements. These features constitute our main... 

    Ordinal embedding: Approximation algorithms and dimensionality reduction

    , Article 11th International Workshop on Approximation Algorithms for Combinatorial Optimization Problems, APPROX 2008 and 12th International Workshop on Randomization and Computation, RANDOM 2008, Boston, MA, 25 August 2008 through 27 August 2008 ; Volume 5171 LNCS , 2008 , Pages 21-34 ; 03029743 (ISSN) ; 9783540853626 (ISBN) Bǎdoiu, M ; Demaine, E. D ; Hajiaghayi, M ; Sidiropoulos, A ; Zadimoghaddam, M ; Sharif University of Technology
    2008
    Abstract
    This paper studies how to optimally embed a general metric, represented by a graph, into a target space while preserving the relative magnitudes of most distances. More precisely, in an ordinal embedding, we must preserve the relative order between pairs of distances (which pairs are larger or smaller), and not necessarily the values of the distances themselves. The relaxation of an ordinal embedding is the maximum ratio between two distances whose relative order is inverted by the embedding. We develop polynomial-time constant-factor approximation algorithms for minimizing the relaxation in an embedding of an unweighted graph into a line metric and into a tree metric. These two basic target... 

    Damage Detection in Offshore Jacket Structures Using Invers Vibration Problem (IVP)Technique with a Look on Probabilistic Bayesian Network Modeling

    , M.Sc. Thesis Sharif University of Technology Ahmadi, Ali (Author) ; Abbaspour, Madjid (Supervisor) ; Tabeshpour, Mohammad Reza (Supervisor)
    Abstract
    Damage detection in all kinds of human-made structures has been given serious attention as one of the main branches of engineering sciences, and significant progress has been made in this field, especially in the field of civil engineering, aerospace and heavy industries. However, although this topic is discussed academically in the field of offshore structures for two decades, not much development has happened in practice and it is still in the stage of knowledge development. One of the most important characteristics of practical online damage detection methods is the ability to be used knowing little information about the modal characteristics of the structure. In this research, an attempt... 

    Design and Implementation of Distributed Dimensionality Reduction Algorithms under Communication Constraints

    , M.Sc. Thesis Sharif University of Technology Rahmani, Mohammad Reza (Author) ; Maddah Ali, Mohammad Ali (Supervisor) ; Salehkaleybar, Saber (Supervisor)
    Abstract
    Nowadays we are witnessing the emergence of machine learning in various applications. One of the key problems in data science and machine learning is the problem of dimensionality reduction, which deals with finding a mapping that embeds samples to a lower-dimensional space such that, the relationships between the samples and their properties are preserved in the secondary space as much as possible. Obtaining such mapping is essential in today's high-dimensional settings. Moreover, due to the large volume of data and high-dimensional samples, it is infeasible or insecure to process and store all data in a single machine. As a result, we need to process data in a distributed manner.In this... 

    Robust Design Optimization for Fatigue Life with Geometric and Material Uncertainties of Mechanical Parts Under Random Loading Based on Maximizing Fatigue Life and Minimizing Uncertainty in Fatigue Llife Prediction

    , M.Sc. Thesis Sharif University of Technology Esfahani, Saeed (Author) ; Khodaygan, Saeed (Supervisor)
    Abstract
    Fatigue life prediction of a mechanical part is one of issues which a group of engineers are engaged with it and always they try to design the parts with the maximum of lifetime. Although many researches have been done in this field but yet we can see that predicted life are different from that happens in the reality because there are some uncertainties in the phenomena. Our effort in this project is creating an algorithm design so that the parts are designed by it, have the maximum fatigue life and the minimum uncertainty in prediction. In this project we have considered geometrical, material and random loading uncertainties as error resources. Older methods those are presented in this... 

    Analysis of Designed Experiments with Multichannel Profiles Response Variable

    , M.Sc. Thesis Sharif University of Technology Badfar, Mohammad (Author) ; Niaki, Akhavan (Supervisor)
    Abstract
    The purpose of this research is analyzing designed experiments which their response variable is in form of multichannel profiles. For this purpose, a number of experiments with multichannel profile response variable designed at first. Then by random effect model, output data calculated. Experiments output data dimension reduced using principal component analysis and its extensions. After that, regression analysis used to analyze results of dimensionality reduction data in order to estimate coefficients of potentially effective variables in response. At the end, coefficients of effective variables classified with a hierarchical classification method in order to discover change and its root...