Loading...
Search for:
dimensionality-reduction
0.009 seconds
Total 31 records
Feature Ranking in Text Classification
, M.Sc. Thesis Sharif University of Technology ; Beigy, Hamid (Supervisor)
Abstract
Text classification is one if the widest and most important applications in data mining. Because of the huge number of features in these applications, a method for dimensionality reduction is needed before applying the classification algorithm. Various number of methods for dimensionality reduction and feature selection are proposed. Feature selection based on feature ranking has received much attention by researchers. The major reasons are their scalability, ease of use, and fast computation. Feature ranking methods are divided to different categories and use different measures for scoring features. Recently ensemble methods have entered the field of ranking, and achieved more accuracy...
Using Manifold Learning for ECG Processing
, M.Sc. Thesis Sharif University of Technology ; Jahed, Mehran (Supervisor) ; Hossein Khalaj, Babak (Supervisor)
Abstract
The human heart is a complex system that contains many clues about its function in its electrocardiogram (ECG) signal. Due to the high mortality rate of heart diseases, detection and recognition of ECG arrhythmias is essential. The most difficult problem faced by ECG analysis is the vast variations among morphologies of ECG signals. In this study, we propose an approach for y detection of abnormal beats and data visualization with respect to ECG morphologies by using manifold learning. In order to do so, a nonlinear dimensionality reduction method based on the Laplacian Eigenmaps is used to reduce the high dimensions of the ECG signals, followed by the application of Bayesian and FLDA method...
Analysis of Designed Experiments with Multichannel Profiles Response Variable
, M.Sc. Thesis Sharif University of Technology ; Niaki, Akhavan (Supervisor)
Abstract
The purpose of this research is analyzing designed experiments which their response variable is in form of multichannel profiles. For this purpose, a number of experiments with multichannel profile response variable designed at first. Then by random effect model, output data calculated. Experiments output data dimension reduced using principal component analysis and its extensions. After that, regression analysis used to analyze results of dimensionality reduction data in order to estimate coefficients of potentially effective variables in response. At the end, coefficients of effective variables classified with a hierarchical classification method in order to discover change and its root...
The effect of a two steps searching mechanism Using Feature Vectors Related to Image Class in Improving the Performance of CBIR System
,
M.Sc. Thesis
Sharif University of Technology
;
Jamzad, Mansoor
(Supervisor)
;
Manzuri Shalmani, Mohammad Taghi
(Co-Advisor)
Abstract
Nowadays, retrieval is an inseparable part of user activities and due to growing usage of Content-Based Image Retrieval (CBIR), it has become a hot and challenging research topic specially in the past decade. The most important challenge that retrieval systems (including CBIR systems) are facing is the semantic gap between abstractions in the user’s mind and what is searched. One of the ways of dealing with this challenge is getting more information from the user about what he needs and so decreasing the distance between user’s will and what he gives to search engine as the description of his need. In this research, the class of query image is supposed to be given. For using this...
Geometrical Fracture Modeling Within Multiple-Point Statistics Framework
, M.Sc. Thesis Sharif University of Technology ; Masihi, Mohsen (Supervisor) ; Rasaei, Mohammad Reza (Supervisor) ; Eskandaridalvand, Kiomars (Supervisor) ; Shahalipour, Reza (Co-Advisor)
Abstract
Majority of the oil and gas reservoirs, in the main hydrocarbon production regions around the world, are naturally fractured reservoirs. Fractures play an important role in reservoir fluid flow either in the form of high permeable complex conduits or strong permeability anisotropies. Realistic characterization of naturally fractured reservoirs requires an exhaustive understanding of fracture connectivity and fracture pattern geometry. These subsequently demand description of many fracture parameters such as density (intensity), spacing, orientation, size and aperture. Therefore, a first step in fractured reservoirs characterization is the static geometric modeling of the subsurface fracture...
Design and Implementation of Distributed Dimensionality Reduction Algorithms under Communication Constraints
, M.Sc. Thesis Sharif University of Technology ; Maddah Ali, Mohammad Ali (Supervisor) ; Salehkaleybar, Saber (Supervisor)
Abstract
Nowadays we are witnessing the emergence of machine learning in various applications. One of the key problems in data science and machine learning is the problem of dimensionality reduction, which deals with finding a mapping that embeds samples to a lower-dimensional space such that, the relationships between the samples and their properties are preserved in the secondary space as much as possible. Obtaining such mapping is essential in today's high-dimensional settings. Moreover, due to the large volume of data and high-dimensional samples, it is infeasible or insecure to process and store all data in a single machine. As a result, we need to process data in a distributed manner.In this...
Damage Detection in Offshore Jacket Structures Using Invers Vibration Problem (IVP)Technique with a Look on Probabilistic Bayesian Network Modeling
, M.Sc. Thesis Sharif University of Technology ; Abbaspour, Madjid (Supervisor) ; Tabeshpour, Mohammad Reza (Supervisor)
Abstract
Damage detection in all kinds of human-made structures has been given serious attention as one of the main branches of engineering sciences, and significant progress has been made in this field, especially in the field of civil engineering, aerospace and heavy industries. However, although this topic is discussed academically in the field of offshore structures for two decades, not much development has happened in practice and it is still in the stage of knowledge development. One of the most important characteristics of practical online damage detection methods is the ability to be used knowing little information about the modal characteristics of the structure. In this research, an attempt...
Robust Design Optimization for Fatigue Life with Geometric and Material Uncertainties of Mechanical Parts Under Random Loading Based on Maximizing Fatigue Life and Minimizing Uncertainty in Fatigue Llife Prediction
, M.Sc. Thesis Sharif University of Technology ; Khodaygan, Saeed (Supervisor)
Abstract
Fatigue life prediction of a mechanical part is one of issues which a group of engineers are engaged with it and always they try to design the parts with the maximum of lifetime. Although many researches have been done in this field but yet we can see that predicted life are different from that happens in the reality because there are some uncertainties in the phenomena. Our effort in this project is creating an algorithm design so that the parts are designed by it, have the maximum fatigue life and the minimum uncertainty in prediction. In this project we have considered geometrical, material and random loading uncertainties as error resources. Older methods those are presented in this...
Various reduced-order surrogate models for fluid flow and mass transfer in human bronchial tree
, Article Biomechanics and Modeling in Mechanobiology ; Volume 20, Issue 6 , 2021 , Pages 2203-2226 ; 16177959 (ISSN) ; Bozorgmehry Boozarjomehry, R ; Sharif University of Technology
Springer Science and Business Media Deutschland GmbH
2021
Abstract
The bronchial tree plays a main role in the human respiratory system because the air distribution throughout the lungs and gas exchange with blood occur in the airways whose dimensions vary from several centimeters to micrometers. Organization of about 60,000 conducting airways and 33 million respiratory airways in a limited space results in a complex structure. Due to this inherent complexity and a high number of airways, using target-oriented dimensional reduction is inevitable. In addition, there is no general reduced-order model for various types of problems. This necessitates coming up with an appropriate model from a variety of different reduced-order models to solve the desired...
Various reduced-order surrogate models for fluid flow and mass transfer in human bronchial tree
, Article Biomechanics and Modeling in Mechanobiology ; Volume 20, Issue 6 , 2021 , Pages 2203-2226 ; 16177959 (ISSN) ; Bozorgmehry Boozarjomehry, R ; Sharif University of Technology
Springer Science and Business Media Deutschland GmbH
2021
Abstract
The bronchial tree plays a main role in the human respiratory system because the air distribution throughout the lungs and gas exchange with blood occur in the airways whose dimensions vary from several centimeters to micrometers. Organization of about 60,000 conducting airways and 33 million respiratory airways in a limited space results in a complex structure. Due to this inherent complexity and a high number of airways, using target-oriented dimensional reduction is inevitable. In addition, there is no general reduced-order model for various types of problems. This necessitates coming up with an appropriate model from a variety of different reduced-order models to solve the desired...
A novel forensic image analysis tool for discovering double JPEG compression clues
, Article Multimedia Tools and Applications ; Volume 76, Issue 6 , 2017 , Pages 7749-7783 ; 13807501 (ISSN) ; Razzazi, F ; Behrad, A ; Ahmadi, A ; Babaie Zadeh, M ; Sharif University of Technology
Springer New York LLC
2017
Abstract
This paper presents a novel technique to discover double JPEG compression traces. Existing detectors only operate in a scenario that the image under investigation is explicitly available in JPEG format. Consequently, if quantization information of JPEG files is unknown, their performance dramatically degrades. Our method addresses both forensic scenarios which results in a fresh perceptual detection pipeline. We suggest a dimensionality reduction algorithm to visualize behaviors of a big database including various single and double compressed images. Based on intuitions of visualization, three bottom-up, top-down and combined top-down/bottom-up learning strategies are proposed. Our tool...
Robust fuzzy rough set based dimensionality reduction for big multimedia data hashing and unsupervised generative learning
, Article Multimedia Tools and Applications ; Volume 80, Issue 12 , 2021 , Pages 17745-17772 ; 13807501 (ISSN) ; Majidi, B ; Adabi, S ; Patra, J. C ; Movaghar, A ; Sharif University of Technology
Springer
2021
Abstract
The amount of high dimensional data produced by visual sensors in the smart environments and by autonomous vehicles is increasing exponentially. In order to search and model this data for real-time applications, the dimensionality of the data should be reduced. In this paper, a novel dimensionality reduction algorithm based on fuzzy rough set theory, called Centralized Binary Mapping (CBM), is proposed. The fuzzy CBM kernel is used for extracting the central elements and the memory cells from the blocks of high dimensional data. The proposed applications of CBM in this paper include hashing and generative modelling of multimedia big data. The robustness of the proposed CBM based hashing...
Multiclass classification of patients during different stages of Alzheimer's disease using fMRI time-series
, Article Biomedical Physics and Engineering Express ; Volume 6, Issue 5 , 2020 ; Fatemizadeh, E ; Motie Nasrabadi, A ; Sharif University of Technology
IOP Publishing Ltd
2020
Abstract
Alzheimer's Disease (AD) begins several years before the symptoms develop. It starts with Mild Cognitive Impairment (MCI) which can be separated into Early MCI and Late MCI (EMCI and LMCI). Functional connectivity analysis and classification are done among the different stages of illness with Functional Magnetic Resonance Imaging (fMRI). In this study, in addition to the four stages including healthy, EMCI, LMCI, and AD, the patients have been tracked for a year. Indeed, the classification has been done among 7 groups to analyze the functional connectivity changes in one year in different stages. After generating the functional connectivity graphs for eliminating the weak links, three...
Private Inner product retrieval for distributed machine learning
, Article 2019 IEEE International Symposium on Information Theory, ISIT 2019, 7 July 2019 through 12 July 2019 ; Volume 2019-July , 2019 , Pages 355-359 ; 21578095 (ISSN); 9781538692912 (ISBN) ; Maddah Ali, M. A ; Mirmohseni, M ; Sharif University of Technology
Institute of Electrical and Electronics Engineers Inc
2019
Abstract
In this paper, we argue that in many basic algorithms for machine learning, including support vector machine (SVM) for classification, principal component analysis (PCA) for dimensionality reduction, and regression for dependency estimation, we need the inner products of the data samples, rather than the data samples themselves.Motivated by the above observation, we introduce the problem of private inner product retrieval for distributed machine learning, where we have a system including a database of some files, duplicated across some non-colluding servers. A user intends to retrieve a subset of specific size of the set of the inner product of every pair of data items in the database with...
Manifold learning for ECG arrhythmia recognition
, Article 2013 20th Iranian Conference on Biomedical Engineering, ICBME 2013 ; 2013 , Pages 126-131 ; Jahed, M ; Khalaj, B ; Sharif University of Technology
IEEE Computer Society
2013
Abstract
Heart is a complex system and we can find its function in electrocardiogram (ECG) signal. The records show high mortality rate of heart diseases. So it is essential to detect and recognize ECG arrhythmias. The problem with ECG analysis is the vast variations among morphologies of ECG signals. Premature Ventricular Contractions (PVC) is a common type of arrhythmia which may lead to critical situations and contains risk. This study, proposes a novel approach for detecting PVC and visualizing data with respect to ECG morphologies by using manifold learning. To this end, the Laplacian Eigenmaps - One of the reduction method and it is in the nonlinear category - is used to extract important...
An efficient semi-supervised multi-label classifier capable of handling missing labels
, Article IEEE Transactions on Knowledge and Data Engineering ; 2018 ; 10414347 (ISSN) ; Soleymani Baghshah, M ; Sharif University of Technology
IEEE Computer Society
2018
Abstract
Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues including: handling large-scale datasets with many instances and a large set of labels, compensating missing label assignments in the training set, considering correlations between labels, as well as exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these...
An Efficient semi-supervised multi-label classifier capable of handling missing labels
, Article IEEE Transactions on Knowledge and Data Engineering ; Volume 31, Issue 2 , 2019 , Pages 229-242 ; 10414347 (ISSN) ; Soleymani Baghshah, M ; Sharif University of Technology
IEEE Computer Society
2019
Abstract
Multi-label classification has received considerable interest in recent years. Multi-label classifiers usually need to address many issues including: handling large-scale datasets with many instances and a large set of labels, compensating missing label assignments in the training set, considering correlations between labels, as well as exploiting unlabeled data to improve prediction performance. To tackle datasets with a large set of labels, embedding-based methods represent the label assignments in a low-dimensional space. Many state-of-the-art embedding-based methods use a linear dimensionality reduction to map the label assignments to a low-dimensional space. However, by doing so, these...
Nonlinear Dimensionality Reduction via Path-Based Isometric Mapping
, Article IEEE Transactions on Pattern Analysis and Machine Intelligence ; Volume 38, Issue 7 , 2016 , Pages 1452-1464 ; 01628828 (ISSN) ; Joudaki, A ; Fatemizadeh, E ; Sharif University of Technology
IEEE Computer Society
Abstract
Nonlinear dimensionality reduction methods have demonstrated top-notch performance in many pattern recognition and image classification tasks. Despite their popularity, they suffer from highly expensive time and memory requirements, which render them inapplicable to large-scale datasets. To leverage such cases we propose a new method called "Path-Based Isomap". Similar to Isomap, we exploit geodesic paths to find the low-dimensional embedding. However, instead of preserving pairwise geodesic distances, the low-dimensional embedding is computed via a path-mapping algorithm. Due to the much fewer number of paths compared to number of data points, a significant improvement in time and memory...
Variants of vector space reductions for predicting the compositionality of English noun compounds
, Article 12th International Conference on Language Resources and Evaluation, LREC 2020, 11 May 2020 through 16 May 2020 ; 2020 , Pages 4379-4387 ; Schulte im Walde, S ; Sharif University of Technology
European Language Resources Association (ELRA)
2020
Abstract
Predicting the degree of compositionality of noun compounds such as snowball and butterfly is a crucial ingredient for lexicography and Natural Language Processing applications, to know whether the compound should be treated as a whole, or through its constituents, and what it means. Computational approaches for an automatic prediction typically represent and compare compounds and their constituents within a vector space and use distributional similarity as a proxy to predict the semantic relatedness between the compounds and their constituents as the compound's degree of compositionality. This paper provides a systematic evaluation of vector-space reduction variants across kinds, exploring...
A sensitivity study of FILTERSIM algorithm when applied to DFN modeling
, Article Journal of Petroleum Exploration and Production Technology ; Vol. 4, issue. 2 , June , 2014 , p. 153-174 ; ISSN: 21900558 ; Masihi, M ; Rasaei, M. R ; Eskandaridalvand, K ; Shahalipour, R ; Sharif University of Technology
Abstract
Realistic description of fractured reservoirs demands primarily for a comprehensive understanding of fracture networks and their geometry including various individual fracture parameters as well as network connectivities. Newly developed multiple-point geostatistical simulation methods like SIMPAT and FILTERSIM are able to model connectivity and complexity of fracture networks more effectively than traditional variogrambased methods. This approach is therefore adopted to be used in this paper. Among the multiple-point statistics algorithms, FILTERSIM has the priority of less computational effort than does SIMPAT by applying filters and modern dimensionality reduction techniques to the...