Loading...
Search for: feature-selection
0.012 seconds
Total 57 records

    Data Stream Whole Clustering

    , M.Sc. Thesis Sharif University of Technology Jafari Asbagh, Mohsen (Author) ; Abolhassani, Hassan (Supervisor)
    Abstract
    Due to the application of data streams in various data sources such as Web click streams, Web pages, and data generated by sensors and satellites, data streams have attracted a huge attention recently. A data stream is an ordered sequence of points that must be accessed in order and can be read only once or a small number of times. For mining such data, the ability to process in one pass along with limited memory usage is very important. Data stream clustering also has received a huge attention in recent years and numerous algorithms are developed in this field. None of them has paid attention to the feature selection problem as an effective factor in clustering quality especially when the... 

    An Investigation of Data Mining Methods in E-Learning

    , M.Sc. Thesis Sharif University of Technology Falakmasir, Mohammad Hassan (Author) ; Habibi, Jafar (Supervisor)
    Abstract
    In the pas few years, the use of web-based education systems have grown exponentially spurred by the fact that neither students nor teachers are bound to a specific location and that this form of computer-based education is virtually independent of any specific hardware platforms. These systems can offer a great variety of channels and workspaces to facilitate information sharing and communication between participants in a course, let educators distribute information to students, produce content material, prepare assignments and tests, engage in discussions, manage distance classes and enable collaborative learning with virtual classroom sessions, forums, chats, file storage areas, news... 

    Author Identification Using Statistical Methods

    , M.Sc. Thesis Sharif University of Technology Ameri, Reyhaneh (Author) ; Beigy, Hamid (Supervisor)
    Abstract
    With the increasing use of the Internet, we are witnessing the exchange of gigabytes of text in cyberspace. Cyberspace makes it possible for individuals to hide their true identity and enter this space with an spurious one. Abuses that occur in online communities due to the use of unknown identities, reduce confidence in this type of communication and create many challenges in this area. Hence the importance of maintaining the security of the space, controling the user-generated content and identifying the authors of texts increases day by day. In this Research we have presented an approach to author identification. This approach is based on modeling the style of the authors on the basis of... 

    A bi-objective Hybrid Algorithm to Reduce Noise and Data Dimension in Diabetes Disease Diagnosis Using Support Vector Machines

    , M.Sc. Thesis Sharif University of Technology Alirezaei, Mahsa (Author) ; Akhavan Niaki, Taghi (Supervisor)
    Abstract
    There is a significant amount of data in the healthcare domain and it is unfeasible to process such volume of data manually in order to diagnose the diseases and develop a treatment method in the short term. Diabetes mellitus has attracted the attention of data miners for a couple of reasons among which significant effects on the health and well-being of the contracted people and the economic burdens on the health care system are of prime importance. Researchers are trying to find a statistical correlation between the causes of this disease and factors like patient's lifestyle, hereditary information, etc. The purpose of data mining is to discover rules that facilitate the early diagnosis... 

    Estimation Influential Parameters in Operation of the Bushehr Nuclear Power Plant using Neural Network

    , M.Sc. Thesis Sharif University of Technology Ghanbari, Mohammad (Author) ; Ghofrani, Mohammad Bagher (Supervisor) ; Moshkbar Bakhshayesh, Khalil (Co-Advisor)
    Abstract
    Given many computing errors in current systems, a method appears necessary for predicting the nuclear parameter quickly and accurately. In this thesis, a neural network was used to predict safety in a nuclear power plant in order to develop an operating aid tool for preventive measures.First, some studies were conducted on appropriate feature selection for training neural networks. Some case studies have also been carried out on parameter prediction through soft computing in a power plant. In the next section, an expert judgment was taken into account to select DNBR (Departure from Nucleate Boiling Ratio) as a criterion for safety evaluation in the exploitation of a nuclear power plant (PWR)... 

    Learning and Associating Phenotypic Behavior of Organisms using Biological data

    , M.Sc. Thesis Sharif University of Technology Mehrabi, Aslan (Author) ; Beigy, Hamid (Supervisor) ; Motahari, Abolfazl (Supervisor)
    Abstract
    Datasets extracted from gene expression microarrays contain information about the phenotypic behavior of organisms. Turning this information into knowledge, i.e. finding associative genes with a given phenotype, is a daunting task. This is due to the high dimensionality of the data as the number of features on a gene expression microarray is usually very large. Moreover, a phenotype may change the expression pattern of a set of genes rather than changing each gene’s expression independently. To tackle the second problem, integrating other sources of information such as Protein-Protein Interaction (PPI) networks is required. In this thesis, the PPI network extracted from the String database... 

    Trajectory Estimation of a Vehicle Using Stereo Cameras

    , M.Sc. Thesis Sharif University of Technology Eftekhar, Parham (Author) ; Moghadasi, Reza (Supervisor)
    Abstract
    Visual odometry(VO) is the process of estimating the egomotion of an agent(e.g., vehicle, human, and robot) using the input of a single or multiple cameras attached to it. Application domains include robotics, wearable computing, augmented reality, and automotive. The term was chosen for its similarity to wheel odometry, which incrementally estimates the motion of a vehicle by integrating the number of turns of its wheels over time. Likewise, VO operates by incrementally estimating the pose of the vehicle through examination of the changes that movements induces on the images of its onboard cameras. For the VO to work effectively, there should be sufficient illumination in the environment... 

    Heart Disease Diagnosis Based on Heart Sounds Using Signal Processing and Machine Learning Algorithms

    , M.Sc. Thesis Sharif University of Technology Zeinali, Yasser (Author) ; Akhavan Niaki, Taghi (Supervisor)
    Abstract
    The research in this study aims to analyze data in healthcare, especially the diagnosis of several diseases caused by heart failure. Analyzing and analyzing this data can lead to the discovery of relationships and patterns that can play an important role in the decision-making process of relevant officials in any field. Today, medical data around the world is stored in large volumes for future research. Various infrastructures and software have been set up in many health centers and research centers affiliated with those organizations.In this research, the general process of work is such that the data related to the heart sounds, which are in the four broad categories of S1 to S4, are... 

    Estimation of Power Peaking Factor (PPF)Parameter in VVER Reactor Using Soft Computing, Case Study: Bushehr Nuclear Power Plant

    , M.Sc. Thesis Sharif University of Technology Sharifi, Saeed (Author) ; Ghofrani, Mohammad Bagher (Supervisor) ; Moshkbar Bakhshayesh, Khalil (Supervisor)
    Abstract
    operation of a nuclear power plant. Therefore, constant monitoring of the reactor core with reliable methods is important. To monitor the reactor heart, it is necessary to estimate and calculate some parameters, with high speed and accuracy, such as power distribution inside the heart, reactivity feedback coefficients, PPF, DNBR, etc. Analytical methods are often used to calculate these parameters, which in case of failure of the sensors, the calculations will be practically disrupted, and the method used in this research can solve these problems by losing a small amount of accuracy.In this study using real data of Bushehr nuclear power plant (BNPP) and by soft computing methods and... 

    Data Mining for Rational Use of Drugs

    , Ph.D. Dissertation Sharif University of Technology Moradi, Morteza (Author) ; Modarres Yazdi, Mohammad (Supervisor)
    Abstract
    Prescribing and consuming drugs more than necessary is considered polypharmacy, which is both wasteful and harmful. In this study, an innovative data mining framework is developed for analyzing prescriptions regarding polypharmacy. The approach consists of three main steps: pre-modeling, modeling, and post-modeling. In the first step, after collecting and cleaning the raw data, several novel features are extracted for physicians and patients. In the modeling step, decision trees are applied to generate a set of If-Then rules to detect and describe physicians’ features or patients’ features associated with polypharmacy. A novel approach based on the response surface methodology (RSM) is... 

    Development of a Validation and Calibration Algorithm for Thermohydraulic Sensors of Bushehr NPP First Circuit Using Neural Networks

    , M.Sc. Thesis Sharif University of Technology Ebrahimzadeh, Alireza (Author) ; Ghaffari, Mohsen (Supervisor) ; Moshkbar-Bakhshayesh, Khalil (Co-Supervisor)
    Abstract
    Sensors are one of the most vital instruments in Nuclear Power Plants (NPP), and operators and safety systems monitor various parts of the NPP and control transients by analyzing the values reported by the sensors. Failure to detect malfunctions or anomalies in them would lead to catastrophic consequences. A new approach based on thermo-hydraulic simulation by RELAP5 code and Feed-Forward Neural Networks (FFNN) is given to detect faulty sensors and estimate their correct value which are two main objectives of the current study. This approach consists of two main parts; The first part, Fault Detection Hyper Block (FDHB), responsible for detecting faulty sensors, and the second part,... 

    Traffic Data Modelling with Gaussian Processes

    , M.Sc. Thesis Sharif University of Technology Jamal Bafrani, Fateme (Author) ; Gholampour, Iman (Supervisor)
    Abstract
    In the transportation industry, one of the most important and fundamental problems is the traffic of vehicles in the transportation roads. This problem is especially seen in large and densely populated cities such as Tehran. If traffic control is not done properly, it can lead to problems such as reduced traffic dynamics, environmental pollution, wasted drivers' time, disorder and loss of energy. If the traffic control is done after creating a traffic problem, it will not bring good results and will have low efficiency. For this reason, optimal traffic management and control has been raised as an important issue, especially in large cities. Predicting traffic flow is one of the important and... 

    Develop a Monitoring Center Conceptual Framework for Chain Stores

    , M.Sc. Thesis Sharif University of Technology Kishani Farahani, Masoud (Author) ; Rajabi Ghahnaviyeh, Abbas (Supervisor)
    Abstract
    Among the equipment used in supermarkets, refrigeration systems should be the focus of energy efficiency initiatives; Because they are the biggest consumers of energy and refrigerant with significant maintenance costs. Fault detection and diagnosis (FDD) can provide considerable potential for energy savings as well as reduced maintenance costs. Although there have been numerous investigations of FDD for HVAC systems, there has been very little research on the application of FDD for supermarket refrigeration systems. Therefore, this thesis focuses on the application of FDD to these systems and helps to fill these research gaps. The studied system is a commercial refrigerator system on a... 

    Prognostic Biomarker Selection for Breast Cancer using Bioinformatics and Deep Learning

    , M.Sc. Thesis Sharif University of Technology Salimi , Adel (Author) ; Sharifi Zarchi, Ali (Supervisor)
    Abstract
    Triple Negative Breast Cancer (TNBC) is an invasive subtype of breast cancer. Finding prognostic biomarkers is helpful in choosing the appropriate treatment procedure for patients of this cancer. In recent years, the role of microRNAs in various biological processes, including cancer, has been identified, and their accessibility and stability have made these types of molecules an ideal biomarker. In the first phase of this study, with the aim of overcoming the limitations of previous studies, a new bioinformatics protocol has been proposed to investigate the prognostic miRNAs of triple negative breast cancer. First, using survival analysis, 56 prognostic miRNAs which had a significant... 

    PCR Amplification Prediction using Machine Learning

    , M.Sc. Thesis Sharif University of Technology Latifian, Niloofar (Author) ; Hossein Khalaj, Babak (Supervisor)
    Abstract
    Polymerase Chain Reaction (PCR) is a laboratory method for amplifying a part of DNA. This method is used in determining the sequence of genes, detecting pathogenic agents in epidemics, creating genetic changes in bacteria, diseases, plants and even animals. Many factors affect the quality of the reaction. Each of these factors can be effective in amplifying the target in DNA. If we can predict the result of PCR using the factors involved in the reaction, it will save a lot of money and time. The aim of this research is to predict the result of PCR amplification using machine learning methods. For this purpose, two methods are proposed: feature-based method and string-based method. In the... 

    Towards an automatic diagnosis system for lumbar disc herniation: the significance of local subset feature selection

    , Article Biomedical Engineering - Applications, Basis and Communications ; 2018 ; 10162372 (ISSN) Ebrahimzadeh, E ; Fayaz, F ; Nikravan, M ; Ahmadi, F ; Dolatabad, M. R ; Sharif University of Technology
    World Scientific Publishing Co. Pte Ltd  2018
    Abstract
    Herniation in the lumbar area is one of the most common diseases which results in lower back pain (LBP) causing discomfort and inconvenience in the patients' daily lives. A computer aided diagnosis (CAD) system can be of immense benefit as it generates diagnostic results within a short time while increasing precision of diagnosis and eliminating human errors. We have proposed a new method for automatic diagnosis of lumbar disc herniation based on clinical MRI data. We use T2-W sagittal and myelograph images. The presented method has been applied on 30 clinical cases, each containing 7 discs (210 lumbar discs) for the herniation diagnosis. We employ Otsu thresholding method to extract the... 

    Small-Scale building load forecast based on hybrid forecast engine

    , Article Neural Processing Letters ; Volume 48, Issue 1 , 2018 , Pages 329-351 ; 13704621 (ISSN) Mohammadi, M ; Talebpour, F ; Safaee, E ; Ghadimi, N ; Abedinia, O ; Sharif University of Technology
    Springer New York LLC  2018
    Abstract
    Electricity load forecasting plays an important role for optimal power system operation. Accordingly, short term load forecast (STLF) is also becoming an important task by researchers to tackle the mentioned problem. As a consequence of the highly non-smooth and volatile trend of the load time series specially in building levels, its STLF is even a more complex procedure than that of a power system. For this purpose, in this paper we proposed a new prediction model based on a new feature selection algorithm and hybrid forecast engine of enhanced version of empirical mode decomposition named sliding window EMD bundled with an intelligent algorithm. The proposed forecast engine is combined... 

    A new prediction model based on cascade NN for wind power prediction

    , Article Computational Economics ; March , 2018 , Pages 1-25 ; 09277099 (ISSN) Torabi, A ; Kiaian Mousavy, S. A ; Dashti, V ; Saeedi, M ; Yousefi, N ; Sharif University of Technology
    Springer New York LLC  2018
    Abstract
    This paper presents a new prediction model based on empirical mode decomposition, feature selection and hybrid forecast engine. The whole structure of proposed model is based on nonstationarity and non-convex nature of wind power signal. The hybrid forecast engine consists of three main stages as; empirical mode decomposition, an intelligent algorithm and three stage neural network. All parameters of proposed neural network will be optimized by intelligent algorithm. Effectiveness of the proposed model is tested with real-world hourly data of wind farms in Canada, Spain and Texas. In order to demonstrate the validity of the proposed model, it is compared with several other wind speed and... 

    Detection of change to SSVEPs using analysis of phase space topological : a novel approach

    , Article Neurophysiology ; Volume 51, Issue 3 , 2019 , Pages 180-190 ; 00902977 (ISSN) Soroush, M. Z ; Maghooli, K ; Pisheh, N. F ; Mohammadi, M ; Soroush, P. Z ; Tahvilian, P ; Sharif University of Technology
    Springer New York LLC  2019
    Abstract
    A novel method based on EEG nonlinear analysis and analysis of steady-state visual evoked potentials (SSVEPs) has been processed. The EEG phase space is reconstructed, and some new geometrical features are extracted. Statistical analysis is carried out based on ANOVA, and most significant features are selected and then fed into a multi-class support vector machine (MSVM). Both offline and online phases are considered to fully address SSVEP detection. In the offline mode, the whole design evaluation, feature selection, and classifier training are performed. In the online scenario, the proposed method is evaluated and the detection rate is reported for both phases. Subject-dependent and... 

    A new prediction model based on cascade NN for wind power prediction

    , Article Computational Economics ; Volume 53, Issue 3 , 2019 , Pages 1219-1243 ; 09277099 (ISSN) Torabi, A ; Kiaian Mousavy, S. A ; Dashti, V ; Saeedi, M ; Yousefi, N ; Sharif University of Technology
    Springer New York LLC  2019
    Abstract
    This paper presents a new prediction model based on empirical mode decomposition, feature selection and hybrid forecast engine. The whole structure of proposed model is based on nonstationarity and non-convex nature of wind power signal. The hybrid forecast engine consists of three main stages as; empirical mode decomposition, an intelligent algorithm and three stage neural network. All parameters of proposed neural network will be optimized by intelligent algorithm. Effectiveness of the proposed model is tested with real-world hourly data of wind farms in Canada, Spain and Texas. In order to demonstrate the validity of the proposed model, it is compared with several other wind speed and...