Loading...
Search for: decision-trees
0.008 seconds

    Persian pronoun resolution using data driven approaches

    , Article 23rd International Conference on Information and Software Technologies, ICIST 2017, 12 October 2017 through 14 October 2017 ; Volume 756 , 2017 , Pages 574-585 ; 18650929 (ISSN); 9783319676418 (ISBN) Nourbakhsh, A ; Bahrani, M ; Sharif University of Technology
    Springer Verlag  2017
    Abstract
    Pronoun resolution is one of the challenges of natural language processing (NLP). The proposed solutions range from heuristic rule-based to machine learning data driven approaches. In this article, we follow a previous machine learning approach on Persian pronoun anaphora resolution. The primary goal of this paper is to improve the results, mainly by extracting more balanced data through using heuristic rules in instance sampling, and utilizing more relevant features in classification. Using PCAC2008 dataset, we consider noun phrase structure as a way to extract more suitable training data. Incorporated features include syntactic and semantic information. Finally, we train and test different... 

    Additive model decision tree-based adaptive wide-area damping controller design

    , Article IEEE Systems Journal ; Volume 12, Issue 1 , 2018 , Pages 328-339 ; 19328184 (ISSN) Beiraghi, M ; Ranjbar, A. M ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2018
    Abstract
    An adaptive wide-area damping controller (WADC) scheme based on gain scheduling (GS) technique and decision tree (DT) approach is presented. For the predefined operating scenarios, several robust controllers are designed, so that each controller performs well within a certain range of load/generation variation and contingencies. Considering the best acting controller in each operating condition as the target of classification, several DTs are trained using stage-wise additive modeling by multiclass exponential loss function (SAMME). The weighted combination of DTs is referred to as additive model DT (AMDT) that is used to interpolate the controller coefficients from phasor measurement unit... 

    Predictive analytics for fault reasoning in gas flow control facility: A hybrid fuzzy theory and expert system approach

    , Article Journal of Loss Prevention in the Process Industries ; Volume 77 , 2022 ; 09504230 (ISSN) Hassannayebi, E ; Nourian, R ; Mousavi, S. M ; Seyed Alizadeh, S. M ; Memarpour, M ; Sharif University of Technology
    Elsevier Ltd  2022
    Abstract
    Gas pressure reduction stations are essential in energy distribution networks because even a minor failure of these systems causes disruptive consumer problems. This study aims to introduce and implement a new knowledge-based platform that uses the synthesized expert's opinions to improve gas pressure control facilities. Given the record of failure of gas transmission system components and the data's uncertain nature, a new fuzzy expert system is developed that takes advantage of the object-oriented programming paradigm to analyze failure modes and conditions. The artificial intelligent model is designed in C# programming language, and a user-friendly interface is developed for ease of... 

    Model Selection for Complex Network Generation

    , M.Sc. Thesis Sharif University of Technology Motallebi, Sadegh (Author) ; Habibi, Jafar (Supervisor)
    Abstract
    Nowadays, there exist many real networks with distinctive features in comparison with random networks. Social networks, collaboration networks, citation networks, protein networks and communication networks are some example of complex network classes. Nowadays these networks are widespread and have many applications and the study of complex networks is an important research area. In many applications, the “synthetic networks generation” is one of the first levels of complex networks analysis. This level has many applications such as simulation and extrapolation. Many generative models are proposed for complex network modeling in recent years. By the use of these models, synthetic networks... 

    Developing an Artificial Intelligence Algorithm for Diagnosis and Prognosis of Failures

    , M.Sc. Thesis Sharif University of Technology Chenariyan Nakhaee, Muhammad (Author) ; Houshmand, Mahmood (Supervisor) ; Fattahi, Omid (Co-Advisor)
    Abstract
    Prognostics is necessary to ensure the reliability and safety of lithium-ion batteries for hybrid electric vehicles or satellites. This process can be achieved by capacity estimation, which is a direct fading indicator for assessing the state of health of a battery. However, the capacity of a lithium-ion battery onboard is difficult to monitor. This paper presents a data-driven approach for capacity estimation. First, new features are extracted from cyclic charge/discharge cycles and used as health indicators. Three algorithms are used to characterize the relationship between extracted features and battery capacity. Decision tree, random forest and boosting algorithms are trained using a... 

    Damping Controller Design for Inter-area Oscillations Using Wide-area Measurements

    , Ph.D. Dissertation Sharif University of Technology Beiraghi, Mojtaba (Author) ; Ranjbar, Ali Mohammad (Supervisor)
    Abstract
    The wide-area damping controller (WADC) has been proposed to enhance the damping of inter-area oscillations. The most challenging deficiencies to make this controller practical are the power system operating condition changes and the inherent time delay of remote signals. They can deteriorate the controller performance and the whole system stability if not properly accounted for in the design procedure. This thesis presents an adaptive delay compensator (ADC) on the basis of the latest development in the wide-area measurement system (WAMS) to cater to varying latencies. The proposed compensator can effectively reciprocate the phase deviation resulting from varying delays to improve the... 

    A multistage stochastic programming approach in project selection and scheduling

    , Article International Journal of Advanced Manufacturing Technology ; Vol. 70, issue. 9-12 , 2014 , pp. 2125-2137 ; ISSN: 02683768 Rafiee, M ; Kianfar, F ; Farhadkhani, M ; Sharif University of Technology
    Abstract
    In this paper, the joint problem of project selection and project scheduling under uncertain environment is formulated, analyzed, and solved by multistage stochastic programs. First of all, a general mathematical formulation which can address several versions of the problem is presented. A multi-period project selection and scheduling problem is introduced and modeled by multistage stochastic programs, which are effective for solving long-term planning problems under uncertainty. A set of scenarios and corresponding probabilities is applied to model the multivariate random data process (costs or revenues, available budget, chance of success). Then, due to computational complexity, a scenario... 

    The Differential Diagnosis of Crohn's Disease and Celiac Disease Using Nuclear Magnetic Resonance Spectroscopy

    , Article Applied Magnetic Resonance ; Volume 45, Issue 5 , May , 2014 , Pages 451-459 Fathi, F ; Kasmaee, L. M ; Sohrabzadeh, K ; Nejad, M. R ; Tafazzoli, M ; Oskouie, A. A ; Sharif University of Technology
    Abstract
    Crohn's disease and celiac disease belong to a group of autoimmune conditions that affect the digestive system, specifically the small intestine. They both attack the digestive tract and share many symptoms. Thus, the discovery of proper methods would be a major step toward differentiating celiac disease from Crohn's disease. The aim of this study was to search for the metabolic biomarkers to differentiate between these two diseases. Proton nuclear magnetic resonance spectroscopy (1H NMR) was employed as the metabolic profiling method to look for serum metabolites that differentiate between celiac disease and Crohn's disease. Classification of celiac disease and Crohn's disease was done... 

    Optimal supervised feature extraction in internet traffic classification

    , Article IEEE Pacific RIM Conference on Communications, Computers, and Signal Processing - Proceedings ; 2013 , Pages 102-107 ; 1555-5798 (ISSN) ; 9781479915019 (ISBN) Aliakbarian, M. S ; Fanian, A ; Saleh, F. S ; Gulliver, T. A ; Sharif University of Technology
    2013
    Abstract
    Internet traffic classification is important in many aspects of network management such as data exploitation detection, malicious user identification, and restricting application traffic. Previously, features such as port and protocol numbers have been used to classify traffic, but these features can now be changed easily, making their use in traffic classification inadequate. Consequently, traffic classification based on machine learning (ML) is now employed. The number of features used in an ML algorithm has a significant impact on performance, in particular accuracy. In this paper, a minimum best feature set is chosen using a supervised method to obtain uncorrelated features. Outlier... 

    Relationship between serum level of selenium and metabolites using 1hnmr-based metabonomics in parkinson's disease

    , Article Applied Magnetic Resonance ; Volume 44, Issue 6 , January , 2013 , Pages 721-734 ; 09379347 (ISSN) Fathi, F ; Kyani, A ; Darvizeh, F ; Mehrpour, M ; Tafazzoli, M ; Shahidi, G ; Sharif University of Technology
    2013
    Abstract
    Parkinson's disease (PD) is a neurodegenerative disease, which is not easily diagnosed using clinical tests and the discovery of proper methods would be a major step towards a successful diagnosis. In the present study, we employed metabolic profiling using proton nuclear magnetic resonance spectroscopy to find metabolites in serum, which are helpful for the diagnosis of PD. Classification of PD and healthy subject was done using random forest. Serum levels of selenium measured by atomic absorption spectrometry in PD group were lower than the serum selenium levels in the control group. The metabolites causing selenium changes in PD patients were identified using random forest, and a model... 

    An adaptive regression tree for non-stationary data streams

    , Article Proceedings of the ACM Symposium on Applied Computing ; March , 2013 , Pages 815-816 ; 9781450316569 (ISBN) Gholipour, A ; Hosseini, M. J ; Beigy, H ; Sharif University of Technology
    2013
    Abstract
    Data streams are endless flow of data produced in high speed, large size and usually non-stationary environments. The main property of these streams is the occurrence of concept drifts. Using decision trees is shown to be a powerful approach for accurate and fast learning of data streams. In this paper, we present an incremental regression tree that can predict the target variable of newly incoming instances. The tree is updated in the case of occurring concept drifts either by altering its structure or updating its embedded models. Experimental results show the effectiveness of our algorithm in speed and accuracy aspects in comparison to the best state-of-the-art methods  

    A scenario tree approach to multi-period project selection problem using real-option valuation method

    , Article International Journal of Advanced Manufacturing Technology ; Volume 56, Issue 1-4 , 2011 , Pages 411-420 ; 02683768 (ISSN) Rafiee, M ; Kianfar, F ; Sharif University of Technology
    Abstract
    Multistage stochastic programs are effective for solving long-term planning problems under uncertainty. Multi-period project portfolio selection problems can be modeled by multistage stochastic programs. These models utilize a set of scenarios and corresponding probabilities to model the multivariate random data process (costs or revenues, available budget, chance of success). For most practical problems, the optimization problem that contains all possible scenarios is too large. Due to computational complexity, this program is often approximated by a model involving a (much) smaller number of scenarios. The scenario reduction algorithms determine a subset of the initial scenario set and... 

    Persian handwritten digit recognition by random forest and convolutional neural networks

    , Article 9th Iranian Conference on Machine Vision and Image Processing,18 November 2015 through 19 November 2015 ; Volume 2016-February , 2015 , Pages 37-40 ; 21666776 (ISSN) ; 9781467385398 (ISBN) Zamani, Y ; Souri, Y ; Rashidi, H ; Kasaei, S ; Sharif University of Technology
    IEEE Computer Society 
    Abstract
    Persian handwritten digit recognition has attracted some interests in the research community by introduction of large Hoda dataset. In this paper, the well-known random forest (RF) and convolutional neural network (CNN) algorithms are investigated for Persian handwritten digit recognition on the Hoda dataset. Using the Hoda dataset as a standard testbed, we have performed some experiments with different preprocessing steps, feature types, and baselines. It is then shown that RFs and CNNs perform competitively with the state-of-the-art methods on this dataset, while CNNs being the fastest if appropriate hardware is available  

    A new analysis of RC4: A data mining approach (J48)

    , Article SECRYPT 2009 - International Conference on Security and Cryptography, Proceedings, 7 July 2009 through 7 October 2009, Milan ; 2009 , Pages 213-218 ; 9789896740054 (ISBN) HajSalehi Sichani, M ; Movaghar, A ; Sharif University of Technology
    Abstract
    This paper combines the cryptanalysis of RC4 and Data mining algorithm. It analyzes RC4 by Data mining algorithm (J48) for the first time and discloses more vulnerabilities of RC4. The motivation for this paper is combining Artificial Intelligence and Machine learning with cryptography to decrypt cyphertext in the shortest possible time. This analysis shows that lots of numbers in RC4 during different permutations and substitutions do not change their positions and are fixed in their places. This means KSA and PRGA are bad shuffle algorithms. In this method, the information theory and Decision trees are used which are very powerful for solving hard problems and extracting information from... 

    A novel fuzzy genetic annealing classification approach

    , Article EMS 2009 - UKSim 3rd European Modelling Symposium on Computer Modelling and Simulation, 25 November 2009 through 27 November 2009, Athens ; 2009 , Pages 87-91 ; 9780769538860 (ISBN) Baran Pouyan, M ; Mohamadi, H ; Saniee Abadeh, M ; Foroughifar, A ; Sharif University of Technology
    Abstract
    In this paper, a novel classification approach is presented. This approach uses fuzzy if-then rules for classification task and employs a hybrid optimization method to improve the accuracy and comprehensibility of obtained outcome. The mentioned optimization method has been formulated by simulated annealing and genetic algorithm. In fact, the genetic operators have been used as perturb functions at the core of simulated annealing heuristic. Results of proposed approach have been compared with several well-known methods such as Naïve Bayes, Support Vector Machine, Decision Tree, k-NN, and GBML, and show that our method performs the classification task as well as other famous algorithms. ©... 

    Detecting malicious applications using system services request behavior

    , Article 16th EAI International Conference on Mobile and Ubiquitous Systems: Computing, Networking and Services, MobiQuitous 2019, 12 November 2019 through 14 November 2019 ; 2019 , Pages 200-209 ; 9781450372831 (ISBN) Salehi, M ; Amini, M ; Crispo, B ; Sharif University of Technology
    Association for Computing Machinery  2019
    Abstract
    Widespread growth in Android malware stimulates security researchers to propose different methods for analyzing and detecting malicious behaviors in applications. Nevertheless, current solutions are ill-suited to extract the fine-grained behavior of Android applications accurately and efficiently. In this paper, we propose ServiceMonitor, a lightweight host-based detection system that dynamically detects malicious applications directly on mobile devices. ServiceMonitor reconstructs the fine-grained behavior of applications based on their interaction with system services (i.e. SMS manager, camera, wifi networking, etc). ServiceMonitor monitors the way applications request system services in... 

    DSCA: an inline and adaptive application identification approach in encrypted network traffic

    , Article 3rd International Conference on Cryptography, Security and Privacy, ICCSP 2019 with Workshop 2019 the 4th International Conference on Multimedia and Image Processing, ICMIP 2019, 19 January 2019 through 21 January 2019 ; 2019 , Pages 39-43 ; 9781450366182 (ISBN) Nazari, Z ; Noferesti, M ; Jalili, R ; Sharif University of Technology
    Association for Computing Machinery  2019
    Abstract
    Adaptive application detection in today's high-bandwidth networks is resource consuming and inaccurate due to the high volume, velocity, and variety characteristics of the networks traffic. To generate a robust classifier for identifying applications over encrypted traffic, we proposed DSCA as a DPI-based Stream Classification Algorithm. DSCA utilizes applications detected by the DPI, Deep Packet Inspection technique, as ground truth data and updates the classification model accordingly. To reduce the classification algorithms overhead without accuracy reduction, a feature selection method, named CfsSubsetEval, is deployed in DSCA. The proposed approach is implemented via the MOA tool and... 

    Supervised fuzzy partitioning

    , Article Pattern Recognition ; Volume 97 , 2020 Ashtari, P ; Nateghi Haredasht, F ; Beigy, H ; Sharif University of Technology
    Elsevier Ltd  2020
    Abstract
    Centroid-based methods including k-means and fuzzy c-means are known as effective and easy-to-implement approaches to clustering purposes in many applications. However, these algorithms cannot be directly applied to supervised tasks. This paper thus presents a generative model extending the centroid-based clustering approach to be applicable to classification and regression tasks. Given an arbitrary loss function, the proposed approach, termed Supervised Fuzzy Partitioning (SFP), incorporates labels information into its objective function through a surrogate term penalizing the empirical risk. Entropy-based regularization is also employed to fuzzify the partition and to weight features,... 

    Predicting scientific research trends based on link prediction in keyword networks

    , Article Journal of Informetrics ; Volume 14, Issue 4 , 2020 Behrouzi, S ; Shafaeipour Sarmoor, Z ; Hajsadeghi, K ; Kavousi, K ; Sharif University of Technology
    Elsevier Ltd  2020
    Abstract
    The rapid development of scientific fields in this modern era has raised the concern for prospective scholars to find a proper research field to conduct their future studies. Thus, having a vision of future could be helpful to pick the right path for doing research and ensuring that it is worth investing in. In this study, we use article keywords of computer science journals and conferences, assigned by INSPEC controlled indexing, to construct a temporal scientific knowledge network. By observing keyword networks snapshots over time, we can utilize the link prediction methods to foresee the future structures of these networks. We use two different approaches for this link prediction problem.... 

    Improving quality of a post's set of answers in stack overflow

    , Article 46th Euromicro Conference on Software Engineering and Advanced Applications, SEAA 2020, 26 August 2020 through 28 August 2020 ; 2020 , Pages 504-512 Tavakoli, M ; Izadi, M ; Heydarnoori, A ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2020
    Abstract
    Community Question Answering platforms such as Stack Overflow help a wide range of users solve their challenges on-line. As the popularity of these communities has grown over the years, both the number of members and posts have escalated. Also, due to the diverse backgrounds, skills, expertise, and viewpoints of users, each question may obtain more than one answer. Therefore, the focus has changed toward producing posts that have a set of answers more valuable for the community as a whole, not just one accepted-answer aimed at satisfying only the question-asker. Same as every universal community, a large number of low-quality posts on Stack Overflow require improvement. We call these posts...