Loading...
Search for: database
0.018 seconds
Total 198 records

    IMOS: improved meta-aligner and minimap2 on spark

    , Article BMC Bioinformatics ; Volume 20, Issue 1 , 2019 ; 14712105 (ISSN) Hadadian Nejad Yousefi, M ; Goudarzi, M ; Motahari, A ; Sharif University of Technology
    BioMed Central Ltd  2019
    Abstract
    Background: Long reads provide valuable information regarding the sequence composition of genomes. Long reads are usually very noisy which renders their alignments on the reference genome a daunting task. It may take days to process datasets enough to sequence a human genome on a single node. Hence, it is of primary importance to have an aligner which can operate on distributed clusters of computers with high performance in accuracy and speed. Results: In this paper, we presented IMOS, an aligner for mapping noisy long reads to the reference genome. It can be used on a single node as well as on distributed nodes. In its single-node mode, IMOS is an Improved version of Meta-aligner (IM)... 

    Improvements on the k-center problem for uncertain data extended abstract

    , Article Proceedings of the ACM SIGACT-SIGMOD-SIGART Symposium on Principles of Database Systems ; 27 May , 2018 , Pages 425-433 ; 9781450347068 (ISBN) Alipour, S ; Jafari, A ; Sharif University of Technology
    Association for Computing Machinery  2018
    Abstract
    In real applications, there are situations where we need to model some problems based on uncertain data. This leads us to define an uncertain model for some classical geometric optimization problems and propose algorithms to solve them. The assigned version of the k-center problem for n uncertain points in a metric space is studied in this paper. The main approach is to replace each uncertain point with a clever choice of a certain point. We argue that the k-center solution for these certain replacements of our uncertain points, is a good constant approximation factor for the original uncertain k-center problem. This approach enables us to present fast and simple algorithms that give... 

    Improve the classification and sales management of products using multi-relational data mining

    , Article 2011 IEEE 3rd International Conference on Communication Software and Networks, ICCSN 2011, Xi'an, 27 May 2011 through 29 May 2011 ; 2011 , Pages 329-337 ; 9781612844855 (ISBN) Houshmand, M ; Alishahi, M ; Sharif University of Technology
    2011
    Abstract
    There are some elements such as competition among companies and changes in demands which result in changes of customers' behaviors. Therefore, paying no attention to these changes may lead to a reduction in company benefits and loss of customers. Since data and their analyses determine the activities and decision makings of companies, data quality is of paramount in analyzing them because misinformation leads to wrong decision making. Since data mining has been designed to find out multi repetition patterns, it can be used to improve the product sales violations by sales people and increase the quality of data. Most of data mining models available try to find patterns in one table, but the... 

    Integrated one-against-one classifiers as tools for virtual screening of compound databases: A case study with CNS inhibitors

    , Article Molecular Informatics ; Volume 32, Issue 8 , 2013 , Pages 742-753 ; 18681743 (ISSN) Jalali Heravi, M ; Mani-Varnosfaderani, A ; Valadkhani, A ; Sharif University of Technology
    2013
    Abstract
    A total of 21 833 inhibitors of the central nervous system (CNS) were collected from Binding-database and analyzed using discriminant analysis (DA) techniques. A combination of genetic algorithm and quadratic discriminant analysis (GA-QDA) was proposed as a tool for the classification of molecules based on their therapeutic targets and activities. The results indicated that the one-against-one (OAO) QDA classifiers correctly separate the molecules based on their therapeutic targets and are comparable with support vector machines. These classifiers help in charting the chemical space of the CNS inhibitors and finding specific subspaces occupied by particular classes of molecules. As a next... 

    Introducing a framework to create telephony speech databases from direct ones

    , Article 14th International Conference on Systems Signals and Image Processing, IWSSIP 2007 and 6th EURASIP Conference Focused on Speech and Image Processing, Multimedia Communications and Services, EC-SIPMCS 2007, Maribor, 27 June 2007 through 30 June 2007 ; November , 2007 , Pages 327-330 ; 9789612480295 (ISBN) Momtazi, S ; Sameti, H ; Vaisipour, S ; Tefagh, M ; Sharif University of Technology
    2007
    Abstract
    A Comprehensive speech database is one of the important tools for developing speech recognition systems; these tools are necessary for telephony recognition, too. Although adequate databases for direct speech recognizers exist, there is not an appropriate database for telephony speech recognizers. Most methods suggested for solving this problem are based on building new databases which tends to consume much time and many resources; or they used a filter which simulates circuit switch behavior to transform direct databases to telephony ones, in this case resulted databases have many differences with real telephony databases. In this paper we introduce a framework for creating telephony speech... 

    Intrusion detection using a fuzzy genetics-based learning algorithm

    , Article Journal of Network and Computer Applications ; Volume 30, Issue 1 , 2007 , Pages 414-428 ; 10848045 (ISSN) Saniee Abadeh, M ; Habibi, J ; Lucas, C ; Sharif University of Technology
    2007
    Abstract
    Fuzzy systems have demonstrated their ability to solve different kinds of problems in various applications domains. Currently, there is an increasing interest to augment fuzzy systems with learning and adaptation capabilities. Two of the most successful approaches to hybridize fuzzy systems with learning and adaptation methods have been made in the realm of soft computing. Neural fuzzy systems and genetic fuzzy systems hybridize the approximate reasoning method of fuzzy systems with the learning capabilities of neural networks and evolutionary algorithms. The objective of this paper is to describe a fuzzy genetics-based learning algorithm and discuss its usage to detect intrusion in a... 

    Iran atlas of offshore renewable energies

    , Article Renewable Energy ; Volume 36, Issue 1 , January , 2011 , Pages 388-398 ; 09601481 (ISSN) Abbaspour, M ; Rahimi, R ; Sharif University of Technology
    2011
    Abstract
    The aim of the present study is to provide an Atlas of IRAN Offshore Renewable Energy Resources (hereafter called 'the Atlas') to map out wave and tidal resources at a national scale, extending over the area of the Persian Gulf and Sea of Oman. Such an Atlas can provide necessary tools to identify the areas with greatest resource potential and within reach of present technology development. To estimate available tidal energy resources at the site, a two-dimensional tidally driven hydrodynamic numerical model of Persian Gulf was developed using the hydrodynamic model in the MIKE 21 Flow Model (MIKE 21HD), with validation using tidal elevation measurements and tidal stream diamonds from... 

    Isolatedword recognition based on intelligent segmentation by using hybrid HTD-HMM

    , Article International Conference on Circuits, Systems, Signal and Telecommunications - Proceedings, 21 October 2010 through 23 October 2010 ; October , 2011 , Pages 38-41 ; 9789604742714 (ISBN) Kazemi, A. R ; Ehsandoust, B. B ; Rezazadeh, C. A ; Ghaemmaghami, D. S ; Sharif University of Technology
    2011
    Abstract
    In recent years, IWR (Isolated Word Recognition) was one of the main concerns of speech processing. The challenging problems in this field appear when the database become so large or when we have a lot of word with similarly pronounce in the database. This paper introduces a general solution for a traditional problem in isolated similarly pronounced word recognition, especially in large databases. One the important problem of traditional IWR is referred to their segmentation algorithm, their methods were lacking in efficiency due to the following reasons: First, using equal segmentation is not at all intelligent at all and as a result, cannot produce accurate results; besides, utilizing... 

    KNNDIST: A non-parametric distance measure for speaker segmentation

    , Article 13th Annual Conference of the International Speech Communication Association 2012, INTERSPEECH 2012 ; Volume 3 , 2012 , Pages 2279-2282 ; 9781622767595 (ISBN) Mohammadi, S. H ; Sameti, H ; Langarani, M. S. E ; Tavanaei, A ; Sharif University of Technology
    2012
    Abstract
    A novel distance measure for distance-based speaker segmentation is proposed. This distance measure is nonparametric, in contrast to common distance measures used in speaker segmentation systems, which often assume a Gaussian distribution when measuring the distance between two audio segments. This distance measure is essentially a k-nearest-neighbor distance measure. Non-vowel segment removal in preprocessing stage is also proposed. Speaker segmentation performance is tested on artificially created conversations from the TIMIT database and two AMI conversations. For short window lengths, Missed Detection Rated is decreased significantly. For moderate window lengths, a decrease in both... 

    Knowledge-based design of space-time transmit code and receive filter for a multiple-input-multiple-output radar in signal-dependent interference

    , Article IET Radar, Sonar and Navigation ; Volume 9, Issue 8 , 2015 , Pages 1124-1135 ; 17518784 (ISSN) Karbasi, S. M ; Aubry, A ; Carotenuto, V ; Naghsh, M. M ; Bastani, M. H ; Sharif University of Technology
    Institution of Engineering and Technology  2015
    Abstract
    The authors deal with the robust design of multiple-input-multiple-output (MIMO) space-time transmit code (STTC) and space-time receive filter (STRF) for a point-like target embedded in signal-dependent interference. Specifically, they assume that the radar exploits knowledge provided by dynamic environmental database, to roughly predict the actual scattering scenario. Then, they devise an iterative method to optimise the (constrained) STTC and the (constrained) STRF which sequentially improves the worst-case (over interfering scatterers statistics) signal-to-interference-plus-noise ratio (SINR). Each iteration of the algorithm is handled via solving two (hidden) convex optimisation... 

    Knowledge discovery using a new interpretable simulated annealing based fuzzy classification system

    , Article Proceedings - 2009 1st Asian Conference on Intelligent Information and Database Systems, ACIIDS 2009, 1 April 2009 through 3 April 2009, Dong Hoi ; 2009 , Pages 271-276 ; 9780769535807 (ISBN) Mohamadia, H ; Habibib, J ; Moavena, S ; Sharif University of Technology
    2009
    Abstract
    This paper presents a new interpretable fuzzy classification system. Simulated annealing heuristic is employed to effectively investigate the large search space usually associated with classification problem. Here, two criteria are used to evaluate the proposed method. The first criterion is accuracy of extracted fuzzy if-then rules, and the other is comprehensibility of obtained rules. Experiments are performed with some data sets from UCI machine learning repository. Results are compared with several well-known classification algorithms, and show that the proposed approach provides more accurate and interpretable classification system. © 2009 IEEE  

    Large-scale image annotation using prototype-based models

    , Article ISPA 2011 - 7th International Symposium on Image and Signal Processing and Analysis ; 2011 , Pages 449-454 ; 9789531841597 (ISBN) Amiri, S. H ; Jamzad, M ; European Association for Signal Processing (EURASIP); IEEE Signal Processing Society; IEEE Region 8; IEEE Croatia Section; IEEE Croatia Section Signal Processing Chapter ; Sharif University of Technology
    Abstract
    Automatic image annotation is a challenging problem in the field of image retrieval. Dealing with large databases makes the annotation problem more difficult and therefore an effective approach is needed to manage such databases. In this work, an annotation system has been developed which considers images in separate categories and constructs a profiling model for each category. To describe an image, we propose a new feature extraction method based on color and texture information that describes image content using discrete distribution signatures. Image signatures of one category are partitioned using spectral clustering and a prototype is determined for each cluster by solving an... 

    Learning of gaussian processes in distributed and communication limited systems

    , Article IEEE Transactions on Pattern Analysis and Machine Intelligence ; Volume 42, Issue 8 , 2020 , Pages 1928-1941 Tavassolipour, M ; Motahari, S. A ; Manzuri Shalmani, M. T ; Sharif University of Technology
    IEEE Computer Society  2020
    Abstract
    It is of fundamental importance to find algorithms obtaining optimal performance for learning of statistical models in distributed and communication limited systems. Aiming at characterizing the optimal strategies, we consider learning of Gaussian Processes (GP) in distributed systems as a pivotal example. We first address a very basic problem: how many bits are required to estimate the inner-products of some Gaussian vectors across distributed machines? Using information theoretic bounds, we obtain an optimal solution for the problem which is based on vector quantization. Two suboptimal and more practical schemes are also presented as substitutes for the vector quantization scheme. In... 

    Learning of tree-structured gaussian graphical models on distributed data under communication constraints

    , Article IEEE Transactions on Signal Processing ; Volume 67, Issue 1 , 2019 , Pages 17-28 ; 1053587X (ISSN) Tavassolipour, M ; Motahari, A ; Manzuri Shalmani, M. T ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    In this paper, learning of tree-structured Gaussian graphical models from distributed data is addressed. In our model, samples are stored in a set of distributed machines where each machine has access to only a subset of features. A central machine is then responsible for learning the structure based on received messages from the other nodes. We present a set of communication-efficient strategies, which are theoretically proved to convey sufficient information for reliable learning of the structure. In particular, our analyses show that even if each machine sends only the signs of its local data samples to the central node, the tree structure can still be recovered with high accuracy. Our... 

    Learning of tree-structured Gaussian graphical models on distributed data under communication constraints

    , Article IEEE Transactions on Signal Processing ; Volume 67, Issue 1 , 2019 , Pages 17-28 ; 1053587X (ISSN) Tavassolipour, M ; Motahari, S. A ; Manzuri Shalmani, M. T ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    In this paper, learning of tree-structured Gaussian graphical models from distributed data is addressed. In our model, samples are stored in a set of distributed machines where each machine has access to only a subset of features. A central machine is then responsible for learning the structure based on received messages from the other nodes. We present a set of communication-efficient strategies, which are theoretically proved to convey sufficient information for reliable learning of the structure. In particular, our analyses show that even if each machine sends only the signs of its local data samples to the central node, the tree structure can still be recovered with high accuracy. Our... 

    Learning of tree-structured Gaussian graphical models on distributed data under communication constraints

    , Article IEEE Transactions on Signal Processing ; Volume 67, Issue 1 , 2019 , Pages 17-28 ; 1053587X (ISSN) Tavassolipour, M ; Motahari, S. A ; Manzuri Shalmani, M. T ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    In this paper, learning of tree-structured Gaussian graphical models from distributed data is addressed. In our model, samples are stored in a set of distributed machines where each machine has access to only a subset of features. A central machine is then responsible for learning the structure based on received messages from the other nodes. We present a set of communication-efficient strategies, which are theoretically proved to convey sufficient information for reliable learning of the structure. In particular, our analyses show that even if each machine sends only the signs of its local data samples to the central node, the tree structure can still be recovered with high accuracy. Our... 

    Location Privacy Preservation for Secondary Users in a Database-Driven Cognitive Radio Network

    , Article ISeCure ; Volume 14, Issue 2 , 2022 , Pages 215-227 ; 20082045 (ISSN) Salami, Z ; Ahmadian Attari, M ; Aref, M. R ; Jannati, H ; Sharif University of Technology
    Iranian Society of Cryptology  2022
    Abstract
    Since their introduction, Cognitive Radio Networks (CRN), as a new solution to the problem of spectrum scarcity, have received great attention from the research society. An important field in database-driven CRN studies is pivoted on their security issues. A critical issue in this context is user’s location privacy, which is potentially under serious threat. The query process by secondary users (SU) from the database is one of the points where the problem rises. In this paper, we propose a Privacy-Preserving Query Process (PPQP), accordingly. This method lets SUs deal in the process of spectrum query without sacrificing their location information. Analytical assessment of PPQP’s privacy... 

    MaxHiC: A robust background correction model to identify biologically relevant chromatin interactions in Hi-C and capture Hi-C experiments

    , Article PLoS Computational Biology ; Volume 18, Issue 6 , 2022 ; 1553734X (ISSN) Alinejad Rokny, H ; Modegh, R. G ; Rabiee, H. R ; Sarbandi, E. R ; Rezaie, N ; Tam, K. T ; Forrest, A. R. R ; Sharif University of Technology
    Public Library of Science  2022
    Abstract
    Hi-C is a genome-wide chromosome conformation capture technology that detects interactions between pairs of genomic regions and exploits higher order chromatin structures. Conceptually Hi-C data counts interaction frequencies between every position in the genome and every other position. Biologically functional interactions are expected to occur more frequently than transient background and artefactual interactions. To identify biologically relevant interactions, several background models that take biases such as distance, GC content and mappability into account have been proposed. Here we introduce MaxHiC, a background correction tool that deals with these complex biases and robustly... 

    Mental arousal level recognition competition on the shared database

    , Article 27th Iranian Conference on Electrical Engineering, ICEE 2019, 30 April 2019 through 2 May 2019 ; 2019 , Pages 1730-1736 ; 9781728115085 (ISBN) Saidi, M ; Rezania, S ; Khazaei, E ; Taghibeyglou, B ; Hashemi, S. S ; Kaveh, R ; Abootalebi, V ; Bagheri, S ; Homayounfar, M ; Asadi, M ; Mohammadian, A ; Mozafari, M ; Hasanzadeh, N ; DIni, H ; Sarvi, H. M ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2019
    Abstract
    This paper presents the results of the shared task with the aim of arousal level recognition for the competition held in conjunction with the 27th Iranian Conference on Electrical Engineering (ICEE 2019). A database annotated with arousal level labels released by Research Center for Development of Advanced Technologies. The contest was held on arousal database according to a defined protocol. Three teams were able to enter into the final stage of the competition according to compare their performance measure with the baseline method. The baseline method is proposed by the data owner. The aim of this paper is outlining the database, protocol design, and providing an overview of top-ranked... 

    Mobile positioning using enhanced signature database method and error reduction in location grid

    , Article Proceedings - 2009 WRI International Conference on Communications and Mobile Computing, CMC 2009, 6 January 2009 through 8 January 2009, Kunming, Yunnan ; Volume 2 , 2009 , Pages 175-179 ; 9780769535012 (ISBN) Manzuri, M. T ; Naderi, A. M ; Sharif University of Technology
    2009
    Abstract
    Today, communication science is advancing very rapidly; one of these advances is mobile location determination which means finding position of a mobile station. There are some methods to location determination; one of these methods is Signal Strength (Power of Arrivals). In this research, Mobile Location Determination using enhanced signature database method in a Location grid will be introduced; several ways for positioning error reduction will also be described. © 2009 IEEE