Loading...
Search for: bahrani--s
0.123 seconds

    A new word clustering method for building n-gram language models in continuous speech recognition systems

    , Article Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), 18 June 2008 through 20 June 2008, Wroclaw ; Volume 5027 LNAI , 2008 , Pages 286-293 ; 03029743 (ISSN) ; 354069045X (ISBN); 9783540690450 (ISBN) Bahrani, M ; Sameti, H ; Hafezi, N ; Momtazi, S ; Sharif University of Technology
    2008
    Abstract
    In this paper a new method for automatic word clustering is presented. We used this method for building n-gram language models for Persian continuous speech recognition (CSR) systems. In this method, each word is specified by a feature vector that represents the statistics of parts of speech (POS) of that word. The feature vectors are clustered by k-means algorithm. Using this method causes a reduction in time complexity which is a defect in other automatic clustering methods. Also, the problem of high perplexity in manual clustering methods is abated. The experimental results are based on "Persian Text Corpus" which contains about 9 million words. The extracted language models are evaluated... 

    Exploring the impact of machine translation on fake news detection: A case study on Persian tweets about COVID-19

    , Article 29th Iranian Conference on Electrical Engineering, ICEE 2021, 18 May 2021 through 20 May 2021 ; 2021 , Pages 540-544 ; 9781665433655 (ISBN) Saghayan, M. H ; Ebrahimi, S. F ; Bahrani, M ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2021
    Abstract
    Fake news detection has become an emerging and critical topic of research in recent years. One of the major complications of fake news detection lies in the fact that news in social networks is multilingual, and therefore developing methods for each and every language in the world is impossible, especially for low resource languages like Persian. In an effort to solve this problem, researchers use machine translation to uniform the data and develop a method for the uniformed data. In this paper, we aim to explore the impacts of machine translation on fake news detection. For this purpose, we extracted and labeled a dataset of Persian Tweets from Twitter on the subject of COVID-19 and... 

    Introducing a Hybrid Language Model for Improving Performance of Continuous Speech Recognition Systems

    , Ph.D. Dissertation Sharif University of Technology Bahrani, Mohammad (Author) ; Sameti, Hossein (Supervisor)
    Abstract
    The utilizing language model is one of the most effective methods for improving speech recognition performance. For speech recognition applications, several types of language models have been proposed for speech recognition applications that try to model some parts of language information, such as n-gram models, syntactic models, and semantic models. Although n-gram, syntactic and semantic models are able to model different structures that exist in natural language, they each only capture specific linguistic phenomena. None of them can simultaneously take into account all of language phenomena in a unified probabilistic framework. Recently, a number of semantic models called "latent topic... 

    Question Processing for Open Domain Persian Question Answering Systems

    , M.Sc. Thesis Sharif University of Technology Hosseini, Hawre (Author) ; Bahrani, Mohammad (Supervisor)
    Abstract
    Question answering systems are systems which get a question in natural language as input and present an explicit, appropriate answer to the question. One of the major components of automatic question answering systems is question processing component in which the input question is analyzed. The main goal of question processing phase is to determine the answer type through question classification. Rule-based, machine learning-based and hybrid approaches have been used in order to develop question classifiers among which machine learning-based ones have outperformed the others. This study’s main goal is to develop a question classifier for Persian open domain question answering systems.... 

    Pronoun Resolution with Data Driven Approaches

    , M.Sc. Thesis Sharif University of Technology Nourbakhsh, Aria (Author) ; Bahrani, Mohammad (Supervisor)
    Abstract
    Pronoun resolution is one of the challenges of natural language processing. The proposed solutions range from heuristic rule-based to machine learning data driven approaches. In this thesis, we followed a previous machine learning base work to Persian pronoun anaphora resolution. The primary goal of this thesis was to improve results, mainly by extracting more balanced data and to add more features to the extracted feature vectors used in classification. Using PCAC2008 dataset, we considered noun phrase structure as a way to extract more suitable training data. Features added to the extracted data include syntactic and semantic features. Then, we trained and tested different machine learning... 

    A new bigram-PLSA language model for speech recognition

    , Article Eurasip Journal on Advances in Signal Processing ; Volume 2010 , July , 2010 ; 16876172 (ISSN) Bahrani, M ; Sameti, H ; Sharif University of Technology
    2010
    Abstract
    A novel method for combining bigram model and Probabilistic Latent Semantic Analysis (PLSA) is introduced for language modeling. The motivation behind this idea is the relaxation of the bag of words assumption fundamentally present in latent topic models including the PLSA model. An EM-based parameter estimation technique for the proposed model is presented in this paper. Previous attempts to incorporate word order in the PLSA model are surveyed and compared with our new proposed model both in theory and by experimental evaluation. Perplexity measure is employed to compare the effectiveness of recently introduced models with the new proposed model. Furthermore, experiments are designed and... 

    Persian Abstractive Summarization using Graph-based Abstract Meaning Representation

    , M.Sc. Thesis Sharif University of Technology Haddadan, Shohreh (Author) ; Bahrani, Mohammad (Supervisor)
    Abstract
    This study attempts to introduce a novel approach to abstractive summarization in Persian. According to the methodology the first step is to represent input text sentences into an abstract meaning representation structure. This representation is syntax free thus, it helps the summarization system to represent sentences more semantic based and free of the sentence syntactic structure. In order to select suitable content for the summary output semantic and structural features are extracted from the representation. Data used in this research consists of approximatelty 200 senctences summarized in 30 sentences of a famous story book named: ”The little prince”. An SVM is trained on 80% of... 

    Quantum Key Distribution in Quantum Networks

    , Ph.D. Dissertation Sharif University of Technology Bahrani, Sima (Author) ; Salehi, Jawad (Supervisor)
    Abstract
    With the rapid development of communication technology, the importance of information security is growing daily. Data Encryption, as one of the most important tools of secure communication, has attracted a lot of attention. Most conventional cryptography methods are based on computational complexity assumptions. Hence, they are threatened by computational power enhancements. One of the most important approaches of cryptography that does not have this drawback is quantum cryptography. The unconditional security of this method has been proved by laws of quantum mechanics. One application of quantum cryptography is quantum key distribution that enables secret key exchange between two users. The... 

    Rule-Based Conversion of Colloquial Texts into Official Texts in Persian

    , M.Sc. Thesis Sharif University of Technology Rajabpur, Mohammad (Author) ; Bahrani, Mohammad (Supervisor)
    Abstract
    In this study, first a set of data was colleted which consisted of colloquial sentences in Persian. Each of these sentences was rendered into standard Persian by native speakers. As a result, a corpus of parallel data including 1698 pairs of sentences was created. Then each colloquial sentence and its formal equivalent were converted into term-frequency vectors and the cosine distance similarity between the two vectors was calculated. Besides the mean and the standard deviation of all cosine distances were obtained. Afterwards the whole set of data was divided into two halves through Stratified randomization so that the two halves resembled each other in terms of cosine distance similarity.... 

    Automatic Author Age Identification Using Social Media Texts

    , M.Sc. Thesis Sharif University of Technology Askari, Maryam (Author) ; Bahrani, Mohammad (Supervisor)
    Abstract
    The most common form of communication on the internet and social network websites is text messages. normally communication on social media or even on the web is by posting some sort of text. usually, these messages or posts are short and text used in them may not follow any language standards, this makes it very difficult to process them. Different age groups use a certain language differently and this is shown in the way, each of them writes texts. The advancements made in the field of natural language processing and computational linguistics makes it possible to predict, text authors age groups by analyzing the way they write. This study focuses on ways to automatically recognize the age... 

    Statistical distribution of intensity fluctuations for underwater wireless optical channels in the presence of air bubbles

    , Article IWCIT 2016 - Iran Workshop on Communication and Information Theory, 3 May 2016 through 4 May 2016 ; 2016 ; 9781509019229 (ISBN) Jamali, M. V ; Khorramshahi, P ; Tashakori, A ; Chizari, A ; Shahsavari, S ; Abdollahramezani, S ; Fazelian, M ; Bahrani, S ; Salehi, J. A ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2016
    Abstract
    In this paper, we experimentally investigate the statistical distribution of intensity fluctuations for underwater wireless optical channels under different channel conditions, namely fresh and salty underwater channels with and without air bubbles. To do so, we first measure the received optical signal with a large number of samples. Based on the normalized acquired data the channel coherence time and the fluctuations probability density function (PDF) are obtained for different channel scenarios. Our experimental results show that salt attenuates the received signal while air bubbles mainly introduce severe intensity fluctuations. Moreover, we observe that log-normal distribution precisely... 

    Mining DNA sequences based on spatially coded technique using spatial light modulator

    , Article IWCIT 2016 - Iran Workshop on Communication and Information Theory, 3 May 2016 through 4 May 2016 ; 2016 ; 9781509019229 (ISBN) Fazelian, M ; Abdollahramezani, S ; Bahrani, S ; Chizari, A ; Jamali, M. V ; Khorramshahi, P ; Tashakori, A ; Shahsavari, S ; Salehi, J. A ; Sharif University of Technology
    Institute of Electrical and Electronics Engineers Inc  2016
    Abstract
    In this paper, we present an optical computing method for string data alignment applicable to genome information analysis. By applying moire technique to spatial encoding patterns of deoxyribonucleic acid (DNA) sequences, association information of the genome and the expressed phenotypes could more effectively be extracted. Such moire fringes reveal occurrence of matching, deletion and insertion between DNA sequences providing useful visualized information for prediction of gene function and classification of species. Furthermore, by applying a cylindrical lens, a new technique is proposed to map two-dimensional (2D) association information to a one-dimensional (1D) column of pixels, where... 

    Nondetection zone assessment of an active islanding detection method and its experimental evaluation

    , Article IEEE Transactions on Power Delivery ; Volume 26, Issue 2 , 2011 , Pages 517-525 ; 08858977 (ISSN) Bahrani, B ; Karimi, H ; Iravani, R ; Sharif University of Technology
    2011
    Abstract
    This paper analytically determines the nondetection zone (NDZ) of an active islanding detection method, and proposes a solution to obviate the NDZ. The method actively injects a negative-sequence current through the interface voltage-sourced converter (VSC) of a distributed generation (DG) unit, as a disturbance signal for islanding detection. The estimated magnitude of the corresponding negative-sequence voltage at the PCC is used as the islanding detection signal. In this paper, based on a laboratory test system, the performance of the islanding detection method under UL1741 anti-islanding test conditions is evaluated. Then, determining the NDZ of the method and proposing the... 

    Stability analysis and experimental evaluation of a control strategy for islanded operation of distributed generation units

    , Article IEEJ Transactions on Industry Applications ; Volume 131, Issue 8 , 2011 , Pages 1013-1022 ; 09136339 (ISSN) Bahrani, B ; Karimi, H ; Iravani, R ; Sharif University of Technology
    2011
    Abstract
    This paper presents stability analysis of a distributed generation (DG) controller (1), in an islanded mode, based on the Mapping Theorem and the Zero Exclusion Condition, and validates the results based on a laboratory scale experimental setup. The DG unit is interfaced to the host system through a voltage-sourced converter (VSC). The control strategy regulates the load voltage at the desired value in the islanded mode, despite uncertainties in the load parameters. The frequency of the island is controlled in an open loop manner by an internal oscillator. The experimental results show that the controller provides robust voltage control for a wide range of load parameters, and even maintains... 

    Commonsense knowledge extraction for persian language: a combinatory approach

    , Article Iranian Journal of Information Processing Management ; Volume 31, Issue 1 , 2015 , Pages 109-124 ; 22518223 (ISSN) Moradi, M ; Vazirnezhad, B ; Bahrani, M ; Sharif University of Technology
    Iranian Research Institute for Scientific Information and Documentation  2015
    Abstract
    The Putting human commonsense knowledge into computers has always been a long standing dream of artificial intelligence (AI). The cost of several tens of millions of dollars and time have been covered so that the computers could know about "objects falling, not rising.", "running is faster than walking". The large database was built, automated and semi-automated methods were introduced and volunteers' efforts were utilized to achieve this, but an automated, high-throughput and low-noise method for commonsense collection still remains as the holy grail of AI. The aim of this study was to build commonsense knowledge ontology using three approaches namely Hearst method, machine translation and... 

    Decentralized control of parallel connection of two distributed generation units

    , Article 35th Annual Conference of the IEEE Industrial Electronics Society, IECON 2009, Porto, 3 November 2009 through 5 November 2009 ; 2009 , Pages 358-362 Bahrani, B ; Karimi, H ; Iravani, R ; Sharif University of Technology
    2009
    Abstract
    This paper presents a decentralized control strategy for the autonomous (islanded) operation of parallel connection of two distributed generation (DG) units. The DG units are electronically interfaced to the host grid at the same point of common coupling (PCC), where the local load is also supplied. In the grid-connected mode, the voltage-sourced converter (VSC) of each DG unit controls the exchange of real and reactive power components with the host grid, based on the conventional dq-current control strategy. In the islanded mode, one of the DG units provides voltage and frequency control for the island, and the other DG unit continues to operate with the pre-islanding dq-current control... 

    Media Bias Analysis for Persian Text News

    , M.Sc. Thesis Sharif University of Technology Abbaszadeh Hojedki, Mohaddese (Author) ; Bahrani, Mohammad (Supervisor)
    Abstract
    There are different types of media bias. The aim of this study is to analyze media bias by considering two types of it: selection (or coverage) bias and language bias. Thus we have collected some specific news stories or articles -which contain “Iran” as a keyword- from the websites of four news broadcasters that are Al Arabiya, Deutsche Welle (DW), Radio France Internationale (RFI) and SPUTNIK, to build text datasets. For the purpose of comparing and analyzing media bias, the news had to be gathered during two time frames before and after the day the P5+1, European Union and Iran reached Joint Comprehensive Plan of Action (JCPOA). Finally, the collected corpora have amounted to 784 news... 

    Computational Textual Criticism of Manuscripts' Texts

    , M.Sc. Thesis Sharif University of Technology Ranjbar Chaghakabudi, Vahid (Author) ; Bahrani, Mohammad (Supervisor)
    Abstract
    In this thesis, I try to use methods and algorithms of computational linguistics and natural language processing for textual criticism of Persian manuscripts' texts and design and develop a software based on that. Suggested method in this thesis is comparing all manuscripts' texts with base manuscript's text by Dynamic Time Warping (DTW) algorithm, after definition of base manuscript. Then the same sentences are extracted in different manuscripts and POS tagged as body of corpus of author's style and learned as language model by Hidden Markov Model (HMM). At next stage, due to the textual criticism's rules and using of author stylistics algorithms choose the case which is most similar to the... 

    Local and global buckling condition of all-steel buckling restrained braces

    , Article Steel and Composite Structures ; Volume 23, Issue 2 , 2017 , Pages 217-228 ; 12299367 (ISSN) Mirtaheri, S. M ; Nazeryan, M ; Bahrani, M. K ; Nooralizadeh, A ; Montazerian, L ; Naserifard, M ; Sharif University of Technology
    Techno Press  2017
    Abstract
    Braces are one of the retrofitting systems of structure under earthquake loading. Buckling restrained braces (BRBs) are one of the very efficient braces for lateral loads. One of the key needs for a desirable and acceptable behavior of buckling-restraining brace members under intensive loading is that it prevents total buckling until the bracing member tolerates enough plastic deformation and ductility. This paper presents the results of a set of analysis by finite element method on buckling restrained braces in which the filler materials within the restraining member have been removed. These braces contain core as the conventional BRBs, but they have a different buckling restrained system.... 

    Modeling Persian Language in the Framework of Complex Networks

    , M.Sc. Thesis Sharif University of Technology Sabooni Aghdam, Amir Mahdi (Author) ; Bahrani, Mohammad (Supervisor)
    Abstract
    The interest in analyzing human language with complex networks is on the rise in recent years and a considerable body of research in this area has already been accumulated.However unfortunately, the use of applications of complex networks in Persian Linguistics research is missing. With the goal of introducing complex networks and their applications in this field, two of these applications have been studied in this research. First, we tried to build an inclusive network model, considering two levels of Syntax and Word Cooccurrence, for the Persian Language and provide Linguistics interpretations for them. In addition, by comparing co-occurrence networks of different languages, garnered from...