Persian language understanding using a two-step extended hidden vector state parser

, Article IEEE International Workshop on Machine Learning for Signal Processing, 18 September 2011 through 21 September 2011 ; September , 2011 , Page(s): 1 - 6 ; 9781457716232 (ISBN) Jabbari, F ; Sameti, H ; Hadi Bokaei, M ; Sharif University of Technology

2011

Abstract

The key element of a spoken dialogue system is a spoken language understanding (SLU) unit. Hidden Vector State (HVS) is one of the most popular statistical approaches employed to implement the SLU unit. This paper presents a two-step approach for Persian language understanding. First, a goal detector is used to identify the main goal of the input utterance. Second, after restricting the search space for semantic tagging, an extended hidden vector state (EHVS) parser is used to extract the remaining semantics in each subspace. This will mainly improve the performance of semantic tagger, while reducing the model complexity and training time. Moreover, the need for large amount of data will be...

Summarizing meeting transcripts based on functional segmentation

, Article IEEE/ACM Transactions on Audio Speech and Language Processing ; Volume 24, Issue 10 , 2016 , Pages 1831-1841 ; 23299290 (ISSN) Bokaei, M. H ; Sameti, H ; Liu, Y ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2016

Abstract

In this paper, we aim to improve meeting summarization performance using discourse specific information. Since there are intrinsically different characteristics in utterances in different types of function segments, e.g., Monologue segments versus Discussion ones, we propose a new summarization framework where different summarizers are used for different segment types. For monologue segments, we adopt the integer linear programming-based summarization method; whereas for discussion segments, we use a graph-based method to incorporate speaker information. Performance of our proposed method is evaluated using the standard AMI meeting corpus. Results show a good improvement over previous...

Linear discourse segmentation of multi-party meetings based on local and global information

, Article IEEE/ACM Transactions on Speech and Language Processing ; Volume 23, Issue 11 , July , 2015 , Pages 1879-1891 ; 23299290 (ISSN) Bokaei, M. H ; Sameti, H ; Liu, Y ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2015

Abstract

Linear segmentation of a meeting conversation is beneficial as a stand-alone system (to organize a meeting and make it easier to access) or as a preprocessing step for many other meeting related tasks. Such segmentation can be done according to two different criteria: topic in which a meeting is segmented according to the different items in its agenda, and function in which the segmentation is done according to the meeting's different events (like discussion, monologue). In this article we concentrate on the function segmentation task and propose new unsupervised methods to segment a meeting into functionally coherent parts. The first proposed method assigns a score to each possible boundary...

Extractive meeting summarization through speaker zone detection

, Article 16th Annual Conference of the International Speech Communication Association, INTERSPEECH 2015, 6 September 2015 through 10 September 2015 ; Volume 2015-January , January , 2015 , Pages 2724-2728 ; 2308457X (ISSN) Bokaei, M. H ; Sameti, H ; Liu, Y ; Sharif University of Technology

International Speech and Communication Association 2015

Abstract

In this paper we investigate the role of discourse analysis in extractive meeting summarization task. Specifically our proposed method comprises of two distinct steps. First we use a meeting segmentation algorithm in order to detect various functional parts of the input meeting. Afterwards, a two level scoring mechanism in a graph-based framework is used to score each dialogue act in order to extract the most valuable ones and include them in the extracted summary. We evaluate our proposed method on AMI and ICSI corpora and compare it with other state-of-the-art graph based algorithms according to various evaluation metrics. The experimental results show that our algorithm outperforms the...

Turbine blade cooling passages optimization using reduced conjugate heat transfer methodology

, Article Applied Thermal Engineering ; Volume 103 , 2016 , Pages 1228-1236 ; 13594311 (ISSN) Mazaheri, K ; Zeinalpour, M ; Bokaei, H. R ; Sharif University of Technology

Elsevier Ltd 2016

Abstract

Here we have optimized shape and location of cooling passages of a C3X turbine blade using a multi-objective strategy. The objective functions is selected to be the maximum temperature gradient and the maximum temperature through the three dimensional blade. Shape of cooling channels is modeled using a new method based on the Bezier curves and using forty design variables. The optimized channel shapes are found to be smooth and without corners. To reduce the computational time, parallel processing and the reduced conjugate heat transfer methodology RCHT is used. Using RCHT, the heat transfer between channels and blade are coupled, while the experimental data is used for heat transfer...

Extractive summarization of multi-party meetings through discourse segmentation

, Article Natural Language Engineering ; Volume 22, Issue 1 , 2016 , Pages 41-72 ; 13513249 (ISSN) Bokaei, M. H ; Sameti, H ; Liu, Y ; Sharif University of Technology

Cambridge University Press 2016

Abstract

In this article we tackle the problem of multi-party conversation summarization. We investigate the role of discourse segmentation of a conversation on meeting summarization. First, an unsupervised function segmentation algorithm is proposed to segment the transcript into functionally coherent parts, such as Monologuei (which indicates a segment where speaker i is the dominant speaker, e.g., lecturing all the other participants) or Discussionx1x2,...,xn (which indicates a segment where speakers x 1 to xn involve in a discussion). Then the salience score for a sentence is computed by leveraging the score of the segment containing the sentence. Performance of our proposed segmentation and...

Segmental HMM-based part-of-speech tagger

, Article 2010 International Conference on Audio, Language and Image Processing, ICALIP 2010, Shanghai, 23 November 2010 through 25 November 2010 ; 2010 , Pages 52-56 ; 9781424458653 (ISBN) Bokaei, M. H ; Sameti, H ; Bahrani, M ; Babaali, B ; Sharif University of Technology

2010

Abstract

This paper presents a solution in order to solve the problem of using HMM-based POS tagger in some languages where a word can be comprised of several tokens. Viterbi algorithm is modified in order to support segment of words within a model state. In the other word, the proposed system has a built-in tokenizer where indicates words boundaries as well as its corresponding tag sequence

Discriminative spoken language understanding using statistical machine translation alignment models

, Article Communications in Computer and Information Science ; Vol. 427, issue , Sep , 2014 , pp. 194-202 ; ISSN: 18650929 ; ISBN: 9783319108490 Aliannejadi, M ; Khadivi, S ; Ghidary, S. S ; Bokaei, M. H ; Sharif University of Technology

2014

Abstract

In this paper, we study the discriminative modeling of Spoken Language Understanding (SLU) using Conditional Random Fields (CRF) and Statistical Machine Translation (SMT) alignment models. Previous discriminative approaches to SLU have been dependent on n-gram features. Other previous works have used SMT alignment models to predict the output labels. We have used SMT alignment models to align the abstract labels and trained CRF to predict the labels. We show that the state transition features improve the performance. Furthermore, we have compared the proposed method with two baseline approaches; Hidden Vector States (HVS) and baseline-CRF. The results show that for the F-measure the proposed...

Niusha, the first persian speech-enabled IVR platform

, Article 2010 5th International Symposium on Telecommunications, IST 2010, 4 December 2010 through 6 December 2010, Tehran ; 2010 , Pages 591-595 ; 9781424481835 (ISBN) Bokaei, M. H ; Sameti, H ; Eghbal-Zadeh, H ; BabaAli, B ; Hosseinzadeh, K. H ; Bahrani, M ; Veisi, H ; Sanian, A ; Sharif University of Technology

2010

Abstract

This paper introduces Niusha, the first Persian speech-enabled IVR platform. This platform uses Persian recognizer and Persian text-to-speech synthesizer engines in order to interact with users. The platform is designed in a way that it can simply be customized in various domains and its components are adjustable with new words

A Four-Stage algorithm for community detection based on label propagation and game theory in social networks

, Article AI (Switzerland) ; Volume 4, Issue 1 , 2023 , Pages 255-269 ; 26732688 (ISSN) Torkaman, A ; Badie, K ; Salajegheh, A ; Bokaei, M. H ; Ardestani, S. F. F ; Sharif University of Technology

Multidisciplinary Digital Publishing Institute (MDPI) 2023

Abstract

Over the years, detecting stable communities in a complex network has been a major challenge in network science. The global and local structures help to detect communities from different perspectives. However, previous methods based on them suffer from high complexity and fall into local optimum, respectively. The Four-Stage Algorithm (FSA) is proposed to reduce these issues and to allocate nodes to stable communities. Balancing global and local information, as well as accuracy and time complexity, while ensuring the allocation of nodes to stable communities, are the fundamental goals of this research. The Four-Stage Algorithm (FSA) is described and demonstrated using four real-world data...

Unilateral semi-supervised learning of extended hidden vector state for Persian language understanding

, Article NLP-KE 2011 - Proceedings of the 7th International Conference on Natural Language Processing and Knowledge Engineering, 27 November 2011 through 29 November 2011, Tokushima ; 2011 , Pages 165-168 ; 9781612847283 (ISBN) Jabbari, F ; Sameti, H ; Bokaei, M. H ; Chinese Association for Artificial Intelligence; IEEE Signal Processing Society ; Sharif University of Technology

2011

Abstract

The key element of a spoken dialogue system is Spoken Language Understanding (SLU) part. HVS and EHVS are two most popular statistical methods employed to implement the SLU part which need lightly annotated data. Since annotation is a time consuming, we present a novel semi-supervised learning for EHVS to reduce the human labeling effort using two different statistical classifiers, SVM and KNN. Experiments are done on a Persian corpus, the University Information Kiosk corpus. The experimental results show improvements in performance of semi-supervised EHVS, trained by both labeled and unlabeled data, compared to EHVS trained by just initially labeled data. The performance of EHVS improves...

Spoken Language Understanding in Dialogue System

, M.Sc. Thesis Sharif University of Technology Bokaei, Mohammad Hadi (Author) ; Sameti, Hossein (Supervisor)

Abstract

In contrast to automatic speech recognition (ASR), which converts a speaker’s spoken utterance into a text string, spoken language understanding (SLU) is aimed at interpreting user’s intentions from their speech utterances. Traditionally, this has been accomplished by writing context-free grammars (CFGs) or unification grammars (UGs) manually. The manual grammar authoring process is laborious and expensive, requiring much expertise. In addition, robustness is a vital requirement of these modules, because the input of these modules comes from a speech recognition unit and always contains errors. In recent years, many data-driven models have been proposed for spoken language understanding, but...

Multi-objective Optimization of Internal Cooling Passages for a Turbine Blade

, M.Sc. Thesis Sharif University of Technology Bokaei, Hamid Reza (Author) ; Mazaheri, Karim (Supervisor)

Abstract

In the present work, the shape and position of internal cooling passages within an axial turbine blade have been optimized to achieve a uniform temperature distribution with the minimum cooling air flow while the maximum temperature is below the allowable value. Four cooling passages are made within the blade. The cross section shape of each passage is parameterized using a new method based on an 8-order Bezier curve. This curve which is represented in terms of Bezier control points has much flexibility and can produce a large variety of shapes. The shape of the blade surface profile remains unchanged during the optimization process. The numerical simulation has been carried out using...

Extractive Meeting Summarization through Discourse Analysis

, Ph.D. Dissertation Sharif University of Technology Bokaei, Mohammad Hadi (Author) ; Sameti, Hossein (Supervisor)

Abstract

Improvement of automatic speech recognition systems and the growth of audio data (such as broadcast news, voice mail, telephony conversations and meetings) have attracted plenty of research interest in the field of speech summarization. The goal of this dissertation is to improve the performance of the speech summarization in the domain of multi-party conversations, specifically meetings. Most of the previous work in this field are inheritted from the text summarization counterpart, whithout paying much attention to the discourse specific information of the multi-party conversations. The main idea of this work is to use discourse information to improve the accuracy of extracted summaries in...

TWO-Snapshot Doa estimation Via hankel-structured matrix completion

, Article 47th IEEE International Conference on Acoustics, Speech, and Signal Processing, ICASSP 2022, 23 May 2022 through 27 May 2022 ; Volume 2022-May , 2022 , Pages 5018-5022 ; 15206149 (ISSN); 9781665405409 (ISBN) Bokaei, M ; Razavikia, S ; Amini, A ; Rini, S ; Sharif University of Technology

Institute of Electrical and Electronics Engineers Inc 2022

Abstract

In this paper, we study the problem of estimating the direction of arrival (DOA) using a sparsely sampled uniform linear array (ULA). Based on an initial incomplete ULA measurements, our strategy is to choose a sparse subset of array elements for measuring the next snapshot. Then, we use a Hankel-structured matrix completion to interpolate for the missing ULA measurements. Finally, the source DOAs are estimated using a subspace method such as Prony on the fully recovered ULA. We theoretically provide a sufficient bound for the number of required samples (array elements) for perfect recovery. The numerical comparisons of the proposed method with existing techniques such as atomic-norm...

Improve the Quality of DOA Estimation by Hankel Matrix Completing

, M.Sc. Thesis Sharif University of Technology Bokaei, Moahammad (Author) ; Behrouzi, Hamid (Supervisor) ; Amini, Arash (Supervisor)

Abstract

In this thesis, we deal with the problem of direction of arrival (DOA) using single-snapshot samples taken from multiple source signals by an array of sensors. The array we are looking for in this work is a uniform linear array. There are many ways to solve this problem, including MUSIC and Prony. But one of the few methods that are able to estimate correctly with single-snapshot samples with incomplete uniform linear arrays in compressed sensing base algorithms. In the first step, we can estimate the data of a incomplete uniform linear array by arranging them in the form of low-rank Hankel matrices and using leverage scores and minimizing the weight nuclear norm. If the weight matrices are...

Multidocument Keyphrase Extraction Using Recurrent Neural Networks

, M.Sc. Thesis Sharif University of Technology Doostmohammadi, Ehsan (Author) ; Sameti, Hossein (Supervisor) ; Bokaei, Mohammad Hadi (Supervisor)

Abstract

Keyphrase extraction, as an important open problem of Natural Language Processing (NLP), is useful as a stand-alone task in the field of Information Extraction and as an upstream task for Information Retrieval, text summarization and classification,etc. In this study, regarding the needs in Persian NLP, artificial neural networks are adopted to extract keyphrases from single documents and a graph-based re-scoring method is proposed for multidocument keyphrase extraction. The proposed method for extracting keyphrases from multiple documents consists of two steps: (1) extracting keyphrases of each document in a cluster using a sequence to sequence model with attention, and (2) re-scoring the...

Automatic Difficulty Estimation of Thematic Similarity MultipleChoice Questions

, M.Sc. Thesis Sharif University of Technology Akef, Soroosh (Author) ; Sameti, Hossein (Supervisor) ; Bokaei, Mohammad Hadi (Supervisor)

Abstract

This project has been conducted in two related phases: In the first phase, we have attempted to write a program capable of answering thematic similarity multiple-choice questions without utilizing any training data. The best performance in this phase was attained by the 25-topic LDA model using the Hellinger distance between the probability distributions of the poetic verses. This model managed to attain an accuracy of 42%, which is very close to the average human performance of 43%. In the second phase, two tasks of seven-class classification and binary classification were defined based on the p-value of the questions. To this end, the questions were initially ranked according to the...

Improving Persian Word Embeddings Using Neural Networks

, M.Sc. Thesis Sharif University of Technology Aliramezani, Mohammad (Author) ; Sameti, Hossein (Supervisor) ; Bokaei, Mohammad Hadi (Co-Supervisor)

Abstract

In recent years, word embeddings as the word representation have captured the attention of natural language processing (NLP) researches. One of the great advantages of word embeddings is their capability in representing the relationships of the words. Therefore, using word embeddings in NLP applications results in better performance.Despite widespread attention towards word embedding in late years, Persian word embeddings have not achieved sensible progress. One of the Persian word embeddings difficulties is related to that, Persian is a low-resource language in comparison with worldwide languages. Therefore, Persian word embedding quality is lower than English. Consequently, the accuracy of...

Semantic Analysis and Event Detection Using Deep Learning for Stock Prediction

, M.Sc. Thesis Sharif University of Technology Basirian Jahromi, Ali (Author) ; Sameti, Hossein (Supervisor) ; Bokaei, Mohammad Hadi (Supervisor)

Abstract

News plays a very important role in stock market trading. Nowadays news from a different part of the world and about different fields can be accessed easily, and for a successful trade, it is necessary to analyze accurately and use this big data and information as soon as possible. For this reason, this thesis tries to present and study models based on Deep Learning networks and Natural Language Processing for financial news analysis and predicting stock indices movement. This research takes advantage of a language model for learning and representing news text, and beside this language model it uses deep learning networks at multiple levels to extract proper features from each news in a day...