Loading...

History based unsupervised data oriented parsing

Mesgar, M ; Sharif University of Technology | 2013

816 Viewed
  1. Type of Document: Article
  2. Publisher: 2013
  3. Abstract:
  4. Grammar induction is a basic step in natural language processing. Based on the volume of information that is used by different methods, we can distinguish three types of grammar induction method: supervised, unsupervised, and semi-supervised. Supervised and semisupervised methods require large tree banks, which may not currently exist for many languages. Accordingly, many researchers have focused on unsupervised methods. Unsupervised Data Oriented Parsing (UDOP) is currently the state of the art in unsupervised grammar induction. In this paper, we show that the performance of UDOP in free word order languages such as Persian is inferior to that of fixed order languages such as English. We also introduce a novel approach called History-based unsupervised data oriented Parsing, and show that the performance of UDOP can be significantly improved by using some history information, especially in dealing with free word order languages
  5. Keywords:
  6. Free word order languages ; Grammar induction ; History informations ; NAtural language processing ; Semi-supervised method ; State of the art ; Unsupervised data ; Unsupervised method ; Computational grammars ; Natural language processing systems
  7. Source: International Conference Recent Advances in Natural Language Processing, RANLP ; September , 2013 , Pages 453-459 ; 13138502 (ISSN)