PyGTED: Python application for computing graph traversal edit distance

Ebrahimpour Boroojeny, A ; Sharif University of Technology | 2020

  1. Type of Document: Article
  2. DOI: 10.1089/cmb.2019.0510
  3. Publisher: Mary Ann Liebert Inc., 2020
  4. Abstract:
  5. Graph Traversal Edit Distance (GTED) is a previously introduced measure of distance (or dissimilarity) between two graphs. Informally, GTED is the minimum edit distance between two strings formed by the edge labels of respective Eulerian traversals of the two graphs. GTED was motivated by, and provides the first mathematical formalism for, sequence coassembly and de novo variation detection in bioinformatics. Many problems in applied machine learning deal with graphs (also called networks), including social networks, security, web data mining, protein function prediction, and genome informatics. The kernel paradigm beautifully decouples the learning algorithm from the underlying geometric space, which renders graph kernels important for the aforementioned applications. In this article, we introduce a tool, PyGTED, to compute GTED. It implements the polynomial-time algorithm that the authors devised for GTED. © Copyright 2020, Mary Ann Liebert, Inc., publishers
  6. Keywords:
  7. Clustering genera ; Coassembly ; De novo variation detection ; Graph comparison ; Graph kernel ; Linear programming ; Article ; Bioinformatics ; Data mining ; Human ; Learning algorithm ; Prediction ; Protein function ; Social network ; System analysis
  8. Source: Journal of Computational Biology ; Volume 27, Issue 3, 2020, Pages 436-439
  9. URL: https://www.liebertpub.com/doi/10.1089/cmb.2019.0510
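
The informal definition in the abstract can be illustrated with a small brute-force sketch. This is not PyGTED's code and not the authors' polynomial-time algorithm: it enumerates every traversal by exhaustive search, so it is only usable on very small graphs. The function names, the `(source, target, label)` edge format, and the choice to accept any traversal that uses each edge exactly once (whether or not it returns to its start vertex) are assumptions made for illustration.

```python
def edit_distance(a, b):
    """Levenshtein distance between two sequences (row-by-row dynamic programming)."""
    prev = list(range(len(b) + 1))
    for i, item_a in enumerate(a, 1):
        cur = [i]
        for j, item_b in enumerate(b, 1):
            cur.append(min(prev[j] + 1,                        # deletion
                           cur[j - 1] + 1,                     # insertion
                           prev[j - 1] + (item_a != item_b)))  # substitution or match
        prev = cur
    return prev[-1]


def eulerian_label_sequences(edges):
    """Return the label sequences of all traversals that use every edge exactly once.

    `edges` is a list of (source, target, label) triples describing a directed
    multigraph. Exhaustive search: exponential time, for illustration only.
    """
    n = len(edges)
    found = set()

    def extend(node, used, labels):
        if len(labels) == n:
            found.add(tuple(labels))
            return
        for i, (src, dst, label) in enumerate(edges):
            if not used[i] and src == node:
                used[i] = True
                labels.append(label)
                extend(dst, used, labels)
                labels.pop()
                used[i] = False

    for start in {src for src, _, _ in edges}:
        extend(start, [False] * n, [])
    return found


def brute_force_gted(edges1, edges2):
    """Minimum edit distance over all pairs of traversal label sequences."""
    seqs1 = eulerian_label_sequences(edges1)
    seqs2 = eulerian_label_sequences(edges2)
    if not seqs1 or not seqs2:
        raise ValueError("each graph needs at least one Eulerian traversal")
    return min(edit_distance(s, t) for s in seqs1 for t in seqs2)


# Two labeled 3-cycles that differ in a single edge label.
g1 = [("a", "b", "A"), ("b", "c", "C"), ("c", "a", "G")]
g2 = [("a", "b", "A"), ("b", "c", "C"), ("c", "a", "T")]
print(brute_force_gted(g1, g2))
```

Because the minimum is taken over all traversals of both graphs, the result does not depend on where a traversal starts, which is what makes the measure a property of the graphs rather than of any one string read from them.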