Artykuł w czasopiśmie
Brak miniatury
Licencja

ClosedAccessDostęp zamknięty

Efficient algorithms for genomic duplication models

Autor
Paszek, Jarosław
Górecki, Paweł
Data publikacji
2018
Abstrakt (EN)

An important issue in evolutionary molecular biology is to discover genomic duplication episodes and their correspondence to the species tree. Existing approaches vary in the two fundamental aspects: the choice of evolutionary scenarios that model allowed locations of duplications in the species tree, and the rules of clustering gene duplications from gene trees into a single multiple duplication event. Here we study the method of clustering called minimum episodes for several models of allowed evolutionary scenarios with a focus on interval models in which every gene duplication has an interval consisting of allowed locations in the species tree. We present mathematical foundations for general genomic duplication problems. Next, we propose the first linear time and space algorithm for minimum episodes clustering jointly for any interval model and the algorithm for the most general model in which every evolutionary scenario is allowed. We also present a comparative study of different models of genomic duplication based on simulated and empirical datasets. We provided algorithms and tools that could be applied to solve efficiently minimum episodes clustering problems. Our comparative study helps to identify which model is the most reasonable choice in inferring genomic duplication events.

Słowa kluczowe EN
Genomic duplication duplication episode minimum episodes problem reconciliation gene tree species tree
Dyscyplina PBN
informatyka
Czasopismo
IEEE/ACM Transactions on Computational Biology and Bioinformatics
Tom
15
Zeszyt
5
Strony od-do
1515-1524
ISSN
1545-5963
Licencja otwartego dostępu
Dostęp zamknięty