Generalized Bootstrap Supports for Phylogenetic Analyses of Protein Sequences Incorporating Alignment Uncertainty
Journal Title: | Systematic biology 2018, Vol.67 (6), p.997-1009 |
Main Author: | Chatzou, Maria |
Other Authors: | Floden, Evan W , Di Tommaso, Paolo , Gascuel, Olivier , Notredame, Cedric |
Format: |
![]() |
Language: |
English |
Subjects: | |
Publisher: | England: Oxford University Press |
ID: | ISSN: 1063-5157 |
Zum Text: |
SendSend as email
Add to Book BagAdd to Book Bag
Staff View

recordid: | cdi_hal_primary_oai_HAL_lirmm_02078444v1 |
title: | Generalized Bootstrap Supports for Phylogenetic Analyses of Protein Sequences Incorporating Alignment Uncertainty |
format: | Article |
creator: |
|
subjects: |
|
ispartof: | Systematic biology, 2018, Vol.67 (6), p.997-1009 |
description: | Phylogenetic reconstructions are essential in genomics data analyses and depend on accurate multiple sequence alignment (MSA) models. We show that all currently available large-scale progressive multiple alignment methods are numerically unstable when dealing with amino-acid sequences. They produce significantly different output when changing sequence input order. We used the HOMFAM protein sequences dataset to show that on datasets larger than 100 sequences, this instability affects on average 21.5% of the aligned residues. The resulting Maximum Likelihood (ML) trees estimated from these MSAs are equally unstable with over 38% of the branches being sensitive to the sequence input order. We established that about two-thirds of this uncertainty stems from the unordered nature of children nodes within the guide trees used to estimate MSAs. To quantify this uncertainty we developed unistrap, a novel approach that estimates the combined effect of alignment uncertainty and site sampling on phylogenetic tree branch supports. Compared with the regular bootstrap procedure, unistrap provides branch support estimates that take into account a larger fraction of the parameters impacting tree instability when processing datasets containing a large number of sequences. |
language: | eng |
source: | |
identifier: | ISSN: 1063-5157 |
fulltext: | no_fulltext |
issn: |
|
url: | Link |
@attributes |
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
PrimoNMBib |
|