schliessen

Filtern

 

Bibliotheken

Coestimating Reticulate Phylogenies and Gene Trees from Multilocus Sequence Data

The multispecies network coalescent (MSNC) is a stochastic process that captures how gene trees grow within the branches of a phylogenetic network. Coupling the MSNC with a stochastic mutational process that operates along the branches of the gene trees gives rise to a generative model of how multip... Full description

Journal Title: Systematic biology 2018, Vol.67 (3), p.439-457
Main Author: Wen, Dingqiao
Other Authors: Nakhleh, Luay
Format: Electronic Article Electronic Article
Language: English
Subjects:
Publisher: England: Oxford University Press
ID: ISSN: 1063-5157
Link: https://www.ncbi.nlm.nih.gov/pubmed/29088409
Zum Text:
SendSend as email Add to Book BagAdd to Book Bag
Staff View
recordid: cdi_proquest_miscellaneous_1958545141
title: Coestimating Reticulate Phylogenies and Gene Trees from Multilocus Sequence Data
format: Article
creator:
  • Wen, Dingqiao
  • Nakhleh, Luay
subjects:
  • REGULAR ARTICLES
ispartof: Systematic biology, 2018, Vol.67 (3), p.439-457
description: The multispecies network coalescent (MSNC) is a stochastic process that captures how gene trees grow within the branches of a phylogenetic network. Coupling the MSNC with a stochastic mutational process that operates along the branches of the gene trees gives rise to a generative model of how multiple loci from within and across species evolve in the presence of both incomplete lineage sorting (ILS) and reticulation (e.g., hybridization). We report on a Bayesian method for sampling the parameters of this generative model, including the species phylogeny, gene trees, divergence times, and population sizes, from DNA sequences of multiple independent loci. We demonstrate the utility of our method by analyzing simulated data and reanalyzing an empirical data set. Our results demonstrate the significance of not only coestimating species phylogenies and gene trees, but also accounting for reticulation and ILS simultaneously. In particular, we show that when gene flow occurs, our method accurately estimates the evolutionary histories, coalescence times, and divergence times. Tree inference methods, on the other hand, underestimate divergence times and overestimate coalescence times when the evolutionary history is reticulate. While the MSNC corresponds to an abstract model of “intermixture,” we study the performance of the model and method on simulated data generated under a gene flow model. We show that the method accurately infers the most recent time at which gene flow occurs. Finally, we demonstrate the application of the new method to a 106-locus yeast data set.
language: eng
source:
identifier: ISSN: 1063-5157
fulltext: no_fulltext
issn:
  • 1063-5157
  • 1076-836X
url: Link


@attributes
NO1
SEARCH_ENGINEprimo_central_multiple_fe
SEARCH_ENGINE_TYPEPrimo Central Search Engine
RANK2.7138815
LOCALfalse
PrimoNMBib
record
control
sourceidjstor_proqu
recordidTN_cdi_proquest_miscellaneous_1958545141
sourceformatXML
sourcesystemPC
jstor_id26581969
oup_id10.1093/sysbio/syx085
sourcerecordid26581969
originalsourceidFETCH-LOGICAL-1396t-640a8a517458cd644e48daaa4937ec3804e1bf333c03dfe79da2872c742bc6f00
addsrcrecordideNqFkM9LwzAYhoMobk6PHpUevVST5UeTo0ydguLQCd5Kln6dGV0zkxTcf29Hp4Ignr588PB-bx6Ejgk-J1jRi7AOM-va8YEl30F9gjORSipedzdvQVNOeNZDByEsMCZEcLKPekOFpWRY9dFk5CBEu9TR1vPkCaI1TaUjJJO3deXmUFsIia6LZAw1JFMP7Vp6t0wemiraypkmJM_w3kBtILnSUR-ivVJXAY62c4Bebq6no9v0_nF8N7q8TwlVIqaCYS01Jxnj0hSCMWCy0FozRTMwVGIGZFZSSg2mRQmZKvRQZkOTseHMiBLjATrrclfetedDzJc2GKgqXYNrQk4Ul5xxwkiLph1qvAvBQ5mvfPtjv84JzjcS805i3kls-dNtdDNbQvFNf1lrAfor0NjYGnR19NpWf8ZuG7tm9W-Dkw5dhOj8TwHBJVFC4U_YTJok
sourcetypeAggregation Database
isCDItrue
recordtypearticle
pqid1958545141
display
typearticle
titleCoestimating Reticulate Phylogenies and Gene Trees from Multilocus Sequence Data
creatorWen, Dingqiao ; Nakhleh, Luay
contributorKubatko, Laura
creatorcontribWen, Dingqiao ; Nakhleh, Luay ; Kubatko, Laura
descriptionThe multispecies network coalescent (MSNC) is a stochastic process that captures how gene trees grow within the branches of a phylogenetic network. Coupling the MSNC with a stochastic mutational process that operates along the branches of the gene trees gives rise to a generative model of how multiple loci from within and across species evolve in the presence of both incomplete lineage sorting (ILS) and reticulation (e.g., hybridization). We report on a Bayesian method for sampling the parameters of this generative model, including the species phylogeny, gene trees, divergence times, and population sizes, from DNA sequences of multiple independent loci. We demonstrate the utility of our method by analyzing simulated data and reanalyzing an empirical data set. Our results demonstrate the significance of not only coestimating species phylogenies and gene trees, but also accounting for reticulation and ILS simultaneously. In particular, we show that when gene flow occurs, our method accurately estimates the evolutionary histories, coalescence times, and divergence times. Tree inference methods, on the other hand, underestimate divergence times and overestimate coalescence times when the evolutionary history is reticulate. While the MSNC corresponds to an abstract model of “intermixture,” we study the performance of the model and method on simulated data generated under a gene flow model. We show that the method accurately infers the most recent time at which gene flow occurs. Finally, we demonstrate the application of the new method to a 106-locus yeast data set.
identifier
0ISSN: 1063-5157
1EISSN: 1076-836X
2DOI: 10.1093/sysbio/syx085
3PMID: 29088409
languageeng
publisherEngland: Oxford University Press
subjectREGULAR ARTICLES
ispartofSystematic biology, 2018, Vol.67 (3), p.439-457
rights
0The Author(s) 2017
1The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com 2017
lds50peer_reviewed
oafree_for_read
citedbyFETCH-LOGICAL-1396t-640a8a517458cd644e48daaa4937ec3804e1bf333c03dfe79da2872c742bc6f00
citesFETCH-LOGICAL-1396t-640a8a517458cd644e48daaa4937ec3804e1bf333c03dfe79da2872c742bc6f00
links
openurl$$Topenurl_article
thumbnail$$Usyndetics_thumb_exl
backlink$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/29088409$$D View this record in MEDLINE/PubMed
search
contributorKubatko, Laura
creatorcontrib
0Wen, Dingqiao
1Nakhleh, Luay
title
0Coestimating Reticulate Phylogenies and Gene Trees from Multilocus Sequence Data
1Systematic biology
addtitleSyst Biol
descriptionThe multispecies network coalescent (MSNC) is a stochastic process that captures how gene trees grow within the branches of a phylogenetic network. Coupling the MSNC with a stochastic mutational process that operates along the branches of the gene trees gives rise to a generative model of how multiple loci from within and across species evolve in the presence of both incomplete lineage sorting (ILS) and reticulation (e.g., hybridization). We report on a Bayesian method for sampling the parameters of this generative model, including the species phylogeny, gene trees, divergence times, and population sizes, from DNA sequences of multiple independent loci. We demonstrate the utility of our method by analyzing simulated data and reanalyzing an empirical data set. Our results demonstrate the significance of not only coestimating species phylogenies and gene trees, but also accounting for reticulation and ILS simultaneously. In particular, we show that when gene flow occurs, our method accurately estimates the evolutionary histories, coalescence times, and divergence times. Tree inference methods, on the other hand, underestimate divergence times and overestimate coalescence times when the evolutionary history is reticulate. While the MSNC corresponds to an abstract model of “intermixture,” we study the performance of the model and method on simulated data generated under a gene flow model. We show that the method accurately infers the most recent time at which gene flow occurs. Finally, we demonstrate the application of the new method to a 106-locus yeast data set.
subjectREGULAR ARTICLES
issn
01063-5157
11076-836X
fulltextfalse
rsrctypearticle
creationdate2018
recordtypearticle
recordideNqFkM9LwzAYhoMobk6PHpUevVST5UeTo0ydguLQCd5Kln6dGV0zkxTcf29Hp4Ignr588PB-bx6Ejgk-J1jRi7AOM-va8YEl30F9gjORSipedzdvQVNOeNZDByEsMCZEcLKPekOFpWRY9dFk5CBEu9TR1vPkCaI1TaUjJJO3deXmUFsIia6LZAw1JFMP7Vp6t0wemiraypkmJM_w3kBtILnSUR-ivVJXAY62c4Bebq6no9v0_nF8N7q8TwlVIqaCYS01Jxnj0hSCMWCy0FozRTMwVGIGZFZSSg2mRQmZKvRQZkOTseHMiBLjATrrclfetedDzJc2GKgqXYNrQk4Ul5xxwkiLph1qvAvBQ5mvfPtjv84JzjcS805i3kls-dNtdDNbQvFNf1lrAfor0NjYGnR19NpWf8ZuG7tm9W-Dkw5dhOj8TwHBJVFC4U_YTJok
startdate20180501
enddate20180501
creator
0Wen, Dingqiao
1Nakhleh, Luay
generalOxford University Press
scope
0NPM
1AAYXX
2CITATION
37X8
sort
creationdate20180501
titleCoestimating Reticulate Phylogenies and Gene Trees from Multilocus Sequence Data
authorWen, Dingqiao ; Nakhleh, Luay
facets
frbrtype5
frbrgroupidcdi_FETCH-LOGICAL-1396t-640a8a517458cd644e48daaa4937ec3804e1bf333c03dfe79da2872c742bc6f00
rsrctypearticles
prefilterarticles
languageeng
creationdate2018
topicREGULAR ARTICLES
toplevelpeer_reviewed
creatorcontrib
0Wen, Dingqiao
1Nakhleh, Luay
collection
0PubMed
1CrossRef
2MEDLINE - Academic
jtitleSystematic biology
delivery
delcategoryRemote Search Resource
fulltextno_fulltext
addata
au
0Wen, Dingqiao
1Nakhleh, Luay
formatjournal
genrearticle
ristypeJOUR
atitleCoestimating Reticulate Phylogenies and Gene Trees from Multilocus Sequence Data
jtitleSystematic biology
addtitleSyst Biol
date2018-05-01
risdate2018
volume67
issue3
spage439
epage457
pages439-457
issn1063-5157
eissn1076-836X
abstractThe multispecies network coalescent (MSNC) is a stochastic process that captures how gene trees grow within the branches of a phylogenetic network. Coupling the MSNC with a stochastic mutational process that operates along the branches of the gene trees gives rise to a generative model of how multiple loci from within and across species evolve in the presence of both incomplete lineage sorting (ILS) and reticulation (e.g., hybridization). We report on a Bayesian method for sampling the parameters of this generative model, including the species phylogeny, gene trees, divergence times, and population sizes, from DNA sequences of multiple independent loci. We demonstrate the utility of our method by analyzing simulated data and reanalyzing an empirical data set. Our results demonstrate the significance of not only coestimating species phylogenies and gene trees, but also accounting for reticulation and ILS simultaneously. In particular, we show that when gene flow occurs, our method accurately estimates the evolutionary histories, coalescence times, and divergence times. Tree inference methods, on the other hand, underestimate divergence times and overestimate coalescence times when the evolutionary history is reticulate. While the MSNC corresponds to an abstract model of “intermixture,” we study the performance of the model and method on simulated data generated under a gene flow model. We show that the method accurately infers the most recent time at which gene flow occurs. Finally, we demonstrate the application of the new method to a 106-locus yeast data set.
copEngland
pubOxford University Press
pmid29088409
doi10.1093/sysbio/syx085
oafree_for_read