schliessen

Filtern

 

Bibliotheken

ProtASR: An Evolutionary Framework for Ancestral Protein Reconstruction with Selection on Folding Stability

The computational reconstruction of ancestral proteins provides information on past biological events and has practical implications for biomedicine and biotechnology. Currently available tools for ancestral sequence reconstruction (ASR) are often based on empirical amino acid substitution models th... Full description

Journal Title: Systematic biology 2017, Vol.66 (6), p.1054-1064
Main Author: Arenas, Miguel
Other Authors: Weber, Claudia C , Liberles, David A , Bastolla, Ugo
Format: Electronic Article Electronic Article
Language: English
Subjects:
Quelle: Alma/SFX Local Collection
Publisher: England: Oxford University Press
ID: ISSN: 1063-5157
Link: https://www.ncbi.nlm.nih.gov/pubmed/28057858
Zum Text:
SendSend as email Add to Book BagAdd to Book Bag
Staff View
recordid: cdi_proquest_miscellaneous_1856590935
title: ProtASR: An Evolutionary Framework for Ancestral Protein Reconstruction with Selection on Folding Stability
format: Article
creator:
  • Arenas, Miguel
  • Weber, Claudia C
  • Liberles, David A
  • Bastolla, Ugo
subjects:
  • Acids
  • Algorithms
  • Amino Acid Substitution
  • Amino acids
  • Bias
  • Biotechnology
  • Classification - methods
  • Computer applications
  • DNA, Ancient - chemistry
  • Evolution
  • Evolution, Molecular
  • Models
  • Models, Biological
  • Phylogeny
  • Protein families
  • Protein Folding
  • Protein Stability
  • Proteins
  • Sequence Analysis, Protein
  • Simulation
  • Software
  • Software for Systematics and Evolution
ispartof: Systematic biology, 2017, Vol.66 (6), p.1054-1064
description: The computational reconstruction of ancestral proteins provides information on past biological events and has practical implications for biomedicine and biotechnology. Currently available tools for ancestral sequence reconstruction (ASR) are often based on empirical amino acid substitution models that assume that all sites evolve at the same rate and under the same process. However, this assumption is frequently violated because protein evolution is highly heterogeneous due to different selective constraints among sites. Here, we present ProtASR, a new evolutionary framework to infer ancestral protein sequences accounting for selection on protein stability. First, ProtASR generates site-specific substitution matrices through the structurally constrained mean-field (MF) substitution model, which considers both unfolding and misfolding stability. We previously showed that MF models outperform empirical amino acid substitution models, as well as other structurally constrained substitution models, both in terms of likelihood and correctly inferring amino acid distributions across sites. In the second step, ProtASR adapts a well-established maximum-likelihood (ML) ASR procedure to infer ancestral proteins under MF models. A known bias of ML ASR methods is that they tend to overestimate the stability of ancestral proteins by underestimating the frequency of deleterious mutations. We compared ProtASR under MF to two empirical substitution models (JTT and CAT), reconstructing the ancestral sequences of simulated proteins. ProtASR yields reconstructed proteins with less biased stabilities, which are significantly closer to those of the simulated proteins. Analysis of extant protein families suggests that folding stability evolves through time across protein families, potentially reflecting neutral fluctuation. Some families exhibit a more constant protein folding stability, while others are more variable. ProtASR is freely available from https://github.com/miguelarenas/protasr and includes detailed documentation and ready-to-use examples. It runs in seconds/minutes depending on protein length and alignment size.
language: eng
source: Alma/SFX Local Collection
identifier: ISSN: 1063-5157
fulltext: fulltext
issn:
  • 1063-5157
  • 1076-836X
url: Link


@attributes
NO1
SEARCH_ENGINEprimo_central_multiple_fe
SEARCH_ENGINE_TYPEPrimo Central Search Engine
RANK2.570833
LOCALfalse
PrimoNMBib
record
control
sourceidjstor_opena
recordidTN_cdi_proquest_miscellaneous_1856590935
sourceformatXML
sourcesystemPC
jstor_id26581831
sourcerecordid26581831
originalsourceidFETCH-LOGICAL-1458t-b751fe054e81e06f395bf68ecc80c8be6c7f52da979aa11833c03c2deff7a32b3
addsrcrecordideNp1kE1LxDAQQIMoq64ePerFi5dqpmmSyVHETwTFD_AW0jSFLrvNmqSI_94s1RUET5PDm0fmEXIA9BSoYmfxM9adz-MDStggO0ClKJCJt83VW7CCA5fbZDfGGaUAgsOEbJdIuUSOO2TyGHw6f37aI1utmUe3_z2n5PXq8uXiprh_uL69OL8voOKYilpyaB3llUNwVLRM8boV6KxFarF2wsqWl41RUhkDgIxZymzZuLaVhpU1m5K70euXrjddcHoZuoUJn9qbTje9S7oxydguOV2iRKiBoqytLKnlNJsFGMGaslIWsuxklC2Dfx9cTHrRRevmc9M7P0QNyAVXuRLP6PEfdOaH0OdTNSiJolIUVaaKkbLBxxhcu_4dUL3Krcfcesyd-aNv61AvXLOmf_pmgP0R5stM6nyfgunm_2oPx61ZTD78WgXHnBTYF59tlZo
sourcetypeOpen Access Repository
isCDItrue
recordtypearticle
pqid1978649089
display
typearticle
titleProtASR: An Evolutionary Framework for Ancestral Protein Reconstruction with Selection on Folding Stability
sourceAlma/SFX Local Collection
creatorArenas, Miguel ; Weber, Claudia C ; Liberles, David A ; Bastolla, Ugo
creatorcontribArenas, Miguel ; Weber, Claudia C ; Liberles, David A ; Bastolla, Ugo
descriptionThe computational reconstruction of ancestral proteins provides information on past biological events and has practical implications for biomedicine and biotechnology. Currently available tools for ancestral sequence reconstruction (ASR) are often based on empirical amino acid substitution models that assume that all sites evolve at the same rate and under the same process. However, this assumption is frequently violated because protein evolution is highly heterogeneous due to different selective constraints among sites. Here, we present ProtASR, a new evolutionary framework to infer ancestral protein sequences accounting for selection on protein stability. First, ProtASR generates site-specific substitution matrices through the structurally constrained mean-field (MF) substitution model, which considers both unfolding and misfolding stability. We previously showed that MF models outperform empirical amino acid substitution models, as well as other structurally constrained substitution models, both in terms of likelihood and correctly inferring amino acid distributions across sites. In the second step, ProtASR adapts a well-established maximum-likelihood (ML) ASR procedure to infer ancestral proteins under MF models. A known bias of ML ASR methods is that they tend to overestimate the stability of ancestral proteins by underestimating the frequency of deleterious mutations. We compared ProtASR under MF to two empirical substitution models (JTT and CAT), reconstructing the ancestral sequences of simulated proteins. ProtASR yields reconstructed proteins with less biased stabilities, which are significantly closer to those of the simulated proteins. Analysis of extant protein families suggests that folding stability evolves through time across protein families, potentially reflecting neutral fluctuation. Some families exhibit a more constant protein folding stability, while others are more variable. ProtASR is freely available from https://github.com/miguelarenas/protasr and includes detailed documentation and ready-to-use examples. It runs in seconds/minutes depending on protein length and alignment size.
identifier
0ISSN: 1063-5157
1EISSN: 1076-836X
2DOI: 10.1093/sysbio/syw121
3PMID: 28057858
languageeng
publisherEngland: Oxford University Press
subjectAcids ; Algorithms ; Amino Acid Substitution ; Amino acids ; Bias ; Biotechnology ; Classification - methods ; Computer applications ; DNA, Ancient - chemistry ; Evolution ; Evolution, Molecular ; Models ; Models, Biological ; Phylogeny ; Protein families ; Protein Folding ; Protein Stability ; Proteins ; Sequence Analysis, Protein ; Simulation ; Software ; Software for Systematics and Evolution
ispartofSystematic biology, 2017, Vol.66 (6), p.1054-1064
rights
0The Author(s) 2017
1The Author(s) 2017. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For Permissions, please email: journals.permissions@oup.com.
lds50peer_reviewed
oafree_for_read
citedbyFETCH-LOGICAL-1458t-b751fe054e81e06f395bf68ecc80c8be6c7f52da979aa11833c03c2deff7a32b3
links
openurl$$Topenurl_article
openurlfulltext$$Topenurlfull_article
thumbnail$$Usyndetics_thumb_exl
backlink$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/28057858$$D View this record in MEDLINE/PubMed
search
creatorcontrib
0Arenas, Miguel
1Weber, Claudia C
2Liberles, David A
3Bastolla, Ugo
title
0ProtASR: An Evolutionary Framework for Ancestral Protein Reconstruction with Selection on Folding Stability
1Systematic biology
addtitleSyst Biol
descriptionThe computational reconstruction of ancestral proteins provides information on past biological events and has practical implications for biomedicine and biotechnology. Currently available tools for ancestral sequence reconstruction (ASR) are often based on empirical amino acid substitution models that assume that all sites evolve at the same rate and under the same process. However, this assumption is frequently violated because protein evolution is highly heterogeneous due to different selective constraints among sites. Here, we present ProtASR, a new evolutionary framework to infer ancestral protein sequences accounting for selection on protein stability. First, ProtASR generates site-specific substitution matrices through the structurally constrained mean-field (MF) substitution model, which considers both unfolding and misfolding stability. We previously showed that MF models outperform empirical amino acid substitution models, as well as other structurally constrained substitution models, both in terms of likelihood and correctly inferring amino acid distributions across sites. In the second step, ProtASR adapts a well-established maximum-likelihood (ML) ASR procedure to infer ancestral proteins under MF models. A known bias of ML ASR methods is that they tend to overestimate the stability of ancestral proteins by underestimating the frequency of deleterious mutations. We compared ProtASR under MF to two empirical substitution models (JTT and CAT), reconstructing the ancestral sequences of simulated proteins. ProtASR yields reconstructed proteins with less biased stabilities, which are significantly closer to those of the simulated proteins. Analysis of extant protein families suggests that folding stability evolves through time across protein families, potentially reflecting neutral fluctuation. Some families exhibit a more constant protein folding stability, while others are more variable. ProtASR is freely available from https://github.com/miguelarenas/protasr and includes detailed documentation and ready-to-use examples. It runs in seconds/minutes depending on protein length and alignment size.
subject
0Acids
1Algorithms
2Amino Acid Substitution
3Amino acids
4Bias
5Biotechnology
6Classification - methods
7Computer applications
8DNA, Ancient - chemistry
9Evolution
10Evolution, Molecular
11Models
12Models, Biological
13Phylogeny
14Protein families
15Protein Folding
16Protein Stability
17Proteins
18Sequence Analysis, Protein
19Simulation
20Software
21Software for Systematics and Evolution
issn
01063-5157
11076-836X
fulltexttrue
rsrctypearticle
creationdate2017
recordtypearticle
recordideNp1kE1LxDAQQIMoq64ePerFi5dqpmmSyVHETwTFD_AW0jSFLrvNmqSI_94s1RUET5PDm0fmEXIA9BSoYmfxM9adz-MDStggO0ClKJCJt83VW7CCA5fbZDfGGaUAgsOEbJdIuUSOO2TyGHw6f37aI1utmUe3_z2n5PXq8uXiprh_uL69OL8voOKYilpyaB3llUNwVLRM8boV6KxFarF2wsqWl41RUhkDgIxZymzZuLaVhpU1m5K70euXrjddcHoZuoUJn9qbTje9S7oxydguOV2iRKiBoqytLKnlNJsFGMGaslIWsuxklC2Dfx9cTHrRRevmc9M7P0QNyAVXuRLP6PEfdOaH0OdTNSiJolIUVaaKkbLBxxhcu_4dUL3Krcfcesyd-aNv61AvXLOmf_pmgP0R5stM6nyfgunm_2oPx61ZTD78WgXHnBTYF59tlZo
startdate20171101
enddate20171101
creator
0Arenas, Miguel
1Weber, Claudia C
2Liberles, David A
3Bastolla, Ugo
general
0Oxford University Press
1Oxford University Press (OUP)
scope
0CGR
1CUY
2CVF
3ECM
4EIF
5NPM
6AAYXX
7CITATION
8K9.
97X8
10CLFQK
sort
creationdate20171101
titleProtASR
authorArenas, Miguel ; Weber, Claudia C ; Liberles, David A ; Bastolla, Ugo
facets
frbrtype5
frbrgroupidcdi_FETCH-LOGICAL-1458t-b751fe054e81e06f395bf68ecc80c8be6c7f52da979aa11833c03c2deff7a32b3
rsrctypearticles
prefilterarticles
languageeng
creationdate2017
topic
0Acids
1Algorithms
2Amino Acid Substitution
3Amino acids
4Bias
5Biotechnology
6Classification - methods
7Computer applications
8DNA, Ancient - chemistry
9Evolution
10Evolution, Molecular
11Models
12Models, Biological
13Phylogeny
14Protein families
15Protein Folding
16Protein Stability
17Proteins
18Sequence Analysis, Protein
19Simulation
20Software
21Software for Systematics and Evolution
toplevel
0peer_reviewed
1online_resources
creatorcontrib
0Arenas, Miguel
1Weber, Claudia C
2Liberles, David A
3Bastolla, Ugo
collection
0Medline
1MEDLINE
2MEDLINE (Ovid)
3MEDLINE
4MEDLINE
5PubMed
6CrossRef
7ProQuest Health & Medical Complete (Alumni)
8MEDLINE - Academic
9OpenAIRE
jtitleSystematic biology
delivery
delcategoryRemote Search Resource
fulltextfulltext
addata
au
0Arenas, Miguel
1Weber, Claudia C
2Liberles, David A
3Bastolla, Ugo
formatjournal
genrearticle
ristypeJOUR
atitleProtASR
jtitleSystematic biology
addtitleSyst Biol
date2017-11-01
risdate2017
volume66
issue6
spage1054
epage1064
pages1054-1064
issn1063-5157
eissn1076-836X
abstractThe computational reconstruction of ancestral proteins provides information on past biological events and has practical implications for biomedicine and biotechnology. Currently available tools for ancestral sequence reconstruction (ASR) are often based on empirical amino acid substitution models that assume that all sites evolve at the same rate and under the same process. However, this assumption is frequently violated because protein evolution is highly heterogeneous due to different selective constraints among sites. Here, we present ProtASR, a new evolutionary framework to infer ancestral protein sequences accounting for selection on protein stability. First, ProtASR generates site-specific substitution matrices through the structurally constrained mean-field (MF) substitution model, which considers both unfolding and misfolding stability. We previously showed that MF models outperform empirical amino acid substitution models, as well as other structurally constrained substitution models, both in terms of likelihood and correctly inferring amino acid distributions across sites. In the second step, ProtASR adapts a well-established maximum-likelihood (ML) ASR procedure to infer ancestral proteins under MF models. A known bias of ML ASR methods is that they tend to overestimate the stability of ancestral proteins by underestimating the frequency of deleterious mutations. We compared ProtASR under MF to two empirical substitution models (JTT and CAT), reconstructing the ancestral sequences of simulated proteins. ProtASR yields reconstructed proteins with less biased stabilities, which are significantly closer to those of the simulated proteins. Analysis of extant protein families suggests that folding stability evolves through time across protein families, potentially reflecting neutral fluctuation. Some families exhibit a more constant protein folding stability, while others are more variable. ProtASR is freely available from https://github.com/miguelarenas/protasr and includes detailed documentation and ready-to-use examples. It runs in seconds/minutes depending on protein length and alignment size.
copEngland
pubOxford University Press
pmid28057858
doi10.1093/sysbio/syw121
oafree_for_read