Disentangling sources of gene tree discordance in phylogenomic datasets: testing ancient hybridizations in Amaranthaceae s.l

Main Authors: Morales-Briones, Diego F., Kadereit, Gudrun, Tefarikis, Delphine, Moore, Michael, Smith, Stephen, Brockington, Samuel, Timoneda, Alfonso, Yim, Won, Cushman, John, Yang, Ya
Format: info dataset eJournal
Terbitan: , 2020
Subjects:
Online Access: https://zenodo.org/record/3987463
ctrlnum 3987463
fullrecord <?xml version="1.0"?> <dc schemaLocation="http://www.openarchives.org/OAI/2.0/oai_dc/ http://www.openarchives.org/OAI/2.0/oai_dc.xsd"><creator>Morales-Briones, Diego F.</creator><creator>Kadereit, Gudrun</creator><creator>Tefarikis, Delphine</creator><creator>Moore, Michael</creator><creator>Smith, Stephen</creator><creator>Brockington, Samuel</creator><creator>Timoneda, Alfonso</creator><creator>Yim, Won</creator><creator>Cushman, John</creator><creator>Yang, Ya</creator><date>2020-08-28</date><description>Gene tree discordance in large genomic datasets can be caused by evolutionary processes such as incomplete lineage sorting and hybridization, as well as model violation, and errors in data processing, orthology inference, and gene tree estimation. Species tree methods that identify and accommodate all sources of conflict are not available, but a combination of multiple approaches can help tease apart alternative sources of conflict. Here, using a phylotranscriptomic analysis in combination with reference genomes, we test a hypothesis of ancient hybridization events within the plant family Amaranthaceae s.l. that was previously supported by morphological, ecological, and Sanger-based molecular data. The dataset included seven genomes and 88 transcriptomes, 17 generated for this study. We examined gene-tree discordance using coalescent-based species trees and network inference, gene tree discordance analyses, site pattern tests of introgression, topology tests, synteny analyses, and simulations. We found that a combination of processes might have generated the high levels of gene tree discordance in the backbone of Amaranthaceae s.l. Furthermore, we found evidence that three consecutive short internal branches produce anomalous trees contributing to the discordance. Overall, our results suggest that Amaranthaceae s.l. might be a product of an ancient and rapid lineage diversification, and remains, and probably will remain, unresolved. This work highlights the potential problems of identifiability associated with the sources of gene tree discordance including, in particular, phylogenetic network methods. Our results also demonstrate the importance of thoroughly testing for multiple sources of conflict in phylogenomic analyses, especially in the context of ancient, rapid radiations. We provide several recommendations for exploring conflicting signals in such situations.</description><description>- The file 'Supplementa_Methods_and_Materials.tar.gz' contains the supplemental methods, figures and tables referenced in the main text - The file 'Homologs.tar.gz' contains the 14584 homolog trees: raw_homologs.tar.gz - trees without any filtering or pruning final_homologs.tar.gz - trees after, monophyletic and paraphyletic grades of the same species masked, deep paralogs prunned, and spurious tips removed. - The file 'Analyses_data.tar.gz' contains the data (alignments and individual gene trees) used for each of the dataset: filtered_transcriptomes.tar.gz - 88 filtered transcriptomes all_13025_orthologs_cln_aln.tar.gz - all the 13025 'monophyletic outgroup' orthologs 105-taxon.tar.gz - 936 alignments and trees of the full 105-taxon analyses 41-taxon.tar.gz - 1242 alignments and trees of the 41-taxon cloudogram 11-taxon-net.tar.gz - 4138 alignments and trees of the 11-taxon(net) used for network analyses 4-taxon.tar.gz - alignments and trees (between 7,756 and 8,793) for each of the 10 4-taxon quartets 11-taxon-tree.tar.gz - 5936 alignments and trees of the 11-taxon(tree) analyses chloroplast.tar.gz - 11-taxon alignment and tree and 76 individual CDS alignment and trees of the plastid analyses Funding provided by: University of MinnesotaCrossref Funder Registry ID: http://dx.doi.org/10.13039/100007249Funding provided by: University of MichiganCrossref Funder Registry ID: http://dx.doi.org/10.13039/100007270Funding provided by: US National Science FoundationCrossref Funder Registry ID: http://dx.doi.org/10.13039/100000001Award Number: DEB 1354048Funding provided by: Department of Energy, Office of Science, Genomic Science ProgramCrossref Funder Registry ID: Award Number: DE-SC0008834</description><identifier>https://zenodo.org/record/3987463</identifier><identifier>10.5061/dryad.ns1rn8pq4</identifier><identifier>oai:zenodo.org:3987463</identifier><relation>doi:10.1093/sysbio/syaa066</relation><relation>url:https://zenodo.org/communities/dryad</relation><rights>info:eu-repo/semantics/openAccess</rights><rights>https://creativecommons.org/publicdomain/zero/1.0/legalcode</rights><subject>Amaranthaceae</subject><subject>gene tree discordance</subject><subject>species network</subject><title>Disentangling sources of gene tree discordance in phylogenomic datasets: testing ancient hybridizations in Amaranthaceae s.l.</title><type>Other:info:eu-repo/semantics/other</type><type>Other:dataset</type><recordID>3987463</recordID></dc>
format Other:info:eu-repo/semantics/other
Other
Other:dataset
Journal:eJournal
Journal
author Morales-Briones, Diego F.
Kadereit, Gudrun
Tefarikis, Delphine
Moore, Michael
Smith, Stephen
Brockington, Samuel
Timoneda, Alfonso
Yim, Won
Cushman, John
Yang, Ya
title Disentangling sources of gene tree discordance in phylogenomic datasets: testing ancient hybridizations in Amaranthaceae s.l
publishDate 2020
topic Amaranthaceae
gene tree discordance
species network
url https://zenodo.org/record/3987463
contents Gene tree discordance in large genomic datasets can be caused by evolutionary processes such as incomplete lineage sorting and hybridization, as well as model violation, and errors in data processing, orthology inference, and gene tree estimation. Species tree methods that identify and accommodate all sources of conflict are not available, but a combination of multiple approaches can help tease apart alternative sources of conflict. Here, using a phylotranscriptomic analysis in combination with reference genomes, we test a hypothesis of ancient hybridization events within the plant family Amaranthaceae s.l. that was previously supported by morphological, ecological, and Sanger-based molecular data. The dataset included seven genomes and 88 transcriptomes, 17 generated for this study. We examined gene-tree discordance using coalescent-based species trees and network inference, gene tree discordance analyses, site pattern tests of introgression, topology tests, synteny analyses, and simulations. We found that a combination of processes might have generated the high levels of gene tree discordance in the backbone of Amaranthaceae s.l. Furthermore, we found evidence that three consecutive short internal branches produce anomalous trees contributing to the discordance. Overall, our results suggest that Amaranthaceae s.l. might be a product of an ancient and rapid lineage diversification, and remains, and probably will remain, unresolved. This work highlights the potential problems of identifiability associated with the sources of gene tree discordance including, in particular, phylogenetic network methods. Our results also demonstrate the importance of thoroughly testing for multiple sources of conflict in phylogenomic analyses, especially in the context of ancient, rapid radiations. We provide several recommendations for exploring conflicting signals in such situations.
- The file 'Supplementa_Methods_and_Materials.tar.gz' contains the supplemental methods, figures and tables referenced in the main text - The file 'Homologs.tar.gz' contains the 14584 homolog trees: raw_homologs.tar.gz - trees without any filtering or pruning final_homologs.tar.gz - trees after, monophyletic and paraphyletic grades of the same species masked, deep paralogs prunned, and spurious tips removed. - The file 'Analyses_data.tar.gz' contains the data (alignments and individual gene trees) used for each of the dataset: filtered_transcriptomes.tar.gz - 88 filtered transcriptomes all_13025_orthologs_cln_aln.tar.gz - all the 13025 'monophyletic outgroup' orthologs 105-taxon.tar.gz - 936 alignments and trees of the full 105-taxon analyses 41-taxon.tar.gz - 1242 alignments and trees of the 41-taxon cloudogram 11-taxon-net.tar.gz - 4138 alignments and trees of the 11-taxon(net) used for network analyses 4-taxon.tar.gz - alignments and trees (between 7,756 and 8,793) for each of the 10 4-taxon quartets 11-taxon-tree.tar.gz - 5936 alignments and trees of the 11-taxon(tree) analyses chloroplast.tar.gz - 11-taxon alignment and tree and 76 individual CDS alignment and trees of the plastid analyses Funding provided by: University of MinnesotaCrossref Funder Registry ID: http://dx.doi.org/10.13039/100007249Funding provided by: University of MichiganCrossref Funder Registry ID: http://dx.doi.org/10.13039/100007270Funding provided by: US National Science FoundationCrossref Funder Registry ID: http://dx.doi.org/10.13039/100000001Award Number: DEB 1354048Funding provided by: Department of Energy, Office of Science, Genomic Science ProgramCrossref Funder Registry ID: Award Number: DE-SC0008834
id IOS17403.3987463
institution Universitas PGRI Palembang
institution_id 189
institution_type library:university
library
library Perpustakaan Universitas PGRI Palembang
library_id 587
collection Marga Life in South Sumatra in the Past: Puyang Concept Sacrificed and Demythosized
repository_id 17403
city KOTA PALEMBANG
province SUMATERA SELATAN
repoId IOS17403
first_indexed 2022-07-26T01:28:19Z
last_indexed 2022-07-26T01:28:19Z
recordtype dc
_version_ 1739406640209199104
score 17.60987