dc.contributor.author
Hühn, Philipp
dc.contributor.author
Dillenberger, Markus S.
dc.contributor.author
Gerschwitz-Eidt, Michael
dc.contributor.author
Hörandl, Elvira
dc.contributor.author
Los, Jessica A.
dc.contributor.author
Messerschmid, Thibaud F. E.
dc.contributor.author
Paetzold, Claudia
dc.contributor.author
Rieger, Benjamin
dc.contributor.author
Kadereit, Gudrun
dc.date.accessioned
2022-01-10T10:19:29Z
dc.date.available
2022-01-10T10:19:29Z
dc.identifier.uri
https://refubium.fu-berlin.de/handle/fub188/33406
dc.identifier.uri
http://dx.doi.org/10.17169/refubium-33127
dc.description.abstract
Analysing multiple genomic regions while incorporating detection and qualification of discordance among regions has become standard for understanding phylogenetic relationships. In plants, which usually have comparatively large genomes, this is feasible by the combination of reduced-representation library (RRL) methods and high-throughput sequencing enabling the cost effective acquisition of genomic data for thousands of loci from hundreds of samples. One popular RRL method is RADseq. A major disadvantage of established RADseq approaches is the rather short fragment and sequencing range, leading to loci of little individual phylogenetic information. This issue hampers the application of coalescent-based species tree inference. The modified RADseq protocol presented here targets ca. 5,000 loci of 300-600nt length, sequenced with the latest short-read-sequencing (SRS) technology, has the potential to overcome this drawback. To illustrate the advantages of this approach we use the study group Aichryson Webb & Berthelott (Crassulaceae), a plant genus that diversified on the Canary Islands. The data analysis approach used here aims at a careful quality control of the long loci dataset. It involves an informed selection of thresholds for accurate clustering, a thorough exploration of locus properties, such as locus length, coverage and variability, to identify potential biased data and a comparative phylogenetic inference of filtered datasets, accompanied by an evaluation of resulting BS support, gene and site concordance factor values, to improve overall resolution of the resulting phylogenetic trees. The final dataset contains variable loci with an average length of 373nt and facilitates species tree estimation using a coalescent-based summary approach. Additional improvements brought by the approach are critically discussed.
en
dc.format.extent
23 Seiten
dc.rights.uri
https://creativecommons.org/licenses/by/4.0/
dc.subject
clustering threshold selection
en
dc.subject
coalescent-based summary method
en
dc.subject
locus filtering
en
dc.subject
species tree inference
en
dc.subject.ddc
500 Naturwissenschaften und Mathematik::570 Biowissenschaften; Biologie::570 Biowissenschaften; Biologie
dc.title
How challenging RADseq data turned out to favor coalescent-based species tree inference. A case study in Aichryson (Crassulaceae)
dc.type
Wissenschaftlicher Artikel
dcterms.bibliographicCitation.articlenumber
107342
dcterms.bibliographicCitation.doi
10.1016/j.ympev.2021.107342
dcterms.bibliographicCitation.journaltitle
Molecular Phylogenetics and Evolution
dcterms.bibliographicCitation.volume
167
dcterms.bibliographicCitation.url
https://doi.org/10.1016/j.ympev.2021.107342
refubium.affiliation
Biologie, Chemie, Pharmazie
refubium.affiliation.other
Institut für Biologie
refubium.resourceType.isindependentpub
no
dcterms.accessRights.openaire
open access
dcterms.isPartOf.eissn
1095-9513
refubium.resourceType.provider
WoS-Alert