Young JJ, Grayson P, Edwards SV, Tabin CJ. Attenuated Fgf Signaling Underlies the Forelimb Heterochrony in the Emu Dromaius novaehollandiae. Current Biology [Internet]. 2019;29 (21) :3681 - 3691.e5. Publisher's VersionAbstract
Summary Powered flight was fundamental to the establishment and radiation of birds. However, flight has been lost multiple times throughout avian evolution. Convergent losses of flight within the ratites (flightless paleognaths, including the emu and ostrich) often coincide with reduced wings. Although there is a wealth of anatomical knowledge for several ratites, the genetic mechanisms causing these changes remain debated. Here, we use a multidisciplinary approach employing embryological, genetic, and genomic techniques to interrogate the mechanisms underlying forelimb heterochrony in emu embryos. We show that the initiation of limb formation, an epithelial to mesenchymal transition (EMT) in the lateral plate mesoderm (LPM) and myoblast migration into the LPM, occur at equivalent stages in the emu and chick. However, the emu forelimb fails to subsequently proliferate. The unique emu forelimb expression of Nkx2.5, previously associated with diminished wing development, initiates after this stage (concomitant with myoblast migration into the LPM) and is therefore unlikely to cause this developmental delay. In contrast, RNA sequencing of limb tissue reveals significantly lower Fgf10 expression in the emu forelimb. Artificially increasing Fgf10 expression in the emu LPM induces ectodermal Fgf8 expression and a limb bud. Analyzing open chromatin reveals differentially active regulatory elements near Fgf10 and Sall-1 in the emu wing, and the Sall-1 enhancer activity is dependent on a likely Fgf-mediated Ets transcription factor-binding site. Taken together, our results suggest that regulatory changes result in lower expression of Fgf10 and a concomitant failure to express genes required for limb proliferation in the early emu wing bud.
Liu L, Anderson C, Pearl D, Edwards S. Modern Phylogenomics: Building Phylogenetic Trees Using the Multispecies Coalescent Model. New York, NY: Humana; 2019 pp. 211-239. Publisher's VersionAbstract
The multispecies coalescent (MSC) model provides a compelling framework for building phylogenetic trees from multilocus DNA sequence data. The pure MSC is best thought of as a special case of so-called “multispecies network coalescent” models, in which gene flow is allowed among branches of the tree, whereas MSC methods assume there is no gene flow between diverging species. Early implementations of the MSC, such as “parsimony” or “democratic vote” approaches to combining information from multiple gene trees, as well as concatenation, in which DNA sequences from multiple gene trees are combined into a single “supergene,” were quickly shown to be inconsistent in some regions of tree space, in so far as they converged on the incorrect species tree as more gene trees and sequence data were accumulated. The anomaly zone, a region of tree space in which the most frequent gene tree is different from the species tree, is one such region where many so-called “coalescent” methods are inconsistent. Second-generation implementations of the MSC employed Bayesian or likelihood models; these are consistent in all regions of gene tree space, but Bayesian methods in particular are incapable of handling the large phylogenomic data sets currently available. Two-step methods, such as MP-EST and ASTRAL, in which gene trees are first estimated and then combined to estimate an overarching species tree, are currently popular in part because they can handle large phylogenomic data sets. These methods are consistent in the anomaly zone but can sometimes provide inappropriate measures of tree support or apportion error and signal in the data inappropriately. MP-EST in particular employs a likelihood model which can be conveniently manipulated to perform statistical tests of competing species trees, incorporating the likelihood of the collected gene trees on each species tree in a likelihood ratio test. Such tests provide a useful alternative to the multilocus bootstrap, which only indirectly tests the appropriateness of competing species trees. We illustrate these tests and implementations of the MSC with examples and suggest that MSC methods are a useful class of models effectively using information from multiple loci to build phylogenetic trees.
Lindsay WR, Andersson S, Bererhi B, Höglund J, Johnsen A, Kvarnemo C, Leder EH, Lifjel JT, Ninnes CE, Olsson M, et al. Endless forms of sexual selection. PeerJ [Internet]. 2019;7 (e27584v1). Publisher's VersionAbstract

In recent years, the field of sexual selection has exploded, with advances in theoretical and empirical research complementing each other in exciting ways. This perspective piece is the product of a “stock-taking” workshop on sexual selection and conflict. Our aim is to identify and deliberate on outstanding questions and to stimulate discussion rather than provide a comprehensive overview of the entire field. These questions are organized into four thematic sections we deem essential to the field. First we focus on the evolution of mate choice and mating systems. Variation in mate quality can generate both competition and choice in the opposite sex, with implications for the evolution of mating systems. Limitations on mate choice may dictate the importance of direct vs. indirect benefits in mating decisions and consequently, mating systems, especially with regard to polyandry. Second, we focus on how sender and receiver mechanisms shape signal design. Mediation of honest signal content likely depends on integration of temporally variable social and physiological costs that are challenging to measure. We view the neuroethology of sensory and cognitive receiver biases as the main key to signal form and the ‘aesthetic sense’ proposed by Darwin. Since a receiver bias is sufficient to both initiate and drive ornament or armament exaggeration, without a genetically correlated or even coevolving receiver, this may be the appropriate ‘null model’ of sexual selection. Thirdly, we focus on the genetic architecture of sexually selected traits. Despite advances in modern molecular techniques, the number and identity of genes underlying performance, display and secondary sexual traits remains largely unknown. In-depth investigations into the genetic basis of sexual dimorphism in the context of long-term field studies will reveal constraints and trajectories of sexually selected trait evolution. Finally, we focus on sexual selection and conflict as drivers of speciation. Population divergence and speciation are often influenced by an interplay between sexual and natural selection. The extent to which sexual selection promotes or counteracts population divergence may vary depending on the genetic architecture of traits as well as the covariance between mating competition and local adaptation. Additionally, post-copulatory processes, such as selection against heterospecific sperm, may influence the importance of sexual selection in speciation. We propose that efforts to resolve these four themes can catalyze conceptual progress in the field of sexual selection, and we offer potential avenues of research to advance this progress.
Bakker FT, Antonelli A, Clarke JA, Cook JA, Edwards SV, Ericson PGP, Faurby S, Ferrand N, Gelang M, Gillespie RG, et al. The Global Museum: natural history collections and the future of evolutionary biology and public education. PeerJ [Internet]. 2019;7 (e27666v1). Publisher's VersionAbstract

Natural history museums are unique spaces for interdisciplinary research and for educational innovation. Through extensive exhibits and public programming and by hosting rich communities of amateurs, students, and researchers at all stages of their careers, they provide a place-based window to focus on integration of science and discovery, as well as a locus for community engagement. At the same time, like a synthesis radio telescope, when joined together through emerging digital resources, the global community of museums (the ‘Global Museum’) is more than the sum of its parts, allowing insights and answers to diverse biological, environmental, and societal questions at the global scale, across eons of time, and spanning vast diversity across the Tree of Life. We argue that, whereas natural history collections and museums began with a focus on describing the diversity and peculiarities of species on Earth, they are now increasingly leveraged in new ways that significantly expand their impact and relevance. These new directions include the possibility to ask new, often interdisciplinary questions in basic and applied science; inform biomimetic design; and even provide solutions to climate change, global health and food security challenges. As institutions, they are incubators for cutting-edge research in biology and simultaneously protect core infrastructure for present and future societal needs. In this perspective, we discuss challenges to the realization of the full potential of natural history collections and museums to serve society. After reviewing collections and types of museums, including local and global efforts, we discuss the value of specimens and the importance of observations. We then focus on mapping and modelling of museum data (including place-based approaches and discovery), and explore the main projects, platforms and databases enabling this. We also explore ways in which improved infrastructure will allow higher quality science and increased opportunities for interdisciplinary research and communication, as well as new uses of collections. Finally, we aim to improve relevant protocols for the long-term storage of specimens and tissues, ensuring proper connection with tomorrow’s technologies and hence further increasing the relevance of natural history museums.
Näpflin K, Becks L, Bensch S, Ellis VA, Hafer-Hahmann N, Harding KC, Lindén SK, O'Connor EA, Olsen MT, Roved J, et al. Genomics of hosts-pathogen interactions: challenges and opportunities across ecological and spatiotemporal scales. PeerJ [Internet]. 2019;(e27734v1). Publisher's VersionAbstract
Evolutionary genomics has recently entered a new era in the study of host-pathogen interactions. A variety of novel genomic techniques has transformed to the identification, detection and classification of both hosts and pathogens, allowing a greater resolution that helps decipher their underlying dynamics and provides novel insights into their environmental context. Nevertheless, many challenges to a general understanding of hostpathogen interactions remain, in particular in the synthesis and integration of concepts and findings across a variety of systems and different spatiotemporal and ecological scales. In this perspective we aim to highlight some of the commonalities and complexities across diverse studies of host-pathogen interactions, with a focus on ecological, spatiotemporal variation, and the choice of genomic methods used. We performed a quantitative review of recent literature to investigate links, patterns and potential tradeoffs between the complexity of genomic, ecological and spatiotemporal scales undertaken in individual host-pathogen studies. We found that the majority of studies used whole genome resolution to address their research objectives across a broad range of ecological PeerJ Preprints | | CC BY 4.0 Open Access | rec: 15 May 2019, publ: 15 May 2019 scales, especially when focusing on the pathogen side of the interaction. Nevertheless, genomic studies conducted in a complex spatiotemporal context are currently rare in the literature. Because processes of host-pathogen interactions can be understood at multiple scales, from molecular-, cellular-, and physiological-scales to the levels of populations and ecosystems, we conclude that a major obstacle for synthesis across diverse host-pathogen systems is that data are collected on widely diverging scales with different degrees of resolution. This disparity not only hampers effective infrastructural organization of the data but also data granularity and accessibility. Comprehensive metadata deposited in association with genomic data in easily accessible databases will allow greater inference across systems in the future, especially when combined with open data standards and practices. The standardization and comparability of such data will facilitate early detection of emerging infectious diseases as well as studies of the impact of anthropogenic stressors, such as climate change, on disease dynamics in humans and wildlife.
Lamichhaney S, Card DC, Grayson P, Tonini JFR, Bravo GA, Näpflin K, Termignoni-Garcia F, Torres C, Burbrink F, Clarke JA, et al. Integrating natural history collections and comparative genomics to study the genetic architecture of convergent evolution. Philosophical Transactions of the Royal Society B [Internet]. 2019;374 (1777) :20180248. Publisher's VersionAbstract
Evolutionary convergence has been long considered primary evidence of adaptation driven by natural selection and provides opportunities to explore evolutionary repeatability and predictability. In recent years, there has been increased interest in exploring the genetic mechanisms underlying convergent evolution, in part, owing to the advent of genomic techniques. However, the current ‘genomics gold rush’ in studies of convergence has overshadowed the reality that most trait classifications are quite broadly defined, resulting in incomplete or potentially biased interpretations of results. Genomic studies of convergence would be greatly improved by integrating deep ‘vertical’, natural history knowledge with ‘horizontal’ knowledge focusing on the breadth of taxonomic diversity. Natural history collections have and continue to be best positioned for increasing our comprehensive understanding of phenotypic diversity, with modern practices of digitization and databasing of morphological traits providing exciting improvements in our ability to evaluate the degree of morphological convergence. Combining more detailed phenotypic data with the well-established field of genomics will enable scientists to make progress on an important goal in biology: to understand the degree to which genetic or molecular convergence is associated with phenotypic convergence. Although the fields of comparative biology or comparative genomics alone can separately reveal important insights into convergent evolution, here we suggest that the synergistic and complementary roles of natural history collection-derived phenomic data and comparative genomics methods can be particularly powerful in together elucidating the genomic basis of convergent evolution among higher taxa.
Xu L, Sin SYW, Grayson P, Edwards SV, Sackton TB. Evolutionary dynamics of sex chromosomes of paleognathous birds. Genome Biology & Evolution [Internet]. 2019;11 (8) :2376-2390. Publisher's VersionAbstract
Standard models of sex chromosome evolution propose that recombination suppression leads to the degeneration of the heterogametic chromosome, as is seen for the Y chromosome in mammals and the W chromosome in most birds. Unlike other birds, paleognaths (ratites and tinamous) possess large nondegenerate regions on their sex chromosomes (PARs or pseudoautosomal regions). It remains unclear why these large PARs are retained over >100 Myr, and how this retention impacts the evolution of sex chromosomes within this system. To address this puzzle, we analyzed Z chromosome evolution and gene expression across 12 paleognaths, several of whose genomes have recently been sequenced. We confirm at the genomic level that most paleognaths retain large PARs. As in other birds, we find that all paleognaths have incomplete dosage compensation on the regions of the Z chromosome homologous to degenerated portions of the W (differentiated regions), but we find no evidence for enrichments of male-biased genes in PARs. We find limited evidence for increased evolutionary rates (faster-Z) either across the chromosome or in differentiated regions for most paleognaths with large PARs, but do recover signals of faster-Z evolution in tinamou species with mostly degenerated W chromosomes, similar to the pattern seen in neognaths. Unexpectedly, in some species, PAR-linked genes evolve faster on average than genes on autosomes, suggested by diverse genomic features to be due to reduced efficacy of selection in paleognath PARs. Our analysis shows that paleognath Z chromosomes are atypical at the genomic level, but the evolutionary forces maintaining largely homomorphic sex chromosomes in these species remain elusive.
O'Connor EA, Westerdahl H, Burri R, Edwards SV. Avian MHC Evolution in the Era of Genomics: Phase 1.0. Cells8 [Internet]. 2019;(10) :1152. Publisher's VersionAbstract
Birds are a wonderfully diverse and accessible clade with an exceptional range of ecologies and behaviors, making the study of the avian major histocompatibility complex (MHC) of great interest. In the last 20 years, particularly with the advent of high-throughput sequencing, the avian MHC has been explored in great depth in several dimensions: its ability to explain ecological patterns in nature, such as mating preferences; its correlation with parasite resistance; and its structural evolution across the avian tree of life. Here, we review the latest pulse of avian MHC studies spurred by high-throughput sequencing. Despite high-throughput approaches to MHC studies, substantial areas remain in need of improvement with regard to our understanding of MHC structure, diversity, and evolution. Recent studies of the avian MHC have nonetheless revealed intriguing connections between MHC structure and life history traits, and highlight the advantages of long-term ecological studies for understanding the patterns of MHC variation in the wild. Given the exceptional diversity of birds, their accessibility, and the ease of sequencing their genomes, studies of avian MHC promise to improve our understanding of the many dimensions and consequences of MHC variation in nature. However, significant improvements in assembling complete MHC regions with long-read sequencing will be required for truly transformative studies.
Cloutier A, Sackton TB, Grayson P, Clamp M, Baker AJ, Edwards SV. Whole-genome analyses resolve the phylogeny of flightless birds (Palaeognathae) in the presence of an empirical anomaly zone. Systematic Biology. 2019;10.1101/26.
Hu Z, Sackton TB, Edwards SV, Liu JS. Bayesian Detection of Convergent Rate Changes of Conserved Noncoding Elements on Phylogenetic Trees Pond SK. Molecular Biology and Evolution [Internet]. 2019. Publisher's Version
Sackton TB, Grayson P, Cloutier A, Hu Z, Liu JS, Wheeler NE, Gardner PP, Clarke JA, Baker AJ, Clamp M, et al. Convergent regulatory evolution and loss of flight in paleognathous birds. Science (New York, N.Y.) [Internet]. 2019;364 (6435) :74–78. Publisher's VersionAbstract
A core question in evolutionary biology is whether convergent phenotypic evolution is driven by convergent molecular changes in proteins or regulatory regions. We combined phylogenomic, developmental, and epigenomic analysis of 11 new genomes of paleognathous birds, including an extinct moa, to show that convergent evolution of regulatory regions, more so than protein-coding genes, is prevalent among developmental pathways associated with independent losses of flight. A Bayesian analysis of 284,001 conserved noncoding elements, 60,665 of which are corroborated as enhancers by open chromatin states during development, identified 2355 independent accelerations along lineages of flightless paleognaths, with functional consequences for driving gene expression in the developing forelimb. Our results suggest that the genomic landscape associated with morphological convergence in ratites has a substantial shared regulatory component.
Bravo GA, Antonelli A, Bacon CD, Bartoszek K, Blom MPK, Huynh S, Jones G, Knowles LL, Lamichhaney S, Marcussen T, et al. Embracing heterogeneity: coalescing the Tree of Life and the future of phylogenomics. PeerJ [Internet]. 2019;7 :e6399. Publisher's VersionAbstract
Building the Tree of Life (ToL) is a major challenge of modern biology, requiring advances in cyberinfrastructure, data collection, theory, and more. Here, we argue that phylogenomics stands to benefit by embracing the many heterogeneous genomic signals emerging from the first decade of large-scale phylogenetic analysis spawned by high-throughput sequencing (HTS). Such signals include those most commonly encountered in phylogenomic datasets, such as incomplete lineage sorting, but also those reticulate processes emerging with greater frequency, such as recombination and introgression. Here we focus specifically on how phylogenetic methods can accommodate the heterogeneity incurred by such population genetic processes; we do not discuss phylogenetic methods that ignore such processes, such as concatenation or supermatrix approaches or supertrees. We suggest that methods of data acquisition and the types of markers used in phylogenomics will remain restricted until a posteriori methods of marker choice are
Liu L, Zhang J, Rheindt FR, Lei F, Qu Y, Wang Y, Zhang Y, Sullivan C, Nie W, Wang J, et al. Reply to Gatesy and Springer: Claims of homology errors and zombie lineages do not compromise the dating of placental diversification. Proceedings of the National Academy of Sciences. 2018;114 :E9433–E9434.
Antonelli A, Ariza M, Albert J, Andermann T, Azevedo J, Bacon C, Faurby S, Guedes T, Hoorn C, Lohmann LG, et al. Conceptual and empirical advances in Neotropical biodiversity research. PeerJ [Internet]. 2018;6 :e5644. Publisher's VersionAbstract
\textlessp\textgreaterThe unparalleled biodiversity found in the American tropics (the Neotropics) has attracted the attention of naturalists for centuries. Despite major advances in recent years in our understanding of the origin and diversification of many Neotropical taxa and biotic regions, many questions remain to be answered. Additional biological and geological data are still needed, as well as methodological advances that are capable of bridging these research fields. In this review, aimed primarily at advanced students and early-career scientists, we introduce the concept of “trans-disciplinary biogeography,” which refers to the integration of data from multiple areas of research in biology (e.g., community ecology, phylogeography, systematics, historical biogeography) and Earth and the physical sciences (e.g., geology, climatology, palaeontology), as a means to reconstruct the giant puzzle of Neotropical biodiversity and evolution in space and time. We caution against extrapolating results derived from the study of one or a few taxa to convey general scenarios of Neotropical evolution and landscape formation. We urge more coordination and integration of data and ideas among disciplines, transcending their traditional boundaries, as a basis for advancing tomorrow's ground-breaking research. Our review highlights the great opportunities for studying the Neotropical biota to understand the evolution of life.\textless/p\textgreater
Lewin HA, Robinson GE, Kress WJ, Baker WJ, Coddington J, Crandall KA, Durbin R, Edwards SV, Forest F, Gilbert MTP, et al. Earth BioGenome Project: Sequencing life for the future of life. Proceedings of the National Academy of Sciences [Internet]. 2018;115 (17) :4325–4333. Publisher's VersionAbstract
Increasing our understanding of Earth's biodiversity and responsibly stewarding its resources are among the most crucial scientific and social challenges of the new millennium. These challenges require fundamental new knowledge of the organization, evolution, functions, and interactions among millions of the planet's organisms. Herein, we present a perspective on the Earth BioGenome Project (EBP), a moonshot for biology that aims to sequence, catalog, and characterize the genomes of all of Earth's eukaryotic biodiversity over a period of 10 years. The outcomes of the EBP will inform a broad range of major issues facing humanity, such as the impact of climate change on biodiversity, the conservation of endangered species and ecosystems, and the preservation and enhancement of ecosystem services. We describe hurdles that the project faces, including data-sharing policies that ensure a permanent, freely available resource for future scientific discovery while respecting access and benefit sharing guidelines of the Nagoya Protocol. We also describe scientific and organizational challenges in executing such an ambitious project, and the structure proposed to achieve the project's goals. The far-reaching potential benefits of creating an open digital repository of genomic information for life on Earth can be realized only by a coordinated international effort.
Hoover B, Alcaide M, Jennings S, Sin SYW, Edwards SV, Nevitt GA. Ecology can inform genetics: Disassortative mating contributes to MHC polymorphism in Leach's storm-petrels (Oceanodroma leucorhoa). Molecular Ecology [Internet]. 2018;27 (16) :3371–3385. Publisher's VersionAbstract
Studies of MHC-based mate choice in wild populations often test hypotheses on species exhibiting female choice and male-male competition, which reflects the general prevalence of females as the choosy sex in natural systems. Here, we examined mutual mate-choice patterns in a small burrow-nesting seabird, the Leach's storm-petrel (Oceanodroma leucorhoa), using the major histocompatibility complex (MHC). The life history and ecology of this species are extreme: both partners work together to fledge a single chick during the breeding season, a task that requires regularly travelling hundreds of kilometres to and from foraging grounds over a 6- to 8-week provisioning period. Using a 5-year data set unprecedented for this species (n = 1078 adults and 925 chicks), we found a positive relationship between variation in the likelihood of female reproductive success and heterozygosity at Ocle-DAB2, a MHC class IIB locus. Contrary to previous reports rejecting disassortative mating as a mechanism for maintaining genetic polymorphism in this species, here we show that males make significant disassortative mate-choice decisions. Variability in female reproductive success suggests that the most common homozygous females (Ocle-DAB2*01/Ocle-DAB2*01) may be physiologically disadvantaged and, therefore, less preferred as lifelong partners for choosy males. The results from this study support the role of mate choice in maintaining high levels of MHC variability in a wild seabird species and highlight the need to incorporate a broader ecological framework and sufficient sample sizes into studies of MHC-based mating patterns in wild populations in general.
Schmitt CJ, Cook JA, Zamudio KR, Edwards SV. Museum specimens of terrestrial vertebrates are sensitive indicators of environmental change in the Anthropocene. Philosophical Transactions of the Royal Society B: Biological Sciences [Internet]. 2018;374 (1763) :20170387. Publisher's VersionAbstract
Natural history museums and the specimen collections they curate are vital scientific infrastructure, a fact as true today as it was when biologists began collecting and preserving specimens over 200 years ago. The importance of museum specimens in studies of taxonomy, systematics, ecology and evolutionary biology is evidenced by a rich and abundant literature, yet creative and novel uses of specimens are constantly broadening the impact of natural history collections on biodiversity science and global sustainability. Excellent examples of the critical importance of specimens come from their use in documenting the consequences of environmental change, which is particularly relevant considering the alarming rate at which we now modify our planet in the Anthropocene. In this review, we highlight the important role of bird, mammal and amphibian specimens in documenting the Anthropocene and provide examples that underscore the need for continued collection of museum specimens.This article is part of the theme issue 'Biological collections for understanding biodiversity in the Anthropocene'.
Tang Q, Edwards SV, Rheindt FE. Rapid diversification and hybridization have shaped the dynamic history of the genus Elaenia. Molecular phylogenetics and evolution [Internet]. 2018;127 :522–533. Publisher's VersionAbstract
Multi-locus data have proven invaluable in phylogenetic reconstruction and species delimitation. However, the mixed genetic signal from different loci can make inference of evolutionary history challenging and may produce incongruences depending on analytical and marker choice. Aside from incomplete lineage sorting (ILS) following diversification events that have had little time for deep differentiation, the most common causes of incongruent phylogenies are genetic introgression confounding a bifurcating evolutionary trajectory. In this study, we used multi-locus analytical approaches on sequence data of nine loci from 80 individuals of over 20 Neotropical Elaenia flycatcher species to examine the systematics, molecular phylogeny and species limits of this complex genus. Our results provide a robust phylogeny and estimates of species limits within Elaenia, but point to important cases of incongruences among phylogenies based on different analytical approaches. Simulations and estimates of divergence times provide reasonable explanations for the incongruent placement of some Elaenia taxa, pointing to multiple cases of both ILS and introgression within the genus. Molecular dating of major evolutionary events revealed intensive diversification during the Pleistocene, suggesting a central role of climate oscillations in the evolution of Elaenia flycatchers.
Ezaz T, Edwards SV. Editorial: Evolutionary Feedbacks Between Population Biology and Genome Architecture. Frontiers in Genetics [Internet]. 2018;9 :329. Publisher's Version
Wu S, Edwards S, Liu L. Genome-scale DNA sequence data and the evolutionary history of placental mammals. Data in Brief [Internet]. 2018;18 :1972–1975. Publisher's VersionAbstract
We present a genomic data set comprised of the coding DNA sequences of 5162 loci from 90 vertebrate species, including 82 mammals. The loci were aligned with their protein sequences. The aligned protein sequences were then back translated into their original DNA sequences. The alignments were further filtered to remove individual sequences from each alignment exhibiting long branches or other unusual features. The data is deposited in figshare (\ and will be useful as a test data set for large-scale phylogenomic analysis.