- Research article
- Open Access
The chordate ancestor possessed a single copy of the Brachyury gene for notochord acquisition
Zoological Lettersvolume 3, Article number: 4 (2017)
The T-box family transcription-factor gene, Brachyury, has two expression domains with discrete functions during animal embryogenesis. The primary domain, associated with the blastopore, is shared by most metazoans, while the secondary domain, involved in the notochord, is specific to chordates. In most animals, Brachyury is present in a single copy, but in cephalochordates, the most basal of the chordates, the gene is present in two copies, suggesting allotment of the two domains to each of the duplicates.
In order to clarify whether Brachyury duplication occurred in the common ancestor of chordates after which one of duplicates was lost in the urochordate and vertebrate lineages, we estimated phylogenetic relationships of Brachyury genes and examined the synteny of a Brachyury-containing genomic region of deuterostomes with decoded genomes. The monophyletic origin of tandemly arranged Brachyury genes of cephalochordates indicates that the tandem duplication occurred in the cephalochordate lineage, but not in the chordate ancestor.
Our results thus suggest that, in the common ancestor of chordates, a single copy of Brachyury acquired two expression domains and that the duplication was not involved in the acquisition of the notochord. However, in relation to regulatory mechanisms, both possibilities—namely a single copy with two domains and two copies with different domains—should be considered in future studies of Brachyury.
We are interested in genetic mechanisms involved in the origins and evolution of chordates . Chordates comprise three taxa, cephalochordates, urochordates or tunicates, and vertebrates . These are thought to have originated from a common ancestor of the deuterostomes, together with ambulacrarians, a clade containing echinoderms and hemichordates. The organ that best characterizes chordates is the notochord, an organ that supports the beating of the muscular tail of fish-like larvae or adults [1, 3, 4]. The T-box family transcription-factor gene, Brachyury, plays an essential role in notochord formation . In ascidians (urochordates), for example, Brachyury is expressed exclusively in primordial embryonic notochord cells [6, 7]. Loss of Brachyury function results in the failure of notochord formation, while its ectopic expression induces endoderm cells to become notochord cells [8, 9].
Interestingly, Brachyury is not specific to chordates, but is present in most metazoans, including non-chordate deuterostomes [5, 10]. Beside paralogs derived from ancient whole genome duplication (referred to as “ohnologs”), Brachyury is usually present as a single copy, with some exceptions (Hydra , calcisponge , and cephalochordates (see below)), and is expressed around the blastopore during gastrulation. A recent study by our group using coral embryos demonstrated that the evolutionarily conserved function of Brachyury is associated with formation of blastopore-derived organs, such as the pharynx of coral embryos . In non-chordate deuterostomes, ambulacrarians, Brachyury is expressed in the archenteron invagination region in early gastrulae and in the stomodeum invagination region in later stage embryos [14, 15]. Although an overexpression experiment with sea urchin Brachyury suggested its role in gastrulation , the developmental role of ambulacrarian Brachyury remains to be determined.
We proposed an evolutionary scenario for Brachyury emphasizing its primary and secondary expression domains and functions . Namely, ambulacrarians require Brachyury in its primary domain of expression and function associated with the blastopore (PEF), while chordates employ the gene, not only for PEF, but also for the secondary domain of expression and function associated with the notochord (SEF). In urochordates, the PEF was likely lost due to the precocious mode of embryogenesis [1, 5]. For the past decade, evolutionary developmental biologists have been asking how the chordate ancestor acquired the Brachyury SEF. Answering this question is critical to our understanding of genetic and molecular mechanisms involved in the origins of chordates. Genomes of the cephalochordates, Branchiostoma floridae and B. belcheri, each contain a set of duplicated Brachyury, Amphi-Bra1 and Amphi-Bra2 [17, 18]. These duplicated genes have both PEF and SEF, but the expression domains of each have not been determined [17, 19]. In contrast, urochordates have only a single copy . Most vertebrates also have a single copy of Brachyury/T, although they have variable numbers of T ohnologs, which arose from the two rounds of genome-wide gene duplication (2R-GWGD) that occurred in this lineage [21, 22]. In mice, only a T gene exhibits PEF and SEF during early embryogenesis .
The occurrence of two domains of Brachyury expression and function may be explained by one of two alternative evolutionary scenarios (Fig. 1). In scenario 1, Brachyury was present as a single copy in deuterostome ancestors, while it became duplicated in an ancestor of chordates, such that the original gene retained PEF and its newly formed counterpart obtained SEF (Fig. 1a). The cephalochordate lineage retained this arrangement, whereas the urochordate and vertebrate lineages lost one of duplicates. In scenario 2 (Fig. 1b), chordates retained a single copy of Brachyury as did the non-chordate invertebrates. Duplication occurred in the lineage leading to cephalochordates, but not in the lineage leading to olfactores (urochordates + vertebrates). Evolution of vertebrate T-box family genes seems to have been complicated by the 2R-GWGD, and vertebrate Brachyury/T needs to be examined more carefully in the future (Yasuoka et al., in preparation).
In 2015, the genomes of two hemichordate species, an indirectly developing acorn worm, Ptychodera lava, and a directly developing acorn worm, Saccoglossus kowalevskii, were decoded . Genomes of animals belonging to each of the four other phyla were already decoded prior to 2014; echinoderm sea urchin (Strongylocentrotus purpuratus; ) and starfish (Acanthaster planci ), cephalochordate amphioxus (Branchiostoma floridae  and Branchiostoma belcheri ), urochordates (e.g., Ciona intestinalis  and Oikopleura dioica ) and vertebrates (e.g., Homo sapiens ). We obtained genomic information for all five phyla of deuterostomes, enabling us to examine which of the two scenarios outlined above better explains the genetic and molecular evolution of Brachyury in relation to chordate origins.
Gene trees were estimated with an analytical pipeline implementing BLAST search and the maximum likelihood method (modified from Inoue et al. ).
Human and lancelet protein-coding sequences (amino acids) were used as queries for BLASTP search  against all protein-coding sequences in 18 selected animal genomes (Table 1). The resulting BLAST top 20 hits were screened using an E-value cutoff of <10−3 . Where transcript variants existed for a single locus, only the longest sequence was used in the present analysis.
The sequences of proteins obtained by the BLASTP search were aligned using MAFFT . Multiple sequence alignments were trimmed by removing poorly aligned regions using TRIMAL 1.2  with option “gappyout.” Corresponding cDNA sequences were forced onto the amino acid alignment using PAL2NAL  to generate nucleotide alignments for later comparative analysis. Each gene sequence was checked, and removed from the alignment as spurious BLAST hits if the sequence was shorter than 55% of the length of the query sequence in the unambiguously aligned sites.
Gene tree search
Phylogenetic analyses were conducted by the maximum likelihood method aligned with bootstrap analysis based upon 100 replicates. The first and second codon positions were used for DNA analysis. The analysis was performed by RAxML 8.2.4 , which invokes a rapid bootstrap analysis and search for the best scoring ML tree. The GTRGAMMA (general time-reversible  with the gamma ) and the PROTGAMMAWAGF (WAG  with gamma and empirical base frequencies) models were used for DNA and amino acid analyses, respectively. The synteny of a Brachyury-containing genomic region was assessed by identifying human/Drosophila ortholog of each neighboring gene by estimating each gene tree via our analytical pipeline.
We examined Brachyury of representative protostomes and deuterostomes, the genomes of which have been decoded (Table 1, with Brachyury gene/protein ID and information from the genomes). Phylogenetic relationships of deuterostome species are based on previous studies (e.g., ).
Copy number of Brachyury in deuterostomes and the chordate ancestor
Molecular cloning of mouse Brachyury was followed by identifying its orthologs in various metazoans (reviewed by [5, 40]). In addition, cloning of other T-box-containing transcription factor genes, including T-brain, Tbx1, Tbox2, and Tbx6, shows that they form a family called the T-box family (e.g., ). Although Brachyury has been identified and characterized in each bilaterian with a decoded genome, we examined Brachyury copy numbers in 18 species, including six protostomes and 12 deuterostomes (Table 1). To evaluate the two scenarios (Fig. 1), we did not include ohnologs in vertebrates as the copy number of Brachyury because that duplication is not associated with the acquisition of SEF.
To this end, we first carefully identified all T-box containing genes from the decoded genomes. In order to identify a candidate query sequence from non-vertebrate chordate lineages, using the human Brachyury amino acid sequence as a query, phylogenetic relationships were roughly estimated to delineate a clade comprising the Brachyury subfamily. Using queries of human and cephalochordate Brachyury amino acid sequences (Additional file 1), we next searched for the Brachyury gene in selected bilaterian genomes by estimating phylogenetic relationships (Additional file 2). Although no ortholog was found in the present genome assembly of the nematode, Caenorhabditis elegans, a single copy of Brachyury was found in each of the other bilaterians in both resultant trees based on nucleotide and amino acid datasets. An exception was that lancelets (cephalochordates) possessed two copies in their genome, as shown previously [17, 18].
In order to estimate the copy number of Brachyury in the chordate ancestor, we reconstructed phylogenetic relationships of Brachyury using only Brachyury gene sequences (Additional file 3) selected from the estimated tree of T-box containing genes (Additional file 2). Estimated gene trees based on the comparison of nuclear (Fig. 2) and amino acid (Additional file 4) sequences produced the same deuterostome relationships, except for the positions of the urochordate, Botryllus schlosseri and a clade consisting of the remaining urochordates. For subsequent discussion, we used the tree obtained from DNA analysis because of the longer sequences. The Brachyury tree (Fig. 2) differed from the species tree (e.g., ) in that, in the Brachyury tree, urochordate genes formed a sister clade with all remaining deuterostome genes. This may be the result of a faster evolutionary rate of urochordate proteins than in other deuterostome taxa. Our results (Fig. 2) indicate that, with bootstrap support of 97%, Amphi-Bra1 and Amphi-Bra2 of the two species of cephalochordates forms a monophyletic group, consistent with our previous study .
Given that cephalochordates are the only deuterostome taxon with the duplicated state of Brachyury, and that the two copies form a monophyletic clade (Fig. 2), it is highly likely that the duplication of Brachyury was specific to the cephalochordate lineage, and did not occur in the chordate ancestor. Namely Brachyury was present in a single copy in the chordate ancestor. Therefore, our results support the second of the two scenarios described above (Fig. 1b). In vertebrates (chicken and human, here), Brachyury/T forms a clade while Tbx19 forms another clade (Fig. 2). This suggests that 2R-GWGD resulted in divergence of the ancestral Brachyury into a clade including Brachyury or Tbx19.
Genomic organization of Brachyury in deuterostomes
Next, to gain a better understanding of the evolutionary changes in the genetic and genomic organization of Brachyury in relation to chordate evolution, we examined the synteny of genes in Brachyury-containing genomic regions, especially in deuterostome taxa. Although global synteny analyses have shown comparable and conserved synteny between cephalochordate and vertebrate genomes  and between hemichordate and cephalochordate genomes , no detailed analyses were carried out on genomic regions that contain Brachyury.
In the genomes of two cephalochordate species, B. floridae and B. belcheri, the two Brachyury were tandemly aligned (Fig. 3). In both genomes, although a neighboring NOTUM-like gene is present syntenically, synteny of neighboring genes is limited to the four genes. This suggests that the two copies of Brachyury arose from a tandem duplication, not from an ancient segmental duplication.
Beside the trait of tandem duplication of cephalochordate Brachyury, our synteny analysis characterized conserved synteny around Brachyury of major deuterostome lineages (Fig. 3). A global view of synteny indicates that it is comparable among ambulacrarian species, because genes with comparable sequences were found in echinoderms and an acorn worm. On the contrary, no conserved synteny was found among four species of tunicates. Although the synteny analysis among vertebrate genomes was complicated by 2R-GWGD, it is evident that there are no scaffolds with duplicated Brachyury/T or its ohnolog, Tbx19, in chicken or human genomes. Detailed synteny of the Brachyury-containing genomic region is discussed in Additional file 5.
Brachyury is thought to be the most ancient T-box family member (e.g., [21, 22]). It has been speculated that during their histories of over 500 million years, each of the five deuterostome taxa altered the genetic and genomic constitution of Brachyury from its original forms, rending analysis of the present question of Brachyury evolution more difficult than that of more recently evolved genes. Nevertheless, the tandem duplicated state of Brachyury is found only in cephalochordates. Recently, another example of tandem duplication of chordate Brachyury has been reported in an amphibian, Xenopus tropicalis . It thus appears likely that cases of Brachyury duplication  are exceptional. As previously described, Brachyury has two domains of expression and function. The primary domain of expression is in the blastopore during gastrulation (PEF), which is shared by all metazoans [5, 13]. The secondary domain is associated with the notochord (SEF), and is specific to chordates. It is tempting to speculate that one is the original Brachyury gene with its original function and the other is a new copy with a secondary function, and that the duplication occurred very early in chordate evolution. However, as shown in the present study, Brachyury was present as a single copy in the common ancestor of chordates and a single copy of Brachyury acquired the secondary function, SEF, in the chordate ancestor.
The presence of a single copy of Brachyury in the chordate ancestor indicates that the gene duplication was not associated with the acquisition of SEF leading to the development of the notochord. Considering that copy numbers of Brachyury vary between cephalochordates and other deuterostome lineages, the question of regulatory mechanisms still remains. Intensive analyses have been carried out to elucidate 5′ upstream sequences or modules that regulate PEF and SEF, respectively . Therefore, we have to keep the two cases, namely a single copy with two domains and two copies with different domains, in mind for future studies of Brachyury in relation to regulatory mechanisms.
Primary domain of expression and function associated with the blastopore
Secondary domain of expression and function associated with the notochord
Satoh N. Chordate origins and evolution: the molecular evolutionary road to vertebrates. Boston: Elsevier; 2016.
Satoh N, Tagawa K, Lowe CJ, Yu JK, Kawashima T, Takahashi H, et al. On a possible evolutionary link of the stomochord of hemichordates to pharyngeal organs of chordates. Genesis. 2014;52:925–34.
Kowalevsky A. Entwicklungsgeschichte der einfachen Ascidien. Mémoires de l’Académie impériale des sciences de St Pétersbourg. 1866;7:11–9.
Gee H. Before the backbone : views on the origin of the vertebrates. 1st ed. London; New York: Chapman & Hall; 1996.
Satoh N, Tagawa K, Takahashi H. How was the notochord born? Evol Dev. 2012;14:56–75.
Yasuo H, Satoh N. Function of vertebrate T gene. Nature. 1993;364:582–3.
Corbo JC, Levine M, Zeller RW. Characterization of a notochord-specific enhancer from the Brachyury promoter region of the ascidian, Ciona intestinalis. Development. 1997;124:589–602.
Takahashi H, Hotta K, Erives A, Di Gregorio A, Zeller RW, Levine M, et al. Brachyury downstream notochord differentiation in the ascidian embryo. Genes Dev. 1999;13:1519–23.
Chiba S, Jiang D, Satoh N, Smith WC. Brachyury null mutant-induced defects in juvenile ascidian endodermal organs. Development. 2009;136:35–9.
Technau U. Brachyury, the blastopore and the evolution of the mesoderm. Bioessays. 2001;23:788–94.
Bielen H, Oberleitner S, Marcellini S, Gee L, Lemaire P, Bode HR, et al. Divergent functions of two ancient Hydra Brachyury paralogues suggest specific roles for their C-terminal domains in tissue fate induction. Development. 2007;134:4187–97.
Leininger S, Adamski M, Bergum B, Guder C, Liu J, Laplante M, et al. Developmental gene expression provides clues to relationships between sponge and eumetazoan body plans. Nat Commun. 2014;5:3905.
Yasuoka Y, Shinzato C, Satoh N. The mesoderm-forming Gene brachyury regulates ectoderm-endoderm demarcation in the coral Acropora digitifera. Curr Biol. 2016;26:2885–92.
Peterson KJ, Harada Y, Cameron RA, Davidson EH. Expression pattern of Brachyury and Not in the sea urchin: Comparative implications for the origins of mesoderm in the basal deuterostomes. Dev Biol. 1999;207:419–31.
Tagawa K, Humphreys T, Satoh N. Novel pattern of Brachyury gene expression in hemichordate embryos. Mech Dev. 1998;75:139–43.
Gross JM, McClay DR. The role of Brachyury (T) during gastrulation movements in the sea urchin Lytechinus variegatus. Dev Biol. 2001;239:132–47.
Holland PW, Koschorz B, Holland LZ, Herrmann BG. Conservation of Brachyury (T) genes in amphioxus and vertebrates: developmental and evolutionary implications. Development. 1995;121:4283–91.
Terazawa K, Satoh N. Spatial expression of the amphioxus homolog of Brachyury (T) gene during early embryogenesis of Branchiostoma belcheri. Dev Growth Differ. 1995;37:395–401.
Terazawa K, Satoh N. Formation of the chordamesoderm in the amphioxus embryo: Analysis with Brachyury and fork head/HNF-3 genes. Dev Genes Evol. 1997;207:1–11.
Dehal P, Satou Y, Campbell RK, Chapman J, Degnan B, De Tomaso A, et al. The draft genome of Ciona intestinalis: Insights into chordate and vertebrate origins. Science. 2002;298:2157–67.
Papaioannou VE. T-box genes in development: from hydra to humans. Int Rev Cytol. 2001;207:1–70.
Papaioannou VE. The T-box gene family: emerging roles in development, stem cells and cancer. Development. 2014;141:3819–33.
Simakov O, Kawashima T, Marletaz F, Jenkins J, Koyanagi R, Mitros T, et al. Hemichordate genomes and deuterostome origins. Nature. 2015;527:459–65.
Sea Urchin Genome Sequencing C, Sodergren E, Weinstock GM, Davidson EH, Cameron RA, Gibbs RA, et al. The genome of the sea urchin Strongylocentrotus purpuratus. Science. 2007;314:941–52.
Baughman KW, McDougall C, Cummins SF, Hall M, Degnan BM, Satoh N, et al. Genomic organization of Hox and ParaHox clusters in the echinoderm, Acanthaster planci. Genesis. 2014;52:952–8.
Putnam NH, Butts T, Ferrier DE, Furlong RF, Hellsten U, Kawashima T, et al. The amphioxus genome and the evolution of the chordate karyotype. Nature. 2008;453:1064–71.
Huang SF, Chen ZL, Yan XY, Yu T, Huang GR, Yan QY, et al. Decelerated genome evolution in modern vertebrates revealed by analysis of multiple lancelet genomes. Nat Commun. 2014;5:5896.
Denoeud F, Henriet S, Mungpakdee S, Aury JM, Da Silva C, Brinkmann H, et al. Plasticity of animal genome architecture unmasked by rapid evolution of a pelagic tunicate. Science. 2010;330:1381–5.
Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, et al. The sequence of the human genome. Science. 2001;291:1304–51.
Inoue J, Sato Y, Sinclair R, Tsukamoto K, Nishida M. Rapid genome reshaping by multiple-gene loss after whole-genome duplication in teleost fish suggested by mathematical modeling. Proc Natl Acad Sci U S A. 2015;112:14918–23.
Altschul SF, Madden TL, Schaffer AA, Zhang JH, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucleic Acids Res. 1997;25:3389–402.
Sato Y, Hashiguchi Y, Nishida M. Temporal pattern of loss/persistence of duplicate genes involved in signal transduction and metabolic pathways after teleost-specific genome duplication. BMC Evol Biol. 2009;9:127.
Katoh K, Kuma K, Toh H, Miyata T. MAFFT version 5: improvement in accuracy of multiple sequence alignment. Nucleic Acids Res. 2005;33:511–8.
Capella-Gutierrez S, Silla-Martinez JM, Gabaldon T. trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses. Bioinformatics. 2009;25:1972–3.
Suyama M, Torrents D, Bork P. PAL2NAL: robust conversion of protein sequence alignments into the corresponding codon alignments. Nucleic Acids Res. 2006;34:W609–12.
Stamatakis A. RAxML version 8: a tool for phylogenetic analysis and post-analysis of large phylogenies. Bioinformatics. 2014;30:1312–3.
Yang Z. Estimating the pattern of nucleotide substitution. J Mol Evol. 1994;39:105–11.
Yang Z. Maximum likelihood phylogenetic estimation from DNA sequences with variable rates over sites: approximate method. J Mol Evol. 1994;39:306–14.
Whelan S, Goldman N. A general empirical model of protein evolution derived from multiple protein families using a maximum-likelihood approach. Mol Biol Evol. 2001;18:691–9.
Wilkinson DG, Bhatt S, Herrmann BG. Expression pattern of the mouse T gene and its role in mesoderm formation. Nature. 1990;343:657–9.
Gentsch GE, Owens NDL, Martin SR, Piccinelli P, Faial T, Trotter MWB, et al. In vivo T-Box transcription factor profiling reveals joint regulation of embryonic neuromesodermal bipotency. Cell Rep. 2013;4:1185–96.
Hedges SB, Marin J, Suleski M, Paymer M, Kumar S. Tree of life reveals clock-like speciation and diversification. Mol Biol Evol. 2015;32:835–45.
EnsemblMetazoa33. Ensembl, Hinxton. 2016. http://metazoa.ensembl.org/index.html. Accessed 23 Apr 2016.
Hemichordate Genomes. Okinawa Institute of Science and Technology Graduate University, Okinawa. 2016. https://groups.oist.jp/molgenu/hemichordate-genomes. Accessed 7 Apr 2016.
MarinegenomicsDB. Okinawa Institute of Science and Technology Graduate University, Okinawa. 2013. http://marinegenomics.oist.jp/gallery. Accessed 25 Apr 2016.
Branchiostoma floridae-JGI Genome Portal, v1. Joint Genome Institute, Walnut Creek. 2014. http://genome.jgi.doe.gov/Brafl1/Brafl1.home.html. Accessed 15 Jul 2016.
LanceletDB, v18h27r3. Sun Yatsen University, Guangzho. 2014. http://genome.bucm.edu.cn/lancelet/gbrowser_wel.php. Accessed 15 Jul 2016.
OikoBase. University of Bergen, Bergen. 2013. http://oikoarrays.biology.uiowa.edu/Oiko. Accessed 23 Apr 2016.
Botryllus schlosseri Genome Project. Stanford University, Stanford. 2013. http://botryllus.stanford.edu/botryllusgenome. Accessed 24 Apr 2016.
Ensembl79. Ensembl, Hinxton. 2015. http://mar2015.archive.ensembl.org/index.html. Accessed 5 May 2015.
We thank Dr. Steven D. Aird for editing the manuscript.
The work was supported in part by JSPS grants to NS (16H04824), YY (26650087 and 16K21559), HT(15K07174), and JI (15K07172).
Availability of data and materials
All data generated or analyzed during this study are included in this published article.
NS and YY conceived the project and JI performed analyses. NS, JI, YY, and HT prepared the manuscript. All authors read and approved the final manuscript.
The authors declare that they have no competing interests.
Consent for publication
Ethics approval and consent to participate
cDNA sequence alignment of T-box family members analyzed in the phylogenetic analysis (Additional file 2). Identity to one of the query gene sequences (underlined) is denoted by dots. Only unambiguously aligned sites are presented (525 out of 16,677 sites). Insertions/deletions of specific nucleotides are indicated by dashes. (PDF 9491 kb)
Molecular phylogenies of T-box family members based on a DNA dataset comprising 350 unambiguously aligned sites (excluding 3rd codon positions) (a) and based on an amino acid dataset comprising 175 sites (b). In both trees, the Brachyury family consistently forms a distinct clade among T-box family members. The resulting tree obtained from reanalysis using only this portion is shown in Fig. 2 and Additional file 4. Query sequences used for the BLAST search are marked with black dots. (PDF 551 kb)
cDNA sequence alignment of Brachyury genes (2625 sites in total) analyzed in the phylogenetic analyses (Fig. 2 and Additional file 4). The alignment was constructed using the selected gene sequences from the estimated tree of T-box-containing genes (Additional file 2). Unambiguously aligned sites indicated by a 1 (on the top of the sequences, 1308 sites) were used for the analyses. (PDF 3214 kb)
Molecular phylogeny of Brachyury family members based on an amino acid dataset comprising 436 unambiguously aligned sites. Arrowheads indicate topological incongruities with the tree obtained from comparisons of nucleotides (Fig. 2). Probably due to the short length of the analyzed sequence, the Botryllus schlosseri (urochordate) gene was placed as a sister lineage of a clade comprising cephalochordate and vertebrate genes. (PDF 150 kb)
Conserved synteny around Brachyury in deuterostomes. (DOCX 125 kb)