- Open Access
Did homeobox gene duplications contribute to the Cambrian explosion?
Zoological Letters volume 1, Article number: 1 (2015)
The Cambrian explosion describes an apparently rapid increase in the diversity of bilaterian animals around 540–515 million years ago. Bilaterian animals explore the world in three-dimensions deploying forward-facing sense organs, a brain, and an anterior mouth; they possess muscle blocks enabling efficient crawling and burrowing in sediments, and they typically have an efficient ‘through-gut’ with separate mouth and anus to process bulk food and eject waste, even when burrowing in sediment. A variety of ecological, environmental, genetic, and developmental factors have been proposed as possible triggers and correlates of the Cambrian explosion, and it is likely that a combination of factors were involved. Here, I focus on a set of developmental genetic changes and propose these are part of the mix of permissive factors. I describe how ANTP-class homeobox genes, which encode transcription factors involved in body patterning, increased in number in the bilaterian stem lineage and earlier. These gene duplications generated a large array of ANTP class genes, including three distinct gene clusters called NK, Hox, and ParaHox. Comparative data supports the idea that NK genes were deployed primarily to pattern the bilaterian mesoderm, Hox genes coded position along the central nervous system, and ParaHox genes most likely originally specified the mouth, midgut, and anus of the newly evolved through-gut. It is proposed that diversification of ANTP class genes played a role in the Cambrian explosion by contributing to the patterning systems used to build animal bodies capable of high-energy directed locomotion, including active burrowing.
The number, diversity, and disparity of fossilized animal forms are far greater in deposits dating to the early Cambrian  than found in any earlier deposits. Although some have claimed that this difference reflects fossilization bias, the majority view is that most of the diversity of animal life increased in the early Cambrian, a phenomenon called the Cambrian explosion. As recently reviewed  a multitude of factors have been proposed as candidates for triggering the observed diversification, including biotic and abiotic factors. These authors conclude that was not likely a single cause, but rather a cascade of interconnected environmental and ecological changes, including sea level rise generating habitable shallow-water seas, submarine erosion releasing calcium and phosphate, and exploitation of calcium by animals for biomineralization.
When considering this multitude of factors that contributed to the diversification of animals, internal constraints should not be ignored. Unless the appropriate genes, and their associated developmental pathways, are in place to build complex and adaptable body plans, no combination of environmental factors alone could have stimulated the radiation of the animal kingdom. Based on discovery of the ParaHox gene cluster (a set of three genes related to the better known Hox genes), I previously proposed that generation of these new Hox-related homeobox genes in early animal evolution permitted elaboration of the bilaterian body, especially in allowing evolution of distinct mouth and anus, and that this was instrumental to the diversification of animal life in the Cambrian. We suggested that “the origin of distinct Hox and ParaHox genes by gene-cluster duplication facilitated an increase in body complexity during the Cambrian explosion” . This idea was expanded more generally, ‘the origin of three germ-layers, bilateral symmetry and a through gut also probably involved gene duplication….. This event may provide a partial genetic explanation for the Cambrian explosion’ . The relevance of the through-gut to the Cambrian explosion, but without reference homeobox genes, was also later stressed by Cavalier-Smith  who wrote “Very likely this anal breakthrough …. stimulated the long-puzzling Cambrian explosion”.
Here I revisit this hypothesis, explaining the basis for its proposal and evaluating its robustness in the light of additional data accumulated over the past 15 years. I conclude that there was indeed an expansion of the ANTP class of homeobox genes during early animal evolution, but on a larger scale than originally thought. This large expansion generated not only the Hox and ParaHox genes, but also the NK homeobox genes and their relatives, and I suggest that these genes were recruited for roles in patterning ectoderm, gut and mesoderm. These patterning genes, and others, permitted the evolution of animal body plans capable of active, directed locomotion and, due to evolution of a through-gut with a distinct anus, active burrowing and feeding in sediments. These internal factors acted alongside, or prior to, external stimuli to catalyse the Cambrian explosion.
What exploded in the Cambrian?
The fossils of the Cambrian record a diversification of animal life, but how does this relate to the phylogeny or evolutionary tree of the animal kingdom? In other words, which clade or clades of animals actually diversified? In 1857, Darwin wrote to his friend Thomas Henry Huxley “The time will come, I believe, though I shall not live to see it, when we shall have very fairly true genealogical trees of each great kingdom of nature” . Darwin did not live to see it, and indeed reconstruction of the ‘fairly true’ outline of metazoan phylogeny was not achieved until the late 1990s [6,7]. After more than a century of argument, there is remarkable consensus today about the general framework of animal phylogeny: a framework originally built from ribosomal DNA sequences, but now corroborated by DNA sequences from hundreds of genes [8-10]. The broadly-accepted scheme, sometimes called the ‘New Animal Phylogeny’, sees four animal phyla branching off basally (Porifera, Ctenophora, Placozoa, Cnidaria – not necessarily in that order). All animals apart from the basal four phyla belong to the Bilateria, also called triploblasts, which can be divided into three great clades: the Ecdysozoa (moulting animals, including arthropods, onychophorans, tardigrades, nematodes, priapulids, and others), the Lophotrochozoa (including molluscs, annelids, platyhelminths, nemerteans, brachiopods, bryozoans, and more) and the Deuterostomes (chordates, echinoderms, hemichordates).
The bilaterians are characterized, at least primitively, by possession of an anteroposterior axis (head to tail), a dorsoventral axis (top to bottom) and a left-right axis. Most bilaterians have a centralized nerve cord with a brain, and sense organs concentrated at a clear front end. They (primitively) have a through-gut with two openings, ingesting food through a mouth and passing food unidirectionally before ejection of waste through an anus. Either side of the gut are muscle blocks capable of contorting the body, often acting antagonistically to hydrostatic skeletons or hard parts. These characters are in sharp contrast to the four basal phyla, none of which have cephalization, brains, centralized nerve cords, or through-guts. The cnidarians, for example, have a single gastric opening which acts as both mouth and anus (though there are some variations on the theme). In general, the bilaterians are animals characterized by very active locomotion in a clear direction. These are the animals that explore the world in three-dimensions with forward-facing sense organs, a brain, and an anterior mouth, that possess muscle blocks to allow crawling, burrowing or even swimming, and that have an efficient gut system to process food and eject waste in the wake of the moving animal. Some cnidarians also burrow, but due to the absence of an anus and defined muscle blocks, their burrowing is not in such a directed, efficient, or high-energy manner. The bilaterians are truly the animals that dominate the three-dimensional world (for further discussion see ).
Over twenty years ago, Conway Morris argued that the Cambrian explosion was essentially a bilaterian phenomenon . Most Ediacaran fossils can be assigned to a ‘grade’ of organisation comparable with that found in cnidarians, or if they were not cnidarians then they were also not clearly bilaterians either. In contrast, the majority of animals that characterise the Burgess Shale-type deposits of the middle Cambrian, or the earlier small shelly faunas, are bilaterian. Details of this idea have been challenged several times, but overall it holds considerable weight. For example, comparisons between living animals indicate that the latest common ancestor of all living bilaterians (LCAB) had a brain, anterior sense organs, lateral muscle blocks, central nervous system, and a through-gut [13,14]. An animal with such a body organisation should be capable of active directed locomotion, including burrowing, and logically such animals should leave traces in the form of burrows and directional tracks. Putative trace fossils have been reported from 560 million years ago, 20 million years before the Cambrian, but the clearest of these are surface traces; the first complex horizontal and vertical burrows date to the base of the Cambrian itself [15,16]. One would also deduce that the first bilaterians were not particularly small, since today the microscopic meiofaunal animals are secondarily derived from larger animals, and are highly modified. Furthermore, comparison of developmental genes involved in mesoderm and heart development suggest that the LCAB had a circulatory system , which is not compatible with that ancestor being a microscopic organism. The LCAB should, therefore, have been at least several millimetres – perhaps tens of millimetres – in size. Hence, the trace fossil record should be a very good indicator of whether bilaterians, or at least bilateral animals that postdate the LCAB, were present.
Putting the fossil evidence (including trace fossil evidence) together with comparative anatomy and developmental biology, I argue that the latest common ancestor of living bilaterians dates to the base of the Cambrian and not significantly before. There are lines of evidence that dissent from this view, notably dating efforts using molecular clocks that have placed the LCAB much earlier [18,19]. Such analyses are highly sophisticated, but since we do not fully understand the factors that dictate mutation and substitution rates in genes, one has to ask if this evidence is strong enough to challenge the robust paleontological record. On balance, I favour the view that the Cambrian explosion represents a true diversification of the Bilateria, from a LCAB that existed at or only marginally before the base of the Cambrian. An alternative view would place the LCAB (with its large size, circulatory system, central nervous system, muscles, and through-gut) rather earlier, and the Cambrian explosion would represent diversification of descendent lineages.
The anus of fortune
The earliest stage of the Cambrian is the Fortunian, named after the town of Fortune in Newfoundland, Canada. Deposits dating to this period, around 540–530 million years ago, show trace fossils including indications of animals that had directed locomotion, sinusoidal movement and some capacity for burrowing, including the diagnostic mud-burrowing trace fossil Trichophycus pedum [16,20]. It is thought that animal movement and burrowing caused break-up of the dominant microbial mat faunas, mixing of anoxic and oxic layers of sediments, and an increase in the habitable zone by animals. We may ask which of the bilaterian characters were instrumental in permitting these new forms of animal behaviour? Furthermore, which genes or developmental processes were necessary to permit these characters to evolve? There are living cnidarians that burrow, but these are mostly restricted to very soft mud and they do not mix sedimentary layers extensively. In contrast, many bilaterian ‘worms’ dig, delve, and devour their way through compacted sediments the world over.
The efficiency of bilaterian burrowing rests on three key features: muscle, skeletons, and a through-gut. Muscles provide the power force necessary for high-energy burrowing, and skeletons (which may be composed of hard parts or more usually of fluid-filled cavities) provide incompressible structures ensuring that muscular forces do not simply make bodies shrink but transmit energy against the surrounding substrate. Through-guts allow feeding to be combined with locomotion through sediments. Animals with these characters can truly explore their world in three dimensions, leaving their waste products behind. A centralised nervous system with anterior brain and sense organs to face the on-coming world is a logical set of adaptations to complement this body plan.
As for intrinsic factors necessary for animals to develop these characters, the key must be sets of genes that control spatial patterning of mesoderm (including muscle), endoderm (forming most of the gut) and ectoderm (including the nervous system). There are many candidates for such genes, including those encoding transcription factors from the Fox, Pax, homeobox, zinc finger and T-box superclasses and also genes coding for secreted molecules or their receptors. In this article I focus on homeobox genes , but by doing so I am not suggesting that other types of genes are less important. Indeed, it is likely that complex networks of genes needed to be pieced together during evolution to allow development of particular features. It is also worth noting that it is not necessarily the origin of the genes themselves that is the key permissive step for evolution of a particular anatomical feature, but rather it could be the co-option of those genes into regulatory networks associated with a function. In other words, it is not the birth of the genes that matters, but their deployment. With this background in mind, I will summarize recent data concerning the evolution of homeobox gene functions and their association with patterning of mesoderm, ectoderm and endoderm, including genes for specifying the mouth, central gut, and anus of the bilaterian through-gut.
The ANTP homeobox genes: Hox, ParaHox, NK and relatives
When homeobox genes are mentioned, many biologists will think of Hox genes – a well known set of genes famous for their roles in body patterning and their astonishing evolutionary conservation. First discovered in the fruit fly Drosophila, mutated Hox mutated typically cause ‘homeotic’ transformations, when one part of the body is transformed into another, such as legs growing where antennae should be. In many animals these genes are arranged in ‘gene clusters’, meaning that Hox genes are neighbours on a chromosome. This in itself is quite unusual and central to how Hox genes are regulated. But Hox genes are just one type of homeobox gene. They are the tip of the homeobox iceberg . For example, Drosophila has over a hundred homeobox genes, yet only eight of these are typical Hox genes. Similarly, the human genome has over 200 homeobox loci, of which just 39 are Hox genes [22,23]. To make sense of the diversity of homeobox genes, they can be subdivided into gene classes and gene families. Of the 11 homeobox gene classes in animals, the one that contains most of the genes implicated in body patterning is the ANTP class. This contains the Hox genes, but also some other closely related homeobox genes including ParaHox genes, NK genes and various others (notably Barhl, Barx, Bsx, Dbx, Dlx, Emx, En, Evx, Gbx, Hhex, Hlx, Meox, Mnx, Msx, Nanog, Noto, Vax and Ventx ).
The ParaHox genes are particularly intriguing. It was noted in the 1990s that some homeobox genes that are not part of Hox gene clusters are especially similar in DNA sequence to Hox genes, and for some time their evolutionary origins remained unclear. Indeed, for three of these genes – Gsx, Xlox (also called Pdx) and Cdx – their DNA sequences are more similar to some Hox genes than many Hox genes are to each other. They are ‘Hox-like,’ but not in the Hox cluster. The paradox was solved, or at least clarified, in 1998 when we found that these three genes were arranged in their own gene cluster, at least in one animal taxon – the cephalochordate amphioxus . Subsequently, clustering of these three genes has also been found in vertebrates (except teleost fish ) and in an echinoderm, the sea star Patiria miniata . The discovery of the gene cluster in amphioxus, together with phylogenetic analysis of Hox and ParaHox sequences, led to our proposal that Hox and ParaHox gene clusters were generated by an ancient gene cluster duplication event, from a hypothetical progenitor ‘ProtoHox’ gene cluster. Further studies in other taxa have added support to this model, with the main issue of contention simply being whether the hypothetical ProtoHox gene cluster had two, three or four homeobox genes [27,28]. Not all animals have retained the ParaHox genes in a clustered arrangement, and some taxa have lost particular ParaHox genes, but the original bilaterians must have possessed an intact ParaHox gene cluster. There was also confusion for some years over the timing of the deduced Hox-ParaHox duplication event, but it is now clear that it was not an event that characterised the bilaterians uniquely, since it occurred before the divergence of cnidarians and bilaterians . The implication is that the origin of Hox genes and the simultaneous origin of ParaHox genes occurred before the origin of the bilaterian body plan.
What these findings mean for the evolution of body patterning, and indeed the Cambrian explosion, must also take into account another key set of ANTP class homeobox genes, in another gene cluster: the NK homeobox genes. These genes have been studied in most detail in the fruit fly, Drosophila, where they form a third and quite separate gene cluster, containing the genes slo (NK1), tin (NK4), bap (NK3), Lbx and Tlx . All five genes are ancient and date to the base of the Bilateria, or indeed earlier, and are well conserved in a range of taxa. Although the NK gene cluster has broken up in amphioxus and vertebrates of the Phylum Chordata , there is evidence from comparison of many genomes that the five-gene cluster found in Drosophila is a remnant of an even larger gene cluster. The ancestral bilaterian NK gene cluster most likely contained the five homeobox genes noted above, plus several others including Msx, NK5 and NK6 [21,32-34]. Phylogenetic analysis reveals that the Hox and ParaHox are more closely related to each other than they are to the NK cluster. Sponges have several NK homeobox genes but not definite Hox or ParaHox genes , and it has been suggested that Hox and ParaHox genes were generated from an ancestral NK-like cluster by tandem gene duplication (Figure 1).
To summarize, before the emergence of bilaterian animals, three distinct clusters of ANTP class homeobox genes arose: Hox, ParaHox and NK homeobox genes. Additional ANTP class genes were left scattered as neighbours of the Hox and NK homeobox genes. The simplest model to explain the origins of all these genes, now supported by a large amount of data from diverse taxa, is that a hypothetical ancestral ANTP class gene underwent extensive tandem gene duplication, spreading a series of related genes along a chromosome (Figure 1). From this large array of genes, many genes subsequently became dispersed in the genome, but there remained three ‘islands’ of gene clustering - NK, Hox and ParaHox - each with a subtly different developmental role, discussed below. The three gene clusters have been conserved to different extents in different evolutionary lineages. To take two extremes, vertebrates have very compact and conserved Hox and ParaHox gene clusters but disrupted NK gene clusters, while dipteran flies have a disrupted Hox gene cluster and have lost one of the three ParaHox genes, but have retained a tight NK gene cluster.
The germ-layer hypothesis
The presence of three ANTP class gene clusters has an intriguing parallel to the three germ layers of bilaterian animals. This relationship had been outlined previously  and is expanded below. The Hox genes of vertebrates and the fruit fly, the first to be analysed, are deployed to pattern the embryonic mesoderm and ectoderm (especially neuroectoderm). In mammals, for example, mutation of Hox genes causes profound disruption to patterning of the vertebral column, which develops from the mesodermal somites, and also the nerve cord. From these data alone, it might reasonably be thought that, in the common ancestor of insects and vertebrates, Hox genes patterned ectoderm and mesoderm. However, early studies on amphioxus and ascidians reviewed by , and later work on molluscs and annelids [37,38], has highlighted that the most consistent picture is for Hox genes to have anteroposterior spatial patterns in ectoderm only. Mesoderm expression, when present, rarely shows a clear anteroposterior domain, except in vertebrates and insects. Perhaps the original role of Hox genes in Bilateria, therefore, was to encode positional information just in the ectoderm? Interestingly, the Hox genes do not pattern the most anterior parts of the brain, but other homeobox genes originally linked chromosomally to Hox genes (such as Emx, En and Gbx) are deployed in these regions.
With the discovery of the ParaHox gene cluster in amphioxus, an intriguing parallel was found. Two of the ParaHox cluster genes, Cdx and Xlox (Pdx) were found to be predominantly expressed in the gut . Cdx is consistently expressed in the most posterior part of the body, around the anus and also in other tissues, in all bilaterian animals examined (Figures 2 and 3). Xlox is necessary for development of the endoderm of the presumptive pancreas and duodenum of vertebrates [39,40], and is expressed in a region of midgut endoderm in amphioxus (Figure 2) and leeches [2,41]. The same has also been shown recently for an echinoderm . We proposed, therefore, that after duplication of the hypothetical ProtoHox gene cluster to generate Hox and ParaHox, one gene cluster was deployed to pattern ectoderm and the other to pattern endoderm (or more accurately gut, since the anterior and posterior extremities of the gut are not generally considered endoderm ). There were, however, two complications to this model. First, while there are many Hox genes in most bilaterians (around 10 being normal for an invertebrate), there are only three ParaHox genes leaving little scope for regionalising the entire gut. Second, while the Cdx gene is associated with the anus and Xlox associated with the midgut, the third ParaHox gene (Gsx) was not expressed in the mouth in amphioxus . Instead, this gene was found to be expressed in the amphioxus brain, as indeed are its two homologues in vertebrates (Figure 2). Our proposal to cope with this uncooperative fact was that gene expression had changed in the ancestry of amphioxus and vertebrates . This suggestion was expanded upon later, as follows. For over a century there had been a suggestion that deuterostomes (the clade to which amphioxus and vertebrate belong) have invented a ‘new mouth’ during evolution; if true, then lack of expression in the forming mouth of amphioxus and vertebrates might have no bearing on the ancestral role .
The testable prediction was that analysis of the Gsx gene in protostome taxa - animals which are proposed to have retained the original mouth opening - should reveal expression in the oral region. This was not the case for the fruit fly, Drosophila, and the gene has been lost in the nematode Caenorhabditis, but both these taxa have many derived features of their development. It was therefore very informative when Gsx expression was studied carefully in a spiralian protostome (the mollusc Gibbula), since a clear ring of expression was seen around the oral cavity, albeit alongside expression in the nervous system  (Figure 3). The authors noted ‘Our results support Holland’s hypothesis that ParaHox genes are involved in gut regionalization and offer further support to the ancestral mouth patterning role of Gsx in protostomes’ . Perhaps the original role of ParaHox genes in Bilateria, therefore, was to encode positional information in the gut?
Above I noted that Hox and ParaHox are retained clusters from a larger array of ANTP class homeobox genes, alongside a third gene cluster, NK. The expression and function of these genes have been less well-studied, but there are indications that mesodermal expression is a common property . For example, the NK4 or tinman (tin) gene of Drosophila is first expressed in all mesoderm cells and later becomes restricted to the dorsal region of the mesoderm. The key role is in development of the insect pulsatile vesicle (or ‘heart’) derived from these cells; the name tinman refers to the heart-deficient mutant fly, named after a character in the Wizard of Oz . In vertebrates there are usually three vertebrate orthologues of NK4, called Nkx2-3, Nkx2-5 and Nkx2-6; these are expressed in the vertebrate heart in at least some species, although there may be functional overlap . Nkx2-5 in particular is necessary for correct heart development and mutations in this gene are associated with heart defects in humans . The NK3 or bagpipe gene is involved in Drosophila visceral mesoderm development , and its vertebrate orthologues are expressed in heart, plus other tissues including salivary and prostate glands and developing somites [48-50]. The NK1 or slouch gene has roles in insect somatic muscle development and is also expressed in the nervous system [30,51]. The mouse homologues Nkx1-1 (also called Sax2) and Nkx1-2 (Sax1) break the pattern of mesodermal function, being predominantly expressed in neural tissue [52,53], with Nkx1-2 in the brain and implicated in hormonal control of food intake . The general trend of mesodermal expression and function, however, continues to some genes no longer in the NK gene cluster of Drosophila, but deduced to have been present in the original NK gene cluster of the basal bilaterian, such as Msx (muscle segment homeobox). Although there is no suggestion of anteroposterior restriction, in contrast to Hox and ParaHox, it seems clear that NK homeobox genes are primarily expressed in subsets of the mesoderm. There are additional sites of expression, and many non-mesodermal functions, but on balance it seems likely that the original role of NK homeobox genes was connected with specialisation of mesoderm.
The base of the Cambrian was marked by the evolution of active, efficient, burrowing, and directed locomotion along and beneath the substrate. This in turn transformed an essentially two-dimensional world, dominated by microbial mats, into a three-dimensional world. It is argued here that for the evolution of this ‘ecological engineering’ , the early bilaterians required muscle, skeletons, a through-gut, and a centralised nervous system with anterior brain and sense organs. The through-gut, with distinct mouth, midgut, and anus is highlighted here as particularly important for ecological change. The developmental genetic evidence, together with comparative anatomy, indicates that these key anatomical characters were all in place in the most recent common ancestor of living bilaterians, and hence must have evolved on the bilaterian stem lineage, not independently in different bilaterians.
How did these anatomical characters, so pivotal for the Cambrian explosion, arise in evolution? The expansion of the ANTP class of homeobox genes has been traced, and is shown to have generated an array of regulatory factors, containing three key gene clusters: Hox, NK and ParaHox. The proposal, which has been developed gradually over the past 15 years, is that these three sets of new genes were recruited in evolution for patterning the development of the nervous system, the development of muscle and development of a through-gut in the ancestors of bilaterians, thereby permitting the evolution of animals capable of active directed locomotion, including burrowing through the substrate. Without this molecular evolution, there could be no Cambrian explosion. Even when environmental changes, such as sea level changes occurred, exploitation of new niches required the genetic machinery to build complex body plans.
It should be highlighted that the ANTP class homeobox genes form just part of the genetic architecture used for patterning these structures, and that many other genes were also involved. These include the Pax genes, some of which are involved in mediolateral specialisation of nervous system and mesoderm, and in formation of sense organs, the Fox genes implicated in endoderm and mesoderm patterning, the T-box genes and more [54-58]. Ultimately, without the diversity of transcription factor genes generated in early animal evolution, it is argued that no bilaterian evolution was possible.
Smith MP, Harper DAT: Causes of the Cambrian Explosion. Science 2013, 341:1355–1356.
Brooke NM, Garcia-Fernandez J, Holland PWH: The ParaHox gene cluster is an evolutionary sister of the Hox gene cluster. Nature 1998, 392:920–922.
Holland PWH: Major transitions in animal evolution: A developmental genetic perspective. Am Zool 1998, 38:829–842.
Cavalier-Smith T: Cell evolution and Earth history: stasis and revolution. Philos Trans R Soc Lond B Biol Sci 2006, 361:969–1006.
Darwin CR (Ed): Letter to TH Huxley. In ‘The Correspondence of Charles Darwin’. 1990 edition. Cambridge University Press: Frederick Burkhardt and Sydney Smith; 1857.
Aguinaldo AMA, Turbeville JM, Linford LS, Rivera MC, Garey JR, Raff RA, Lake JA: Evidence for a clade of nematodes, arthropods and other moulting animals. Nature 1997, 387:489–493.
Halanych KM, Bacheller JD, Aguinaldo AM, Liva SM, Hillis DM, Lake JA: Evidence from 18S ribosomal DNA that the lophophorates are protostome animals. Science 1995, 267:1641–1643.
Dunn CW, Hejnol A, Matus DQ, Pang K, Browne WE, Smith SA, Seaver E, Rouse GW, Obst M, Edgecombe GD, Sørensen MV, Haddock SHD, Schmidt-Rhaesa A, Okusu A, Kristensen RM, Wheeler WC, Martindale MQ, Giribet G: Broad phylogenomic sampling improves resolution of the animal tree of life. Nature 2008, 452:745–749.
Jiménez-Guri E, Philippe H, Okamura B, Holland PWH: Buddenbrockia is a cnidarian worm. Science 2007, 317:116–118.
Philippe H, Lartillot N, Brinkmann H: Multigene analyses of bilaterian animals corroborate the monophyly of Ecdysozoa, Lophotrochozoa, and Protostomia. Mol Biol Evol 2005, 22:1246–1253.
Holland PWH: The Animal Kingdom: a Very Short Introduction. Oxford, UK: Oxford University Press; 2011.
Conway Morris S: The fossil record and the early evolution of the Metazoa. Nature 1993, 361:219–225.
De Robertis EM, Sasai Y: A common plan for dorsoventral patterning in Bilateria. Nature 1996, 380:37–40.
Scott MP: Intimations of a creature. Cell 1994, 79:1121–1124.
Erwin DH, Laflamme M, Tweedt SM, Sperling EA, Pisani D, Peterson KJ: The Cambrian Conundrum: Early Divergence and Later Ecological Success in the Early History of Animals. Science 2011, 334:1091–1097.
Seilacher A, Buatois LA, Mángano GM: Trace fossils in the Ediacaran–Cambrian transition: Behavioral diversification, ecological turnover and environmental shift. Palaeogeogr Palaeoclimatol Palaeoecol 2005, 227:323–356.
Bodmer R, Venkatesh TV: Heart development in Drosophila and vertebrates: Conservation of molecular mechanisms. Dev Gene 1998, 22:181–186.
Blair JE, Hedges SB: Molecular phylogeny and divergence times of deuterostome animals. Mol Biol Evol 2005, 22:2275–2284.
Peterson KJ, Cotton JA, Gehling JG, Pisani D: The Ediacaran emergence of bilaterians: congruence between the genetic and the geological fossil records. Philos Trans R Soc Lond B Biol Sci 2008, 363:1435–1443.
Dzik J: Behavioral and anatomical unity of the earliest burrowing animals and the cause of the “Cambrian explosion”. Paleobiology 2005, 31:503–521.
Holland PWH: Evolution of homeobox genes. WIREs Deve Biol 2013, 2:31–45.
Holland PWH, Booth HAF, Bruford EA: Classification and nomenclature of all human homeobox genes. BMC Biol 2007, 5:47.
Zhong Y-F, Holland PWH: The dynamics of vertebrate homeobox gene evolution: gain and loss of genes in mouse and human lineages. BMC Evol Biol 2011, 11:169.
Zhong Y-F, Holland PWH: HomeoDB2: functional expansion of a comparative homeobox gene database for evolutionary developmental biology. Evolution Dev 2011, 13:567–568.
Mulley JF, Chiu CH, Holland PWH: Breakup of a homeobox cluster after genome duplication in teleosts. Proc Natl Acad Sci U S A 2006, 103:10369–10372.
Annunziata R, Martinez P, Arnone M: Intact cluster and chordate-like expression of ParaHox genes in a sea star. BMC Biol 2013, 11:68.
Garcia-Fernandez J: Hox, ParaHox, Proto Hox: facts and guesses. Heredity 2005, 94:145–152.
Lanfear R, Bromham L: Statistical Tests between Competing Hypotheses of Hox Cluster Evolution. Syst Biol 2008, 57:708–718.
Hui JHL, Holland PWH, Ferrier DEK: Do cnidarians have a ParaHox cluster? Analysis of synteny around a Nematostella homeobox gene cluster. Evolution Dev 2008, 10:725–730.
Jagla K, Bellard M, Frasch M: A cluster of Drosophila homeobox genes involved in mesoderm differentiation programs. Bioessays 2001, 23:125–133.
Luke GN, Castro LFC, McLay K, Bird C, Coulson A, Holland PWH: Dispersal of NK homeobox clusters in amphioxus and humans. Proc Nat Acad Scie 2003, 100:5292–5295.
Pollard SL, Holland PWH: Evidence for 14 homeobox gene clusters in human genome ancestry. Current Biology 2000, 10:1059–1062.
Castro LFC, Holland PWH: Chromosomal mapping of ANTP class homeobox genes in amphioxus: piecing together ancestral genomes. Evolution Dev 2003, 5:459–465.
Garcia-Fernandez J: The genesis and evolution of homeobox gene clusters. Nat Rev Gene 2005, 6:881–892.
Larroux C, Fahey B, Degnan SM, Adamski M, Rokhsar DS, Degnan BM: The NK homeobox gene cluster predates the origin of Hox genes. Curr Biol 2007, 17:706–710.
Holland PWH, Garcia-Fernandez J: Hox genes and chordate evolution. Dev Biol 1996, 173:382–395.
Frobius AC, Matus DQ, Seaver EC: Genomic organization and expression demonstrate spatial and temporal hox gene colinearity in the lophotrochozoan Capitella sp I. PLoS One 2008, 3(4004):3.
Samadi L, Steiner G: Expression of Hox genes during the larval development of the snail, Gibbula varia (L.)-further evidence of non-colinearity in molluscs. Dev Gene Evol 2010, 220:161–172.
Jonsson J, Carlsson L, Edlund T, Edlund H: Insulin-promoter-factor1 is required for pancreas development in mice. Nature 1994, 371:606–609.
Offield MF, Jetton TL, Labosky PA, Ray M, Stein RW, Magnuson MA, Hogan BLM, Wright CVE: PDX-1 is required for pancreatic outgrowth and differentiation of the rostral duodenum. Development 1996, 122:983–995.
Wysocka-Diller J, Aisemberg G, Macagno E: A novel homeobox cluster expressed in repeated structures of the midgut. Dev Biol 1995, 171:439–447.
Holland PWH: Beyond the Hox: how widespread is homeobox gene clustering? J Anatomy 2001, 199:13–23.
Samadi L, Steiner G: Conservation of ParaHox genes’ function in patterning of the digestive tract of the marine gastropod Gibbula varia. BMC Dev Biol 2010, 10:74.
Bodmer R: The gene tinman is required for specification of the heart and visceral muscles in Drosophila. Development 1993, 118:719–729.
Tanaka M, Yamasaki N, Izumo S: Phenotypic characterization of the murine Nkx2.6 Homeobox gene by gene targeting. Mol Cell Biol 2000, 20:2874–2879.
Schott J-J, Benson DW, Basson CT, Pease W, Silberbach GM, Moak JP, Maron BJ, Seidman CE, Seidman JG: Congenital heart disease caused by mutations in the transcription factor NKX2-5. Science 1998, 281:108–111.
Azpiazu N, Frasch M: Tinman and bagpipe: two homeo box genes that determine cell fates in the dorsal mesoderm of Drosophila. Gene Dev 1993, 7:1325–1340.
Herbrand H, Pabst O, Hill R, Arnold H-H: Transcription factors Nkx3.1 and Nkx3.2 (Bapx1) play an overlapping role in sclerotomal development of the mouse. Mech Dev 2002, 117:217–224.
Schneider A, Brand T, Zweigerdt R, Arnold H-H: Targeted disruption of the Nkx3.1 gene in mice results in morphogenetic defects of minor salivary glands: parallels to glandular duct morphogenesis in prostate. Mech Dev 2000, 95:163–174.
Tanaka M, Lyons GE, Izumo S: Expression of the Nkx3.1 homobox gene during pre and postnatal development. Mech Dev 1999, 85:179–182.
Dohrmann C, Azpiazu N, Frasch M: A new Drosophila homeo box gene is expressed in mesodermal precursor cells of distinct muscles during embryogenesis. Gene Dev 1990, 4:2098–2111.
Rovescalli AC, Cinquanta M, Ferrante J, Kozak CA, Nirenberg M: The mouse Nkx-1.2 homeobox gene: Alternative RNA splicing at canonical and noncanonical splice sites. Proc Nat Acad Sci 2000, 97:1982–1987.
Simon R, Lufkin T: Postnatal lethality in mice lacking the Sax2 homeobox gene homologous to Drosophila S59/slouch: Evidence for positive and negative autoregulation. Mol Cell Biol 2003, 23:9046–9060.
Mansouri A, Hallonet M, Gruss P: Pax genes and their roles in cell differentiation and development. Curr Opin Cell Biol 1996, 8:851–857.
Mazet F, Amemiya CT, Shimeld SM: An ancient Fox gene cluster in bilaterian animals. Curr Biol 2006, 16:R314–R316.
Papaioannou VE, Silver LM: The T-box gene family. Bioessays 1998, 20:9–19.
Shimeld SM, Boyle MJ, Brunet T, Luke GN, Seaver EC: Clustered Fox genes in lophotrochozoans and the evolution of the bilaterian Fox gene cluster. Dev Biol 2010, 340:234–248.
Stoykova A, Gruss P: Roles of Pax-genes in developing and adult brain as suggested by expression patterns. J Neurosci 1994, 14:1395–1412.
I thank Jordi Garcia-Fernandez, Nina Brooke, Hiroshi Wada, Graham Luke, Sophie Harris-Watkins, Filipe Castro, Anne Booth, Jerome Hui, David Ferrier and Ying-fu Zhong for their contributions to the early phases of this research, Jody Holland and Sharon Cornwell for help with figures and text, and Leyli Samadi, Sebastian Shimeld, Ferdinand Marletaz, Jordi Paps, Ignacio Maeso, Paul Smith and Gabriela Mángano for useful discussions. The author’s research is funded from the European Research Council under the European Union’s Seventh Framework Programme (FP7/2007-2013)/ERC grant 11.
The author declares that he has no competing interests.