Many animals are venomous, but in most cases the exact proteins involved in causing pain or death are unknown, and even in those cases the genes producing them have not been identified, counted or mapped. If you’re interested in the evolution of venom, what its precursors are, and how venomous animals avoid poisoning themselves, you have to know this kind of stuff.
A new paper in Proc. Nat. Acad. Sciences (click screenshot below to read for free, or find the pdf here) answered several of these questions in the venomous caterpillar of the mottled cup moth (Doratifera vulnerans), shown below. It’s from Australia, and is described in Wikipedia this way:
It is known for its caterpillar having unique stinging spines or hairs that contain toxins, for which the scientific name is given that means “bearer of gifts of wounds”. Chemical and genetic analysis in 2021 show that its caterpillar contains 151 toxins, some of which have medicinal properties
That earlier paper, from 2021 and including some of the same authors as the one we discuss today, did indeed identify 151 proteins (peptide are bits of proteins or short chains of amino acids) that were in the toxins, but did not know which genes produced them, how the genes were arranged, what the closest relatives of the genes were, and how many of the 151 “toxins” were really toxic (the word “toxin” there and in the present paper do not mean that the substances were toxic, but that they were simply a component of the extracted toxins). However, the authors, some on the paper I’m highlighting today, did identify two genuine toxins that caused pain: the peptides Dv12 and Dv11.
Look at this thing! It’s clearly aposematic, meaning that it has bright warning coloration that predators can recognize and learn to avoid. And you can see those nasty spines. In the earlier paper they extracted toxins from related species and tested them by injecting them into mice tails, guinea pigs, and human volunteers. That earlier paper also adds this about the species name:
This species, whose binomial name etymologically means “bearer of painful gifts,” is a common culprit of caterpillar envenomations in Australia.
That means that many Aussies get stung by these things, probably inadvertently. Would you touch an animal that looks like this?:

On to the new paper, and I’ll try to be brief as it’s long and complicated.
1.) First, the authors sequenced the entire caterpillar genome (remember, it’s the same as the adult moth genome).
2.) Then, knowing the sequences of the proteins known from previous work on toxins, they could find the genes producing them by matching the protein sequence to the DNA sequence that could produce these proteins. Of the 151 proteins in caterpillar venom known from the prvious work, they mapped 149 of them to 115 sites in the genome
3.) Of the 115 sites, 35 were products of single genes, while 80 (70%) of the total, were members of gene families consisting of two or more similar genes (sometimes many genes) with similar sequences. Here’s a map of the “toxin gene” locations on the insect’s 13 chromosomes. The blue dots are the genes existing in single copies, orange dots are clusters of genes previously grouped together by protein-sequence similarity, and pink dots are genes that were newly identified, surely as part of gene families, in the present study. This conclusion comes from their sequence similarity and they physical grouping on two chromosomes.(The size of the dots indicates the number of genes that are part of a contiguous group. Click to enlarge:
So we know that genes found in venom are very often the product of gene duplications, either of single genes becoming two (this can happen via unequal crossing-over during meiosis or by other methods), producing two initially identical genes side by side or whole groups of them (“tandem duplications”). Once a gene has been duplicated, the original copy can then keep its original function, while the other copies, not being “needed,” are free to evolve other functions. Many genes we’re familiar with, like our own globins and immunoglobulins, evolved by gene duplication followed by divergence of the duplicated copies.
Where did the genes making venom proteins come from? This is the key evolutionary question answered here and, to some extent, in the previous paper. They evolved from ancestral genes in the moth’s immune system that evolved to attack microbes, the so-called “antimicrobial peptides” (AMPs), also known as cecropins. The ancestral AMP proteins, nearly identical to their original form and function, kill bacteria (prokaryotes) by disrupting the bacterial membranes. Insects still need to kill microbes!
Clearly, the proteins in venom have evolved by natural selection modifying ancestral genes used to kill bacteria. Now they are used to repel predators. Natural selection causing this divergence was implicated by looking at sequence differences, as there are ways of showing what sequence differences evolve faster than expected under either the slower processes of genetic drift or “purifying” selection that conserves structure. They found that most of the venom-adapted proteins that evolved from cecropins did evolve under natural selection, while the descendants of cecropins that retained their original anti-microbial proteins were under purifying selection to retain their sequence. It’s clear, then, that the insect still needs genes to attack bacteria. It’s just that some of them have been repurposed, often through gene duplication and divergence, to repel predators. (The authors have a way of assessing “pain” by measuring the increase in calcium concentration in cells grown in vitro and exposed to venom. This happens when the two investigated proteins are used.)
Here is a complicated family tree of cecropin genes in black used to kill microbes. The genes found in venom are in the red box (“venom adapted”). You can see that they are related to cecropin genes but branched off fairly recently (probably four or five million years ago). The venom genes are in the red box that I’ve added, and their relationship as being derived from ancestral AMP genes is very clear. (The “canonical” genes in green are antimicrobial proteins closely related in sequence to the venom genes.
So, now we know where the genes in venom come from. What we do not know is how many of those genes are essential in venom, either causing pain or doing other stuff that venom needs to do. At least two of them cause pain, but there are probably more, for they haven’t all been tested. And some of the other genes are probably involved in dismantling cell walls in potential predators. The authors tested several of the venom proteins and also found that, as in their AMP ancestors, they disrupt cell covering, in this case eukaryotic cell membranes.
Finally, the big question: If the caterpillar makes venom, why doesn’t it poison itself? Here’s how the authors answer that question (I’ve put the answer for this species is in bold).
Animals that produce toxins, either for innate immunity or as venom toxins, must employ strategies to protect themselves from toxicity. Such protective mechanisms include production of toxin inhibitors, storage in inactive form, mutations in their own ion channels that confer resistance, alteration of lipid bilayer compositions, and compartmentalization of toxins separate from body tissues. In the case of limacodid venom peptides, the venom is compartmentalized into the cuticle-lined venom reservoir inside venom spines, preventing the toxin from coming into contact with cells other than the secretory cells that produce them. Thus, compared to canonical cecropins, venom-adapted cecropins may also be released from pressure to avoid activity against animal cells.
There are other findings in the paper that will be of interest primarily to those studying genomic evolution. For example, many of the venom proteins still retain some weak antimicrobial activity, so the idea that genes completely lose their ancestral function when they gain a new one doesn’t hold in this case.
Below you can see the adult moth because, remember, they studied caterpillar venoms, and many of those genes are probably turned off in the adult. But adult and caterpillar carry the exact same genes, of course; their different bodies, physiology, and behavior rest on the differential turning on and off of these genes at different life stages. And that remains a big mystery: how do such different life stages evolve, with each step of the evolution being adaptive?
From The Australian Museum, photo credits at bottom (click to enlarge), image by Lyn Craggs.





























