The Cacao Genome Revolution

The Scientists Who Are Rewriting the DNA of Chocolate

A Fragile Tree in Peril

In a humid greenhouse in Costa Rica, a botanist gently opens a cacao pod, exposing wet white beans inside. Outside, the afternoon sun glints off lush cacao trees – but many bear wounds of disease. Some pods are covered in a velvety fungus that looks like frost, others blackened and mummified. Half a world away, on a small farm in Ghana, a cocoa farmer walks his plantation and counts the withering trees marked with red paint – each infected with a virus that swells the trunks and leaves them to die in slow motion. Scenes like these are playing out across the tropics. Chocolate’s essential ingredient comes from Theobroma cacao, a delicate tree that has fed civilizations and delighted the world for centuries. But that tree is in peril.

It’s hard to imagine that the future of chocolate could be at risk. Yet the cacao tree – despite its divine name meaning “food of the gods” – has always been vulnerable. Cacao thrives only in narrow conditions: the shady understory of tropical rainforests, within a band 20 degrees north or south of the Equator. It wilts if temperatures climb too high or rainfall falters. And worse, cacao is constantly stalked by blights and pests. Farmers have a grim, evocative lexicon for these threats: Witches’ Broom, Frosty Pod, Black Pod, Swollen Shoot. Each name is a nightmare. Witches’ broom disease, caused by a fungus that likely coevolved with cacao in the Amazon, makes branches sprout deformed tangles of stems – the “brooms” – sucking life from the tree. Frosty pod, a cousin fungus, erupts across the surface of pods in a white, powdery crust, rotting the beans inside. Black pod disease turns the fruit black and foul. And the swollen shoot virus, spread by tiny mealybugs, invisibly infects trees for years, slowly strangling them from within until their trunks swell and crack.

For decades, these plagues have periodically ravaged cacao heartlands. In Ghana – the world’s second largest cocoa producer – outbreaks of swollen shoot disease first discovered in the 1930s have never fully gone away. By the 1960s, it had decimated Ghana’s yields, contributing to a collapse in production that shook the cocoa market. In the 1980s, a severe resurgence of the virus, coupled with drought, forced farmers to fell and burn countless trees; Ghana’s cocoa output plummeted, and with it the nation’s economy trembled. Across the ocean in Brazil, another chocolate powerhouse, an even more dramatic disaster was unfolding. In the late 1980s, witches’ broom fungus invaded the cacao groves of Bahia. This rich agricultural region, which once made Brazil the world’s number two cocoa producer, saw its orchards overrun in just a few years. Ghostly groves of leafless branches and fungal spores replaced what had been emerald canopies dotted with red-gold pods. By the 1990s, Brazil’s cocoa harvest had collapsed by more than half, devastating rural communities. Many still call it “the chocolate apocalypse.”

These crises exposed a dire truth: the global chocolate supply rests on a narrow genetic base. Cacao has a surprisingly shallow family tree on today’s farms. Though the plant originated in the upper Amazon with myriad wild varieties, the cacao grown commercially around the world descends from just a few lineages. In the 18th and 19th centuries, colonial planters spread cacao from its native Americas to Africa and Asia – but they carried only a handful of pods across the oceans. In West Africa, for example, virtually all the millions of cacao trees stem from a few seeds of a type called Forastero (specifically the “Amelonado” strain) that were brought from Brazil and São Tomé. This gave great yields and hardiness, so it flourished – but it was just one branch of cacao’s diversity. An even older lineage, Criollo, once dominated in Central America and was famed for its exquisite flavor, yet it was so disease-prone that most plantations abandoned it long ago. Hybrids like Trinitario (born from crosses of Criollo and Forastero in the Caribbean) offered a mix of flavor and vigor, but still only tap a fraction of cacao’s wild gene pool. Over time, farmers everywhere came to rely on a few star varieties that produced well under local conditions. It was efficient, but it was also a genetic bottleneck. Fields of nearly identical trees became easy targets for pathogens. Monoculture was a buffet for blight – when one tree fell, many fell.

The urgency to save cacao from its own success – from the genetic narrowness that made it profitable – started to dawn on scientists and the chocolate industry alike. If chocolate was to survive the 21st century, cacao would need to be armed with stronger genes. Disease resistance, climate resilience, flavor preservation, biodiversity – all hung in the balance. The solution would have to be global and visionary. It would require not just traditional crop breeding, but cutting-edge science and unprecedented international collaboration. Thus began the cacao genome revolution: an effort to decode, preserve, and manipulate the genetic blueprint of cacao. It’s a story spanning rainforest expeditions, high-tech laboratories, ancient trees and modern corporations – all converging to secure the future of chocolate.

Guardians of a Lost Genetic Heritage

Long before DNA sequencing entered the picture, a handful of agricultural adventurers had started safeguarding cacao’s vanishing diversity. In the 1930s, as diseases like witches’ broom ravaged plantations in the Caribbean, researchers realized that somewhere in cacao’s ancestral homeland might exist plants with natural resistance. They embarked on arduous expeditions into the Amazon Basin – Peru, Ecuador, Colombia, Brazil – hunting wild cacao trees in jungle backwaters. These scientists, like the legendary F.J. Pound of Trinidad, collected hundreds of pods from isolated groves along the Amazon’s tributaries. Many of those wild strains, brought back to research stations, proved indeed tougher against diseases. By cross-breeding them with local crops, breeders gradually introduced vital resilience. In Trinidad, for example, new Witches’ Broom-tolerant hybrids derived from those Amazon genes saved that island’s cocoa farms mid-century. Similar breeding efforts spread to other regions, but it was a constant race to keep up with evolving diseases.

To conserve all this genetic wealth, international cacao genebanks were established as living libraries of the crop’s diversity. Nowhere is this more impressive than at the International Cocoa Genebank in Trinidad (ICG). Hidden in the verdant fields of the University of the West Indies’ research center, the ICG is often called Noah’s Ark for chocolate. Here, about 2,400 unique varieties of cacao are grown in one expansive garden – the most diverse cacao collection on the planet. Each tree in Trinidad’s genebank has a story: some were rescued from a riverbank deep in the Peruvian jungle; others are descendants of ancient Mayan orchards. Walking through the ICG, you can spot trees with slender yellow pods, squat green pods, or even cherry-red pods covered in spiky protuberances. Some yield beans with intense fine aromas of jasmine or nuts; others carry genes that make them impervious to particular fungi. “It’s a wishing well,” says Dr. Pathmanathan Umaharan, the genebank’s director. Whenever a new threat or challenge emerges – a virulent fungus, a soil toxin, a climate extreme – chances are that the cure lies somewhere among these trees. Indeed, the Trinidad collection has given rise to many success stories. When cadmium contamination recently threatened cacao exports (as certain soils cause cacao to uptake this heavy metal into beans, running afoul of food safety limits), researchers screened the collection and found a rare genotype whose roots avoid absorbing cadmium. That trait is now being bred into commercial lines.

In Central America, another key sanctuary is CATIE, the Tropical Agricultural Research and Higher Education Center in Costa Rica. At CATIE’s sprawling farm, over 1,100 distinct cacao types are maintained, representing clones from throughout the Americas and West Africa. Tall shade trees arch over the plots, mimicking the rainforest canopy. Here too, diversity is on full display – from venerable heirloom Criollos once cherished by Maya kings, to new hybrids developed by modern scientists. CATIE’s international collection, like Trinidad’s, has been built over decades through global exchanges of seeds and cuttings. Notably, because cacao seeds cannot survive drying or freezing like typical seed bank specimens, these collections must be kept alive as trees, requiring constant care. They are vulnerable to hurricanes, droughts, or disease outbreaks themselves. Protecting them is an ongoing challenge. As Dr. Umaharan poignantly notes, these genebanks are the last refuge for cacao’s gene pool – and “the arks of chocolate must not sink.”

The work at Trinidad, CATIE, and similar institutes is largely unsung, but it laid the foundation for the genome revolution. Preserving genes is one step; the next was learning how to read and deploy them in the most efficient way possible. By the turn of the millennium, the time was ripe for cacao to enter the age of genomics.

Cracking the Chocolate Code

In September 2010, a small group of scientists and executives gathered at a press event and made an astonishing announcement: the genome of the cacao tree had been sequenced and assembled. This achievement was the culmination of years of quiet effort by an international consortium – and an unlikely alliance of industry and academia. The project had all the elements of a biotech thriller. On one side was Mars, Incorporated, the American confectionery giant behind M&Ms, Snickers, and countless chocolate treats. Mars had a clear business interest: the company relies on a stable cocoa supply, and threats to cacao are threats to their bottom line. But under the leadership of Mars’s chief agricultural officer, Dr. Howard-Yana Shapiro, the company also had a vision of “applying the best of science to an underserved crop.” Mars committed substantial funding and insisted the results be made public for all. Partnering with Mars was the U.S. Department of Agriculture’s ARS research labs – including a team in Miami headed by molecular biologist David Kuhn – and IBM’s Computational Biology Center, which lent its formidable Blue Gene supercomputer to crunch DNA data. Their focus: sequence a Forastero cacao variety known as Matina 1-6, a workhorse cultivar whose genes are representative of the bulk of the world’s cocoa trees.

Meanwhile, a parallel effort had formed across the Atlantic. The French agricultural research agency CIRAD, renowned for tropical crop science, teamed up with Penn State University in the U.S. and chocolate maker Hershey’s. This team set out to sequence a prized Criollo cacao – a strain collected from Belize, descended from the cacao grown by ancient Mayans. This dual-track approach – decoding both a common Forastero and a high-quality Criollo – would allow scientists to compare the genomes of two distinct genetic lineages of cacao. Some dubbed it a friendly race, akin to the public-versus-private sprint to sequence the human genome a decade prior. But in truth, the cacao scientists maintained a spirit of collaboration. Dozens of researchers from around 20 institutions and six countries contributed. The DNA itself was extracted from cacao leaves and then shattered into millions of fragments for sequencing. As the data poured in, bioinformaticians assembled the fragments back together like a colossal jigsaw puzzle – the cacao genome has 10 chromosomes and around 420 million DNA base pairs, so it was like solving ten huge puzzles at once. Dr. Mark Guiltinan, a plant molecular biologist at Penn State on the Criollo project, likened the task of genome assembly to “an art form” requiring equal parts computing power and genetic savvy.

When the dust settled, both teams had succeeded. The Forastero genome (Matina) was about 92% fully decoded, revealing some 35,000 genes. The Criollo genome was a bit less complete (about 76% assembled) but still identified over 95% of its genes. It was immediately clear that while the two cacao types shared most of their DNA, there were key differences – including genes potentially related to flavor, disease resistance, and adaptations to environment. The researchers found, for example, gene variants in Criollo linked to its superb flavor chemistry (traits that had made Criollo chocolate so prized), whereas the hardier Matina had genes conferring natural defenses against diseases. Now breeders could pinpoint these genes.

Crucially, all the data was released into the public domain, posted to a communal database and the U.S. National Center for Biotechnology Information. Anyone, from an Ivory Coast agronomist to a boutique cacao farmer in Hawaii, could access cacao’s genetic code. This open-science approach, championed by Mars and the academic partners, meant the genome wouldn’t sit behind patents or paywalls. The era of evidence-based cacao breeding had arrived. Instead of relying purely on observation and decades of field trials to see if a new cacao cross had the right stuff, scientists could directly look for genetic markers – DNA sequences that signaled the presence of a desired trait. This method, called marker-assisted selection, was already revolutionizing crops like corn and rice. Now cacao would get the same boost. As Guiltinan put it, the goal wasn’t to genetically engineer chocolate bars, but to give traditional breeders a high-tech map to speed up their work.

The genome breakthrough also yielded deeper scientific insights. Researchers noted that cultivated cacao varieties showed areas of startlingly low genetic variability – stark evidence of cacao’s bottleneck from its voyage out of Eden (the Amazon). And because the pathogens that attack cacao (such as the witches’ broom and frosty pod fungi) had also been genetically sequenced by plant pathologists, scientists could, for the first time, compare the genomes of cacao and its enemies. They began to unravel which cacao genes responded to infection and which fungal genes triggered the most damage. This arms-race in code could reveal new ways to fortify cacao’s defenses.

Amid the celebration of the genome’s publication, there was a humbling realization: decoding DNA was just step one. Turning that knowledge into hardier trees in farmers’ fields would be the true test of the revolution.

Breeding for a Sweeter Future

With the cacao genome in hand, plant breeders around the world leapt into action to create the next generation of cacao trees. In West Africa, home to over 70% of global cocoa production, breeding programs run by national institutes and the USDA zeroed in on traits to combat the region’s biggest scourges: swollen shoot virus and black pod rot. Using DNA markers, researchers could identify young seedlings that carried genetic resistance to these diseases – perhaps inherited from a distant Amazonian ancestor – and select them early on, rather than waiting years for those trees to mature and be tested by exposure. This shaved years off the breeding cycle. In Ghana and Ivory Coast, trials of new hybrids are underway that promise not only higher yields but also built-in resistance to viral infection, potentially saving millions of trees from the ax.

In Latin America, breeders faced the twin terrors of witches’ broom and frosty pod. They turned to the gene bank reserves: varieties like Scavina 6, a legendary Upper Amazon cacao genotype known for its strong defense against witches’ broom, became a popular parent in crosses. Before genome mapping, Scavina’s resistance gene was used, but often at the cost of other traits like flavor. Now, scientists can combine multiple resistance genes – for different diseases – into one plant, while monitoring that flavor genes remain intact. In Ecuador and Peru, where a high-yield but poor-flavor variety called CCN-51 had dominated new plantings (because it could survive fungus attacks), researchers are working on crosses that marry CCN-51’s toughness with the fine flavor of traditional strains. They scour seedlings’ DNA for markers associated with desirable flavor precursors, ensuring those aren’t lost in the quest for resilience. Even flavor itself is getting demystified: some genes influencing the cacao butter content and aromatic compounds have been pinpointed, giving breeders clues on maintaining or even enhancing taste. The stakes are high – no chocolate lover wants a future of plentiful cocoa that tastes bland. The holy grail is a tree that is disease-proof, high-yielding, climate-hardy, and delicious all at once. Breeders joke it’s like trying to breed a racehorse that’s also as tough as a mule. Progress is incremental, but accelerating thanks to the genome tools.

Profiles of the people leading this charge show a passionate, almost missionary zeal. In a Pennsylvania lab, Dr. Siela Maximova tends to petri dishes of tiny cacao embryos, coaxing them to grow into saplings after experimental crosses – her team’s innovations in tissue culture have made it easier to propagate new hybrids en masse. In France, CIRAD scientists travel frequently to Côte d’Ivoire, working with local agronomists to test French-developed varieties in African soils, bridging languages and cultures through a shared goal of sustainable cocoa. At the USDA’s facility in Miami, David Kuhn and colleagues continue to update the cacao DNA database as new genetic variants are discovered, essentially keeping a genetic encyclopedia of cacao that breeders consult like scripture. And in the green hills of Costa Rica, an old-school plant breeder at CATIE might walk through a test plot, chewing on a raw cacao bean to get a hint of its bitterness or floral notes, even as he checks his tablet displaying the genetic marker profile of that very tree. The merging of traditional knowledge and modern genomics is creating something new: the art and science of cacao improvement.

There have been setbacks and lessons. Early on, it became clear that even with genomic insight, nature can surprise breeders. Some gene combinations don’t work as expected in the field due to environmental factors we don’t yet understand. And pathogens themselves evolve – a fungus might overcome a single resistance gene after a few years, which is why stacking multiple resistance traits is crucial. This is why preserving biodiversity remains central: the more genetic options available, the more flexibility to respond to whatever comes. The genome revolution hasn’t replaced the need for those cacao “arks”; it has actually made them more valuable by highlighting which rare genes they hold and deploying them more intelligently. Every cacao farmer ultimately benefits from this network of science – even if they might not realize the new trees in their field were crafted with the help of a supercomputer and a gene bank thousands of miles away.

From Genome to Genome Editing

As remarkable as the breeding advances have been, some visionaries have set their sights on an even more direct approach: genome editing. Why wait for slow breeding cycles, they argue, if we can tweak the cacao’s own DNA precisely to improve it? In a pristine lab at the Innovative Genomics Institute in California, rows of cacao seedlings grow not in soil, but in sterile gel inside test tubes. These seedlings are special – their DNA has been subtly altered using the CRISPR-Cas9 gene editing tool. Here, plant biologist Myeong-Je Cho and colleagues work under the guidance of Nobel laureate Jennifer Doudna, the co-inventor of CRISPR. In partnership with Mars, they began experimenting with editing cacao genes around 2017. The first targets: genes that influence drought tolerance and disease resistance. By knocking out or modifying certain genes, they aim to create cacao trees that can thrive in a hotter, drier climate, or that become immune to viruses. The vision is bold: cacao that could be planted in new regions or remain productive despite climate change, ensuring chocolate doesn’t become a casualty of the warming planet. Mars’s long-term commitment – including a pledge of $1 billion toward sustainability research – signaled how seriously the industry takes these threats. “Chocolate is forever, right?” one headline quipped – the implication being that it won’t be unless we act. Mars and others want to make sure it is.

By 2025, the gene-editing effort took a further leap. Mars announced a partnership with a leading agricultural biotech startup to fast-track CRISPR-enhanced cacao. The company licensed cutting-edge CRISPR technology to specifically address cacao’s most pressing issues. The director of plant science at Mars, Dr. Carl Jones, explained that the aim is to help cacao “adapt to climate challenges, disease pressures, and resource constraints” in ways traditional breeding might not achieve quickly enough. One can imagine a not-too-distant future where a cacao tree is edited to resist the swollen shoot virus – the genetic tweak might involve inserting a fragment of a native resistant gene from a wild cacao into a high-yield variety’s genome, conferring full immunity. Or perhaps a gene that regulates the plant’s response to heat stress is adjusted so that pods set even when temperatures spike. Unlike older genetic modification, which often meant introducing foreign genes (and sparked consumer fears of “GMO foods”), these cacao edits might simply fine-tune the plant’s own genetic switches. Proponents hope this will be more palatable to regulators and the public, framing it as accelerated natural selection.

Still, the prospect of gene-edited chocolate raises questions. Will consumers accept it? Mars is betting yes – or that they may not even need to know if regulations classify these edits as conventional breeding. Some scientists caution that biology is complex: one edited trait could have ripple effects on flavor or ecology. “Finding the right blend of flavors, while ensuring disease resistance, rapid growth and high productivity, isn’t easy,” one researcher noted, underscoring that even with CRISPR’s precision, cacao’s genome has trade-offs we don’t fully grasp. Despite these debates, field trials for edited cacao are likely on the horizon. If successful, they could complement the slower breeding efforts and serve as a sort of genetic insurance policy for the crop.

A New Dawn for Chocolate

On a late summer morning in Trinidad, Dr. Umaharan strolls through the cocoa genebank after a night of heavy rain. The air is thick with petrichor and the faint sweet smell of ripe cacao fruit. He stops before a particular tree – a scraggly, inconspicuous specimen with mottled pods. This is an old Upper Amazon genotype, hardly used in any plantation, but DNA tests show it carries a gene that could hold the key to defeating frosty pod disease. Nearby, a young researcher is tagging flowers on a different tree, carefully pollinating by hand and tying little identifying ribbons. That tree, a descendant of a Criollo, has outstanding flavor. The hope is that its offspring, crossed with a disease-resistant parent, will combine the best of both. It’s painstaking work – but now they have the genome data to know which seedlings from this cross to nurture.

In a very different scene halfway around the world, under the fluorescent lights of a climate-controlled growth chamber, stand row upon row of knee-high cacao plants whose leaves have a slight golden hue – a sign of an edited gene affecting chlorophyll, intentionally done to test the CRISPR system. A technician checks each for any abnormal growth, then makes notes on a tablet. So far, so good. These could be the first generation of climate-resilient cacao, engineered to prosper where their ancestors would have withered. If they pass more tests, they might be headed to a field in Côte d’Ivoire or Indonesia, heralding a new era in cacao farming.

The cacao genome revolution is not one single event but an ongoing saga – a blend of conservation, innovation, and determination. It links the deep past to the unfolding future. Think of the ancient cacao groves where Maya priests once prayed to the cacao spirit for bountiful harvests. Those priests could not have imagined DNA sequencers or gene editing, but they understood the sacred importance of this plant. Today’s scientists and farmers, in their own way, carry on that reverence. They peer through microscopes and pore over computer screens with the same hope: to keep the cacao tree healthy and abundant for generations to come.

The stakes could not be more human. Millions of smallholder farmers in West Africa, Asia, and Latin America depend on cacao for their livelihoods. Whole national economies rely on its export. And for the rest of us, cacao provides small daily joys – a piece of chocolate that melts on the tongue, a beloved flavor that has journeyed from a tropical pod to our palate. Preserving that joy against threats like blight and climate change is both a scientific challenge and a moral one.

As we look ahead, we can imagine the year 2050: a farmer walking through a thriving cocoa orchard that grows where previously it was too hot or dry. The trees are bearing plentiful pods, and he knows these particular trees won’t easily fall to disease – they were bred and even gene-edited for exactly that resilience. The pods he harvests carry the classic rich cocoa taste that his grandparents remember, because the breeders made sure not to lose the fine flavor genes in the process. In a distant lab, some of the very scientists who helped create these trees might be enjoying a chocolate bar made from that farmer’s crop, marveling at how far things have come.

From the brink of collapse in the 1980s to the high-tech rescue missions of today, cacao has been on a remarkable journey. It has survived fungal plagues, navigated genetic bottlenecks, and now is entering a renaissance of genomic enlightenment. The cacao genome revolution is ensuring that the story of chocolate will go on – richer, more sustainable, and perhaps even sweeter than before. In saving the cacao tree, we are, in a sense, decoding and safeguarding a small but beloved piece of our global heritage. And that is truly a heroic enterprise worth savoring.