Research Article |
|
Corresponding author: Pablo Dupiol ( pablo.dupiol@gmail.com ) Corresponding author: Jérôme Duminil ( jerome.duminil@ird.fr ) Academic editor: Katharina Budde
© 2025 Pablo Dupiol, Armel Chakocha, Marie-Louise Avana Tientcheu, Mohamed Mahamoud Charahabil, Cedric Mariac, Adeline Barnaud, Jérôme Duminil.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Dupiol P, Chakocha A, Avana Tientcheu M-L, Charahabil MM, Mariac C, Barnaud A, Duminil J (2025) Development of new microsatellite markers for Cola acuminata (Malvaceae), a socio-economically important fruit tree species in Central Africa. Plant Ecology and Evolution 158(3): 358-366. https://doi.org/10.5091/plecevo.147801
|
Background and aims – We developed a new set of nuclear microsatellite markers for Cola acuminata (Malvaceae), an important African food tree species commonly known as the kola nut. Probably originating from the tropical rainforests of the Congo Basin, C. acuminata is widely cultivated in the humid savannahs of the region where its nuts are sold throughout Central and West Africa for their stimulant properties. Nuclear microsatellite markers (SSRs) are well suited for assessing the genetic diversity and population structure of plant species due to their high variability.
Material and methods – Leaf samples were collected from 84 C. acuminata cultivated individuals across three sites in Cameroon, two in the savannah zone, one in the forest zone. SSR markers were developed by sequencing genomic DNA from two individuals using an Illumina HiSeq platform. Genetic diversity was assessed based on 14 SSR markers genotyped in 84 individuals, and marker transferability to the closely related species Cola nitida was tested.
Key results – Forty-eight new microsatellite loci were developed, of which 14 were polymorphic in C. acuminata. The results demonstrated a high level of genetic diversity with the presence of two to 33 alleles per locus (with an average of 14.14) across the three sites. The transferability of these markers was confirmed with 13 out of the 14 SSRs successfully amplifying in the closely related species, Cola nitida.
Conclusion – These newly developed SSRs will be useful for assessing genetic diversity, genetic differentiation, and gene flow patterns of C. acuminata in the tropical forests of Central Africa. Preliminary results suggest genetic similarity between the two savannah sites. However, these two sites were significantly differentiated from the site in the forest zone. This suggests that the propagation material introduced in the savannah zone did not originate from the forest in southern Cameroon.
conservation, domestication history, food tree species, non-timber forest products, tree crop
In Africa, an estimated 600 million people rely heavily on non-timber forest products (NTFPs) for their livelihood (
The Cola Schott & Endl. genus of the Malvaceae family comprises around 125 species, including Cola acuminata (P.Beauv.) Schott & Endl., known as the kola tree, whose seeds (the kola nuts) hold a significant cultural value in many African communities for social and religious aspects (
So far, only one study investigated the genetic diversity of C. acuminata and C. nitida (Vent.) Schott & Endl, in Nigeria using Random Amplified Polymorphic DNA (RAPD) markers (
The genus Cola contains two socio-economically important species known as kola trees, Cola acuminata, native to Central Africa, and Cola nitida, native to West Africa (
Cola acuminata can reach a height of 30 m and a diameter of about 30 cm (
We sampled leaf samples of cultivated individuals of C. acuminata from three sites in Cameroon, one in the forest zone and the other two in the savannah zone (Fig.
In the forest zone, trees were sampled in Meyo-Ville (3.55464°N, 11.93430°E; Mvila department, South region) and were corresponding either to wild individuals that were spared when the fields were prepared (slash-and-burn area), or were planted using local genetic material. In total, 28 individuals were sampled from four agroforest farms (corresponding to four different land owners).
In the savannah zone, trees were sampled in Koupa-Matapit, (5.76953°N, 10.80081°E; Noun department, West region) and in Bametcha (5.29322°N, 10.49694°E; Ndé department, West region). In both sites, 28 individuals were sampled from respectively 13 and nine agroforest plantations. The cultivated individuals sampled at these two sites were introduced several generations ago (probably several centuries ago) using genetic material of unknown origin.
Moreover, in order to test the transferability of SSR markers to the closely related Cola nitida species, 10 cultivated trees were collected from two different villages in Senegal, Adéane, and Agnack (Suppl. material
DNA from two C. acuminata individuals from Koupa-Matapit village in Cameroon (AC051 and AC066; Suppl. material
To enable multiplexed co-amplification of loci, we employed the three-primer PCR method described by
Characteristics of the 14 microsatellite markers developed for Cola acuminata. * Q1 = TGTAAAACGACGGCCAGT (
| Microsatellite marker | Primer sequences (5’–3’) F: forward, R: reverse | Tail and fluorescent label* | Motif and repeat number** | Allele size range (bp) | GenBank accession number |
| CA09 | F: AGCCATAAGTGAAAGTGTGAAACT | Q1-6-FAM | (AT)13 | 145–220 | PP943415 |
| R: AGTTGCTGAAGTATGACTAAGCCA | |||||
| CA18 | F: TGGTAGAATTGCATAGGAGATTGA | Q2-NED | (AAT)17 | 140–170 | PP943418 |
| R: ACCAACCTCCAATCGCAACT | |||||
| CA27 | F: GCCAACACGATTCATTAACATATAGG | Q3-VIC | (AT)12 | 144–182 | PP943421 |
| R: ACCGGATATGAACTCTTTGCCA | |||||
| CA36 | F: AAGGGCCCTTGAAATGGTGG | Q3-VIC | (AG)21 | 225–275 | PP943423 |
| R: TGAGTAACGGTGTTGAGATCCT | |||||
| CA05 | F: TTTGTTAGCCACCTTCAAATCATT | Q4-PET | (AG)10 | 135–175 | PP943412 |
| R: TGATTTCTTATTTCACTAGCAATGGC | |||||
| CA01 | F: AGACGCAAGGAGTTCTTTCCA | Q1-6-FAM | (AT)10 | 116–146 | PP943411 |
| R: GGTTTCGTTGAATTGAATCAAACA | |||||
| CA12 | F: TTTCTTCCACGGCTAGATACAA | Q1-6-FAM | (AT)11 | 178–220 | PP943416 |
| R: TCAGTCTACAAAGAGGATAAGTAGG | |||||
| CA23 | F: CATAGGCTTCCTGGTGCACT | Q2-NED | (AAG)13 | 142–225 | PP943420 |
| R: TTGATGAGCTCAGATGGGCAT | |||||
| CA06 | F: TGTAGGTTTGGCTTGCCTCC | Q3-VIC | (AT)10 | 135–175 | PP943413 |
| R: CAAAGCATCCCAACAATCGCA | |||||
| CA16 | F: CCATCGATGGCTAGTTATGAGTT | Q4-PET | (AT)12 | 105–265 | PP943417 |
| R: TGAGCCTTAGACTACTAGGGACA | |||||
| CA08 | F: GTTTGTATTGAAGTGCAAACTGTT | Q1-6-FAM | (AT)16 | 145–180 | PP943414 |
| R: CACCTCCCTCCCTTCCCTTA | |||||
| CA19 | F: GTGCGTAAACACAGCCTCAG | Q2-NED | (AGC)12 | 160–190 | PP943419 |
| R: ACTTTAGTAAACCAACAGGAGGGA | |||||
| CA34 | F: TGAAATGGTCCTAAGTTACATCCA | Q3-VIC | (AT)10 | 195–228 | PP943422 |
| R: TCTAAACTTTGTGCGGGCCA | |||||
| CA39 | F: GCACTCAGAACGTTCTCCCT | Q4-PET | (AT)14 | 175–230 | PP943424 |
| R: ACGAGATTCCTGTGCTGGTG |
The 14 polymorphic SSRs were then amplified for all 84 C. acuminata samples using three multiplex reactions (Multiplex 1: CA09-Q1, CA18-Q2, CA27-Q3, CA36-Q3, CA05-Q4; Multiplex 2: CA01-Q1, CA12-Q1, CA23-Q2, CA06-Q3, CA16-Q4; Multiplex 3: CA08-Q1, CA19-Q2, CA34-Q3, CA39-Q4). The multiplex PCR included 0.15 μL of each forward primer [10 μM], 0.10 μL of each reverse primer [10 μM], 0.15 μL of each tail (Q1–Q4) [10 μM], 1 μL of DNA, 7.5 μL of PCR Master Mix (Type-it Microsatellite, Qiagen), and was completed to a total volume of 15 μL with distilled water. PCRs and genotype scoring were carried out in the same way as for the simplex reactions above. The same protocol was used to test the transferability of the markers on 10 individuals of C. nitida coming from Senegal (Suppl. material
An analysis of genetic diversity was carried out on a dataset comprising 84 C. acuminata samples coming from three different Cameroonian populations (Suppl. material
The SPAGeDi program v.1-5a (
We conducted a linkage disequilibrium test for each locus pair across all C. acuminata populations (Fisher’s method) using Genepop v.4.7.5 (
The genetic differentiation between populations (pairwise FST) was estimated using a nested ANOVA procedure following
We developed 14 new SSR markers in C. acuminata that proved to be highly polymorphic, with an average of 14.14 alleles per locus across all populations (Suppl. material
Intrapopulation genetic diversity of Cola acuminata in three Cameroonian populations at 14 newly developed microsatellite markers. N = number of individuals, nA = number of alleles, He = expected heterozygosity, Ho = observed heterozygosity, FI = inbreeding coefficient with * p value < 0.05, ** p value < 0.01, *** p value < 0.001, AR = allelic richness, fnull = null allele frequency.
| Koupa-Matapit (N = 28) | Bametcha (N = 28) | Meyo-Ville (N = 28) | ||||||||||||||||
| Locus | nA | He | Ho | FI | AR (k = 30) | fnull | nA | He | Ho | FI | AR (k = 30) | fnull | nA | He | Ho | FI | AR (k = 30) | fnull |
| CA01 | 9 | 0.8825 | 0.821 | 0.070 | 8.45 | 0.03 | 8 | 0.8487 | 0.750 | 0.118 | 7.30 | 0.06 | 10 | 0.8122 | 0.560 | 0.315*** | 8.71 | 0.06 |
| CA05 | 13 | 0.9202 | 0.542 | 0.417*** | 11.42 | 0.00 | 12 | 0.7604 | 0.643 | 0.157 | 9.61 | 0.00 | 12 | 0.8998 | 0.500 | 0.450*** | 10.40 | 0.00 |
| CA06 | 7 | 0.4682 | 0.148 | 0.688*** | 5.64 | 0.56 | 1 | - | - | - | 1.00 | 0.56 | 9 | 0.7409 | 0.571 | 0.232 | 7.13 | 0.56 |
| CA08 | 4 | 0.5332 | 0.296 | 0.449* | 3.36 | 0.33 | 4 | 0.5240 | 0.321 | 0.391* | 3.32 | 0.24 | 6 | 0.7023 | 0.444 | 0.372** | 5.53 | 0.26 |
| CA09 | 2 | 0.1623 | 0.000 | 1.000** | 1.99 | 0.00 | 2 | 0.2139 | 0.000 | 1.000** | 2.00 | 0.00 | 2 | 0.3310 | 0.000 | 1.000** | 2.00 | 0.00 |
| CA12 | 13 | 0.9169 | 0.643 | 0.303*** | 11.48 | 0.18 | 10 | 0.8715 | 0.348 | 0.606*** | 9.05 | 0.51 | 11 | 0.8449 | 0.240 | 0.720*** | 9.03 | 0.51 |
| CA16 | 19 | 0.9266 | 0.630 | 0.325*** | 14.55 | 0.21 | 13 | 0.8974 | 0.550 | 0.393*** | 11.61 | 0.38 | 16 | 0.9053 | 0.640 | 0.297*** | 12.98 | 0.38 |
| CA18 | 9 | 0.8672 | 0.778 | 0.105 | 8.35 | 0.08 | 7 | 0.8041 | 0.680 | 0.157 | 6.53 | 0.16 | 5 | 0.7135 | 0.704 | 0.014 | 4.55 | 0.16 |
| CA19 | 9 | 0.7305 | 0.679 | 0.072 | 6.97 | 0.04 | 5 | 0.7016 | 0.593 | 0.158 | 4.54 | 0.12 | 12 | 0.8422 | 0.786 | 0.068 | 9.33 | 0.03 |
| CA23 | 8 | 0.8092 | 0.481 | 0.410*** | 7.48 | 0.00 | 5 | 0.6513 | 0.481 | 0.264* | 4.54 | 0.00 | 5 | 0.7376 | 0.423 | 0.431*** | 4.92 | 0.00 |
| CA27 | 11 | 0.8833 | 0.741 | 0.164* | 10.15 | 0.00 | 8 | 0.8537 | 0.769 | 0.101 | 7.49 | 0.00 | 13 | 0.8945 | 0.926 | -0.036 | 10.91 | 0.00 |
| CA34 | 11 | 0.8718 | 0.600 | 0.316*** | 9.42 | 0.25 | 8 | 0.7968 | 0.857 | -0.077 | 6.57 | 0.01 | 2 | 0.4832 | 0.000 | 1.000*** | 2.00 | 0.01 |
| CA36 | 11 | 0.7792 | 0.714 | 0.085 | 8.16 | 0.00 | 7 | 0.6534 | 0.630 | 0.037 | 5.91 | 0.00 | 13 | 0.8279 | 0.857 | -0.036 | 9.59 | 0.00 |
| CA39 | 16 | 0.8903 | 0.750 | 0.160* | 12.14 | 0.09 | 13 | 0.8700 | 0.704 | 0.194 | 10.19 | 0.09 | 12 | 0.8868 | 0.778 | 0.125 | 10.61 | 0.09 |
| Multilocus average | 10.14 | 0.760 | 0.559 | 0.269*** | 8.54 | 0.13 | 7.36 | 0.674 | 0.523 | 0.228*** | 6.40 | 0.15 | 9.14 | 0.758 | 0.531 | 0.305*** | 7.69 | 0.15 |
We detected potential signs of scoring errors due to stuttering for four out of the fourteen loci (CA06, CA08, CA09, CA39) but we have not detected evidence of large allele dropout across all loci. Five out of the 91 locus pair comparisons showed signs of linkage disequilibrium (significant level of 0.05), but only one pair remained significant after applying a Bonferroni correction (loci 6 and 36).
Genetic differentiation between populations showed a relatively low FST value (0.023) between Bametcha and Koupa-Matapit, while differentiation between the two savannah sites and the forest site was higher (0.132 between Meyo Ville and Koupa Matapit and 0.196 between Meyo Ville and Bametcha; Suppl. material
Thirteen out of the fourteen SSR markers were transferable to C. nitida. Only CA12 was not transferable. Polymorphism was observed for all transferable loci except locus 9 (Suppl. material
The high polymorphism observed in the 14 SSR markers developed for C. acuminata demonstrates their effectiveness in assessing the distribution of genetic diversity within this species. The presence of null alleles in several loci, such as in loci CA23, CA36, and CA09, leads to an underestimation of heterozygosity and high FI values. The FI values corrected for the presence of null alleles were close to zero, suggesting the absence of inbreeding or genetic substructure within the populations.
Our results provide preliminary insights into the evolutionary history of C. acuminata in Cameroon, particularly with regard to its cultivation history. The relatively low FST value (0.023) between the Bametcha and Koupa-Matapit populations suggests high gene flow and genetic similarity between them. This is consistent with observed local seed exchange practices and trade routes between these populations. The relatively higher genetic differentiation between the two cultivated populations from the savannah region and the cultivated population from the forest region, as reflected by higher FST values than those obtained between the two savannah populations, can be interpreted in different ways. As we have no information on the population of origin of planting material now present in the cultivated populations, and having no wild samples included in our study, our results need to be interpreted with caution. The cultivated population of Meyo-Ville is close to southern Cameroon rainforests, so that we can speculate that C. acuminata in this population represents remnant trees from a previously natural population and/or was introduced using wild material from southern Cameroon. If we now consider that the introduction of C. acuminata in the savannah zone also involved propagation material originating from southern Cameroon rainforests, the pattern of genetic divergence observed between the savannah and forest zone may be the result of a genetic bottleneck effect resulting from the introduction and subsequent cultivation process of C. acuminata in the savannah zone. This might also be due to the management practices employed in the two cultivated populations in the savannah region. In particular, the selection practices applied to these populations may have led to a loss of genetic diversity and genetic differentiation between the populations of origin in the forest and the introduced populations in the savannah. The genetic bottleneck effect and the effect of management practices are not mutually exclusive.
Alternatively, we could speculate that the original planting material used when the species was introduced to the savannah actually came from sources other than the rainforests in the south of Cameroon. Based on the ethnobotanical data that we have collected (data not shown), and from data obtained on other species (
The levels of genetic diversity are relatively high in all three populations. Despite a low genetic differentiation between the two cultivated populations from the savannah zone, we observed differences in their levels of genetic diversity, with higher allelic richness in Koupa-Matapit than in Bametcha (ANOVA procedure, p value < 0.001). This result can be interpreted in light of social factors. During the sampling of the plant material, we conducted interviews with farmers in Koupa-Matapit, Bametcha and Meyo-Ville, addressing notably the origin of the genetic material used for plantation (data not shown). As discussed, the low level of genetic differentiation between the two savannah populations suggests that they share a common ancestry and/or experience extensive gene flow (i.e. exchange of planting material). However, differences in management practices between the two populations might have adversely influenced the levels of genetic diversity of C. acuminata in these two cultivated populations. In fact, farmers in Koupa-Matapit often source seeds for planting from local markets, whereas those in Bametcha tend to use their own seeds. Sourcing seeds locally in Bametcha, with no or very limited input of external genetic material, might reduce the genetic diversity of the species through genetic drift (reduction in population size). The selection of individuals with desirable traits by farmers can also result in a further reduction of genetic diversity in this population. These two processes can act in synergy.
The successful transferability of SSR markers to C. nitida highlights their potential utility for cross-species genetic studies within this genus. The reduced polymorphism and narrower allele size range observed in C. nitida may be due to the small size of the tested population. The high transferability rate of the SSR markers between the two species (93% based on our results) and the previous demonstration of hybridisation events between them (
Fourteen new polymorphic microsatellite markers were developed for the food tree C. acuminata. These markers enable to study the distribution of genetic diversity at different geographical scales, improving our understanding of the domestication history and gene flow patterns within this species. The low level of genetic differentiation between the two cultivated populations in the savannah zone suggests a common origin and/or gene flow, probably facilitated by seed exchange and trade routes. In contrast, the higher genetic differentiation between the savannah and forest populations possibly indicated distinct source populations or could reflect differences due to the introduction and subsequent cultivation practices in the savannah zone. A higher level of genetic diversity was observed in the Koupa-Matapit population than in the Bametcha population, potentially due to differences in seed selection practices by the farmers. This demonstrates the importance of characterising the impact of farmers’ management practices on the genetic diversity of cultivated species, in order to develop sustainable management strategies to maintain their genetic resources. The high transferability success rate of the newly-developed genetic markers to another Cola species, C. nitida, suggests that these markers may be useful for conducting genetic studies across these species.
The data [and related documentations] that support the findings of this study are openly available in DataSuds repository (IRD, France) at https://doi.org/10.23708/6FCSLF. Data reuse is granted under CC-BY license 4.0.
The authors acknowledge the ISO 9001 certified IRD i-Trop HPC (South Green Platform) at IRD Montpellier for providing HPC resources that have contributed to the research results reported within this paper. We are deeply grateful to the local farmers and communities that helped us collecting plant material in Cameroon and Senegal. We would like to acknowledge the help of two anonymous reviewers and Katharina Budde, the subject editor for Plant Ecology and Evolution, in improving the quality of the manuscript.
List of Cola acuminata and C. nitida individuals used in this study.
Number of alleles (nA) and allelic richness (k = 30) of 14 newly developed microsatellite markers in Cola acuminata across three Cameroonian populations.
Pairwise FST values between the four sites. FST were calculated using the 14 SSR markers for estimations between Cola acuminata populations, and using the 13 SSR markers in common for estimations between C. acuminata / C. nitida populations.
Genetic properties of the newly developed 13 SSRs in Cola nitida (N = 10). nA = number of alleles sampled, He = expected heterozygosity, Ho = observed heterozygosity, FI = inbreeding coefficient with * p value < 0.05 and ** p value < 0.01, AR (k = 8) = allelic richness.