Research Article |
Corresponding author: Martin Xanthos ( m.xanthos@kew.org ) Academic editor: Alexander Vrijdaghs
© 2023 Martin Xanthos, Simon J. Mayo, Isabel Larridon.
This is an open access article distributed under the terms of the Creative Commons Attribution License (CC BY 4.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Citation:
Xanthos M, Mayo SJ, Larridon I (2023) Reassessment of morphological species delimitations in the Cyperus margaritaceus-niveus complex using morphometrics. Plant Ecology and Evolution 156(1): 112-127. https://doi.org/10.5091/plecevo.97453
|
Background and aims – The Cyperus margaritaceus-niveus complex is a group of ten tropical species from sub-Saharan Africa and Madagascar: C. karlschumannii, C. kibweanus, C. ledermannii, C. margaritaceus, C. niveus, C. nduru, C. obtusiflorus, C. somaliensis, C. sphaerocephalus, and C. tisserantii. They are characterised by a capitate head of white-yellow spikelets and modified culm bases and recent molecular analysis puts them in a distinct clade. The group lacks a modern taxonomic revision, and the taxa described in the Flora treatments of the past 50 years differ considerably in their circumscription. In this study, morphometric analyses are used to test species limits to establish more stable morphological delimitations of the taxa.
Material and methods – An examination of 15 morphological characters on 489 herbarium specimens was carried out and the data was analysed using Principal Component Analysis (PCA), Linear Discriminant Analysis (LDA) with cross-validation, and Classification and Regression Tree (CART) analysis. Cyperus kibweanus was not further considered due to lack of material.
Key results – Both PCA and LDA showed varying degrees of overlap in the nine remaining taxa, with no single group clearly separating in multivariate space. However, cross-validation clearly showed C. margaritaceus as a distinct entity despite its overwhelming presence in the PCA. Both LDA and CART failed to separate C. niveus as a distinct group as its specimens were dispersed among the other groups. Differing results were obtained for other taxa depending on the type of analysis. Cyperus margaritaceus, C. nduru, and C. sphaerocephalus were divided into two groups by CART but re-examination of the specimens does not definitively support the idea that these infraspecific groups represent separate taxa.
Conclusions – The results show that eight morphospecies are recognised by LDA and six morphospecies by CART. Characters used to separate the taxa in Flora treatments scored high loadings in the analysis showing their high taxonomic utility value. The methods used can be applied to resolving other complexes in the Cyperaceae.
Cyperaceae, morphometrics, species complex, taxonomy
The Cyperus margaritaceus-niveus complex is here understood to comprise a group of ten species (C. karlschumannii C.B.Clarke, C. kibweanus Duvign., C. ledermannii (Kük.) S.S.Hooper, C. margaritaceus Vahl, C. niveus Retz., C. nduru Cherm., C. obtusiflorus Vahl, C. somaliensis C.B.Clarke, C. sphaerocephalus Vahl, and C. tisserantii Cherm.) distributed throughout sub-Saharan Africa and Madagascar. The group has been considered a species complex since
Current species delimitations for the taxa in this complex are based on classical morphotaxonomy. Despite a considerable overlap of the characters used to delimit the taxa across the regional African Floras (Table
Comparison of the morphological characters used in classical Floras to delineate taxa in the Cyperus margaritaceus-niveus complex.
Flora of West Tropical Africa ( |
Sedges and Rushes of East Africa ( |
Flora of Somalia ( |
Flora of Ethiopia and Eritrea ( |
Flora of Tropical East Africa ( |
Flora Zambesiaca ( |
Rhizomes vs modified stem bases | Rhizomes vs modified stem bases | ||||
Basal sheath texture | |||||
Culm length | Culm length | Culm length | |||
Leaf width | Leaf length and width | Leaf width | |||
Length of involucral bracts | Length of involucral bracts | Length and width of involucral bracts | Length of involucral bracts | ||
No. of involucral bracts | |||||
Spikelet length and width | Spikelet length and width | ||||
No. of spikelets per head | No. of spikelets per head | ||||
Confluent vs discrete spikelets | |||||
Glume length | Glume length | Glume length | |||
Glume shape | Glume apex shape and texture | ||||
Length of anthers and filaments | |||||
Nutlet length and width | Nutlet width | ||||
Nutlet surface | Nutlet surface | Nutlet surface |
Comparison of the taxonomy of the complex and its status in relevant literature.
Flora of West Tropical Africa ( |
Sedges and Rushes of East Africa ( |
Flora of Somalia ( |
Flora of Ethiopia and Eritrea ( |
Flora of Tropical East Africa ( |
Flora Zambesiaca ( |
|
Cyperus karlschumannii | C. karlschumannii | Not present | Not present | Not present | Species of doubtful occurrence | Not present |
Cyperus ledermannii | C. ledermannii | Synonym of C. niveus var. ledermannii | Not present | Not present | Synonym of C. niveus var. leucocephalus | Synonym of C. niveus var. leucocephalus |
Cyperus margaritaceus | C. margaritaceus | C. margaritaceus | Not present | Not present | C. margaritaceus | C. margaritaceus |
Cyperus niveus | Not present | Typical variety not recorded | C. niveus | C. niveus | C. niveus | C. niveus |
Cyperus nduru | C. nduru | Synonym of C. margaritaceus var. nduru | Not present | Not present | C. nduru | C. nduru |
Cyperus obtusiflorus | Not present | Synonym of C. niveus var. leucocephalus | Synonym of C. niveus var. leucocephalus | Synonym of C. niveus var. leucocephalus | Synonym of C. niveus var. leucocephalus | Synonym of C. niveus var. leucocephalus |
Cyperus somaliensis | Not present | Not present | C. somaliensis | Not present | Not present | Not present |
Cyperus sphaerocephalus | Not present | C. niveus var. flavissimus | C. niveus var. flavissimus | Not present | C. flavissimus | C. flavissimus |
Cyperus tisserantii | C. tisserantii | Synonym of C. margaritaceus var. tisserantii | Not present | Synonym of C. niveus var. tisserantii | Synonym of C. niveus var. tisserantii | Synonym of C. niveus var. tisserantii |
Given the differing taxonomic opinions outlined above, in this study, we use the currently accepted species, as presented in the relevant Floras, as our baseline taxa, which function as our hypotheses. Morphometrics is then used as an algorithmic process to test the results of the traditional taxonomic procedure and thereby test the robustness of the authors’ original circumscriptions from the Flora accounts. As such, C. margaritaceus is the most widespread species, occurring in west tropical Africa, central Africa, east Africa, and southern Africa but not in Madagascar. Cyperus karlschumannii, C. ledermannii, and C. tisserantii are native to west tropical Africa, although C. tisserantii extends south into the Democratic Republic of the Congo. Cyperus niveus and C. nduru – the latter characterised by bottle-like plant bases and hardened basal leaf sheaths (Fig.
Representatives of the Cyperus margaritaceus-niveus complex. A. Habitat of Cyperus margaritaceus. B. Cyperus margaritaceus inflorescence. C. Cyperus karlschumannii. D. Cyperus ledermannii. E–F. Cyperus nduru showing the elongated hardened stem bases. G. Unmounted herbarium specimen of C. margaritaceus. Photos A, B, C, F, G by Xander van der Burgt; D, E by Jane Browning.
Morphometrics has been shown to be a useful tool to investigate species limits in closely related taxa (
In this paper, we use multivariate analyses as a tool to support hypotheses concerning the taxa involved in the Cyperus margaritaceus-niveus complex. In the analyses, we include most of the characters used in the relevant Flora treatments (Table
This study aims to provide testable circumscriptions of taxonomic species (see
In this study, 480 specimens and 9 type images were examined from the herbaria B, C, K, LD, and P (Supplementary material
Fifteen morphological characters (12 quantitative and 3 qualitative) were used for the analyses (Table
List of characters measured and scored for morphometric analysis. ªcharacters excluded from PCA and LDA. bcharacters excluded from all analyses due to multiple missing values.
Vegetative characters | |
LFLEN | Leaf length (cm) |
LFWID | Leaf width (mm) |
CULLEN | Culm length (cm) |
CULWID | Culm width (mm) |
NUMBRAC | Number of involucral bracts |
LENBRAC | Length of involucral bracts (cm) |
BSHPERSa | Basal sheath persistency (flattened/fibrous) |
BSHTEXTª | Basal sheath texture (papery/firm/hard) |
BSHGLOSSª | Basal sheath surface (glossy/dull) |
Floral characters | |
NUMSPK | Number of spikelets per inflorescence |
SPKLEN | Spikelet length (cm) |
SPKWID | Spikelet width (mm) |
GLLEN | Glume length (mm) |
GLWID | Glume width (mm) |
NUTLENb | Nutlet length (mm) |
NUTWIDb | Nutlet width (mm) |
NUMGL | Number of glumes per spikelet |
All analyses were carried out in R (
The PCA was carried out on a matrix of the scaled quantitative variables of the imputed data set using the prcomp function from the stats package (loaded automatically with R). The number of significant principal components was computed using the evplot function (
The LDA was carried out using the lda function from the package MASS v.7.3-49 (
Classification and Regression Tree (CART) analysis is a non-parametric data-mining algorithm that can use qualitative and quantitative data sets, either singly or mixed, to classify individuals into pre-defined categories (classification trees) or predict the value of some quantitative trait of interest (regression trees); this procedure is known technically as binary recursive partitioning (
The classification algorithm creates a dichotomous tree, similar to a key, beginning by dividing (partitioning) the complete set of individuals (the root node) into two daughter nodes, and then successively dividing these into subnodes and so on until some stopping criterion is reached. At every node, the algorithm searches each variable separately to find the variable value (threshold) that divides the node set of individuals into the two least heterogeneous subnodes (
To avoid overfitting, the initial tree must be pruned back to the subtree that is considered optimally predictive (
The purpose of the PCA was to show which characters contribute most to the overall variability of the data set, without considering differences between the species.
The PCA did not produce a very favourable dimensional reduction of the data since the first nine axes were needed to capture 95% of the total variance. However, only the first three principal components, which expressed 66% of total variance, were found to be significant according to the mean eigenvalue (Supplementary materials
Principal component analysis (PCA) ordination of the scores of the first two principal components, with 95% confidence ellipses shown for each species. Based on scaled data of 12 quantitative measured variables of specimens from the Cyperus margaritaceus-niveus complex. The colours represent the nine hypothetical species of the complex.
The hypothetical species groups did not have homogeneous covariance matrices and so assessment of their relative distinctness was based mainly on the results of the cross-validation tests.
The LDA of the non-scaled data expressed 78.62% of the total variance in the first two discriminant axes. When compared to the PCA, the ordination of these axes showed slight improvement in the separation of the nine taxa (Fig.
Linear discriminant analysis (LDA) ordination of the first two discriminant axes, with 95% confidence ellipses shown for each species. Based on scaled data of 12 quantitative measured variables of specimens from the Cyperus margaritaceus-niveus complex. The colours represent the nine hypothetical species of the complex.
The loadings determine which characters are the most influential in separating the species on the first and second discriminant axes (Supplementary materials
The LDA cross-validation was carried on the untransformed data set. This procedure gives a better guide to the distinctiveness of the species than the ordinations, because the algorithmic allocation of each individual to a group uses all the discriminant function axes, whereas the ordinations only show two-dimensional patterns. In Table
Cross validation results of LDA using non-standardised data. Rows show the original population memberships, while columns show the composition of the cross-validated populations. Numbers in bold show the number of specimens from the original taxa that are assigned to the cross-validated populations. Row sums show the total number of original individuals per taxon, and the percentage correct value shows the number of correctly assigned individuals to their original taxon. Column sums show the number of original individuals assigned to each cross-validated population, and the ‘% comp.’ value is the percentage of individuals in the cross-validated population that belong to the original population with the same name.
C. karlschumannii | C. ledermannii | C. margaritaceus | C. nduru | C. niveus | C. obtusiflorus | C. somaliensis | C. sphaerocephalus | C. tisserantii | SUM | % correct | |
C. karlschumannii | 9 | 0 | 5 | 0 | 0 | 0 | 0 | 0 | 0 | 14 | 64.3 |
C. ledermannii | 0 | 9 | 0 | 0 | 0 | 1 | 0 | 3 | 0 | 13 | 69.2 |
C. margaritaceus | 9 | 3 | 175 | 5 | 2 | 3 | 0 | 2 | 2 | 201 | 87.1 |
C. nduru | 0 | 0 | 5 | 44 | 0 | 0 | 0 | 1 | 10 | 60 | 73.3 |
C. niveus | 0 | 0 | 4 | 1 | 0 | 2 | 0 | 1 | 4 | 12 | 0.0 |
C. obtusiflorus | 0 | 0 | 4 | 2 | 2 | 50 | 0 | 10 | 4 | 72 | 69.4 |
C. somaliensis | 0 | 0 | 0 | 1 | 0 | 0 | 3 | 0 | 2 | 6 | 50.0 |
C. sphaerocephalus | 0 | 3 | 5 | 3 | 1 | 5 | 0 | 45 | 2 | 64 | 70.3 |
C. tisserantii | 0 | 0 | 4 | 6 | 0 | 1 | 0 | 0 | 36 | 47 | 76.6 |
SUM | 18 | 15 | 202 | 62 | 5 | 62 | 3 | 62 | 60 | ||
% comp. | 50 | 60 | 86.6 | 71.0 | 0 | 80.6 | 100 | 72.6 | 60 |
The results showed that C. margaritaceus was the most consistent species, with 87.1% of the originally determined individuals correctly assigned to the resulting cross-validated group with the same name, and 86.6% of the individuals of this group belonging to the species (Table
Three optimal subtrees were identified that had a relative error below the critical threshold (Supplementary material
As with the LDA cross-validation, the species with the largest number of individuals assigned to a terminal group provides the name of that group. This resulted in no terminal groups for C. ledermannii, C. somaliensis, and C. niveus, and two each for C. margaritaceus, C. nduru, and C. sphaerocephalus (Fig.
Classification tree after one cycle of cost complexity pruning. The terminal nodes are the result of assignment by cross-validation. The terminal cross-validated groups are named by the species with the largest number of individuals assigned to that group; the numbers separated by a slash represent on the left the number of individuals originally from the species of the leaf name, and on the right the total number of individuals assigned to that leaf by CART; the percentages represent the proportion of the total number of individuals of the study assigned to that leaf. Each node is marked by the character that provides the optimal binary split of the individuals at that node; the logical statement at each node (e.g. glume width at the root node) indicates that individuals for which the statement is true pass to the left-hand branch and those for which it is false to the right-hand branch.
An overview of the assignments of individuals to the cross-validated groups resulting from the CART analysis are given in Table
Cross validation results of CART using non-standardised data. Rows show the original population memberships, while columns show the composition of the cross-validated populations. Numbers in bold show the number of specimens from the original taxa that are assigned to the cross-validated populations. Row sums show the total number of original individuals per taxon, and the percentage correct value shows the number of correctly assigned individuals to their original taxon. Column sums show the number of original individuals assigned to each cross-validated population, and the ‘% comp.’ value is the percentage of individuals in the cross-validated population which belong to the original population with the same name.
karl.crt | marg1.crt | marg2.crt | nduru1.crt | nduru2.crt | obtus.crt | sphaer1.crt | sphaer2.crt | tisser.crt | SUM | % correct | |
C. karlschumannii | 10 | 4 | 0 | 0 | 0 | 0 | 0 | 0 | 0 | 14 | 71.4 |
C. ledermannii | 0 | 0 | 0 | 0 | 0 | 8 | 2 | 2 | 1 | 13 | 0.0 |
C. margaritaceus | 5 | 175 | 5 | 6 | 0 | 3 | 2 | 3 | 2 | 201 | 87.0 |
C. nduru | 0 | 0 | 0 | 36 | 18 | 0 | 0 | 2 | 4 | 60 | 90 |
C. niveus | 0 | 3 | 0 | 0 | 0 | 2 | 0 | 3 | 4 | 12 | 0.0 |
C. obtusiflorus | 0 | 3 | 4 | 0 | 1 | 47 | 5 | 7 | 5 | 72 | 65.3 |
C. somaliensis | 0 | 0 | 0 | 0 | 2 | 0 | 0 | 0 | 4 | 6 | 0.0 |
C. sphaerocephalus | 0 | 8 | 0 | 0 | 1 | 5 | 26 | 21 | 3 | 64 | 73.4 |
C. tisserantii | 0 | 9 | 0 | 3 | 2 | 1 | 0 | 3 | 29 | 47 | 61.7 |
SUM | 15 | 202 | 9 | 45 | 24 | 66 | 36 | 41 | 52 | ||
% comp. | 66.7 | 86.6 | 55.6 | 80.0 | 75.0 | 71.2 | 72.2 | 51.2 | 55.8 |
Nevertheless, six of the hypothetical taxa that formed a cross-validated group bearing the same name had high percentages of correct assignments; C. margaritaceus and C. nduru with the highest percentages, 87% and 90% respectively. Of the three original taxa that failed to come out as separate groups, two of them had many of their individuals assigned to one group. Cyperus ledermannii had 62% of its individuals, including the type, assigned to the obtus.crt group. Cyperus somaliensis had 67% of its individuals, including the type, assigned to the tisser.crt group. In the case of C. niveus, not only did the original taxon fail to separate but none of its individuals received majority representation in any cross-validated group. The type was assigned to the obtus.crt group. Only four of the type specimens were assigned to a cross-validated group bearing the same name: C. margaritaceus, C. karlschumannii, C. nduru, and C. tisserantii. For the remaining two taxa, although these come out as their own cross-validated groups, the types were assigned elsewhere. Thus, C. obtusiflorus was assigned to tisser.crt and C. sphaerocephalus was assigned to nduru2.crt.
Within the Cyperus margaritaceus-niveus complex, the analyses classify the individuals, originally assigned to nine species, into eight groups in the LDA and six groups in the CART analysis. The two results we wish to highlight are the failure of C. niveus to separate as a group in both analyses and the retention of C. margaritaceus as a distinct entity mixed together with a small number of individuals of other species. In the PCA, although the plots for C. niveus and C. sphaerocephalus are embedded within the overall scatter plot, C. sphaerocephalus is still separated by LDA and CART, while C. niveus is not separated by either analysis. In both the LDA and CART cross-validation, the 12 specimens of C. niveus were distributed among several taxa, although without consistency between the two analyses (Tables
Cyperus margaritaceus shows considerable overlap with the other taxa in the PCA (Fig.
Cyperus nduru was treated as a variety of C. margaritaceus by
The taxonomic status of C. tisserantii differs amongst the regional Floras (Table
The analyses show strong support for C. karlschumannii as a distinct entity. This species can be easily distinguished from the other taxa in the complex by the considerably longer ovate spikelets, ranging from white to pale-yellow and with broader glumes.
The analyses differ as to the identity of C. ledermannii. While the LDA treats it as a separate entity, the CART analysis places most of the C. ledermannii individuals in the obtus.crt group.
Cyperus sphaerocephalus, known as the golden headed sedge in South Africa, was long considered as C. obtusiflorus var. flavissimus, until
The entity of C. somaliensis also differs between the analyses. While the LDA distinguishes this species as a distinct entity, the CART analysis shows little consistency with regards to assigning individuals. This disparity may be due to the comparative lack of specimens used that belong to this taxon; only six specimens of C. somaliensis were used in the analyses. When
The assignment of type specimens to different cross-validated groups across the analyses makes their classification difficult. Types may not be typical of the specimens assigned to their particular species, however, missing character values may alter their true position in the multispace and hence affect their assignment in the cross-validation.
For the complex as a whole, characters with the highest loadings (glume width, number of bracts, glume length, and number of spikelets per head) were also used by
In summary, of the nine putative species studied, six morphospecies can be recognised based on our results: C. karlschumannii, C. margaritaceus, C. nduru, C. obtusiflorus, C. sphaerocephalus, and C. tisserantii. Each of the six morphospecies is distinguished by a combination of morphological characters and geographical distribution. The picture is more complex for C. ledermannii and C. somaliensis, given the differences between the LDA and CART analysis. Further work is desirable to elucidate the taxonomic affinities of these two taxa. None of the analyses considered C. niveus to be a separate entity for Africa. We have also shown that the morphological characters used in classical taxonomic studies for this complex are of sufficient taxonomic value to delimit similar morphogroups as presented in the regional Floras. The analyses used in this study provide a transparent and repeatable methodology for justifying the recognition of morphology-based taxa within this complex, which also enables morphological characters, not included in this study (e.g. nutlet dimensions, anther length), to be added to our dataset to corroborate our results. We recommend the application of this methodology to resolve other known species complexes in the Cyperaceae.
The R analysis script is deposited at
Barnaby Walker and Liam Trethowan are thanked for helpful discussion on R packages and providing guidance on installation and usage. The first author is grateful to Watchara Arthan for providing R scripts for producing the map. The authors are grateful to Jane Browning and Xander van der Burgt for allowing permission of their images for Fig.
Accessions used in the study and their morphological characters.
Eigenvalues of the principal component analysis shown in Fig.
Character loadings of the first (A) and second (B) discriminant axes based on the scaled data of the Linear Discriminant Analysis.
Loadings (eigenvectors) of the linear discriminant analysis shown in Fig.