Blackwell Science, LtdOxford, UKBIJBiological Journal of the Linnean Society0024-4066The Linnean Society of London, 2003? 2003
79?
565576
Original Article
WOLFFIELLA PHYLOGENETICS and BIOGEOGRAPHY
R. T. KIMBALL
Biological Journal of the Linnean Society, 2003, 79, 565–576. With 3 figures
Out of Africa: molecular phylogenetics and biogeography
of Wolffiella (Lemnaceae)
REBECCA T. KIMBALL1*, DANIEL J. CRAWFORD2, DONALD H. LES3 and
ELIAS LANDOLT4
1
Department of Zoology, University of Florida, PO Box 118525, Gainesville, Florida 32611, USA
Department of Ecology and Evolutionary Biology & The Museum of Natural History and Biodiversity
Center, University of Kansas, Lawrence, Kansas 66045, USA
3
Department of Ecology and Evolutionary Biology, University of Connecticut, Storrs, Connecticut 06269–
3043, USA
4
Geobotanisches Institut ETH, Zürichbergstrasse 38, CH-8044, Zürich, Switzerland
2
Received 5 August 2002; accepted for publication 29 January 2003
The monophyletic genus Wolffiella (Lemnaceae) comprises 10 species divided taxonomically into three sections. Relative to other genera of Lemnaceae, Wolffiella has a restricted range, with species distributed in warm temperate to
tropical areas of Africa and the Americas, with only one species occurring in both areas. Sequence data from coding
(rbcL and matK) and non-coding (trnK and rpl16 introns) regions of cpDNA were analyzed phylogenetically to
resolve relationships within Wolffiella, and these results were compared to earlier allozyme and morphological studies. Allozymes, cpDNA and morphology all supported the recognition of three sections. Relationships among species
were similar in most respects between the allozyme and cpDNA trees, as well as among the different plastid partitions. In Wolffiella, both non-synonymous and synonymous substitutions were greater in matK than in rbcL, as
observed in other taxa. The synonymous substitution rate in matK was similar to the substitution rate of the noncoding regions. All partitions, including coding regions, exhibited some homoplasy. Biogeographical reconstructions
from a combination of cpDNA partitions indicated that Wolffiella originated in Africa with early movement to and
radiation in the Americas. The one species found in both Africa and the Americas, W. welwitschii, likely originated
in the Americas and subsequently dispersed to Africa. Using the SOWH test, the cpDNA data could reject two alternative biogeographical hypotheses suggested from analyses of morphological and allozyme data. The present distribution of Wolffiella can be explained by two major dispersal events and this contrasts with the more complex
species distributions in other Lemnaceae genera. Limited dispersal in Wolffiella relative to other Lemnaceae genera
may be due to more recent origins of species, lower dispersibility and poorer colonizing ability. © 2003 The Linnean
Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576.
ADDITIONAL KEYWORDS: allozymes – duckweeds – matK – rbcL – rpl16.
INTRODUCTION
The duckweed family (Lemnaceae) includes the smallest of all flowering plants. These cosmopolitan aquatic
monocots are reduced to tiny thalloid fronds that float
on or below the surface of the water (Landolt, 1986).
Although all species produce flowers, the frequency of
flowering differs among the species (Landolt, 1986),
*Corresponding author. E-mail: rkimball@zoo.ufl.edu
and reproduction in the family is thought to be primarily vegetative. Wolffiella is unique biogeographically
among genera of Lemnaceae in being restricted to
warm temperate, subtropical and tropical areas in the
Americas and Africa, with a relatively recent introduction of one species (W. hyalina) into India (Landolt,
1986). In contrast, other Lemnaceae genera have one
or more species distributed widely in temperate zones
(Landolt, 1986). The biogeographical origin of
Wolffiella is uncertain. Allozyme analyses indicated an
African origin (Crawford et al., 1997), although
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576
565
566
R. T. KIMBALL ET AL.
Landolt (1986) concluded that the origins were likely
in South America. In addition, it is not known whether
there have been one or multiple dispersal events
between the Old and New Worlds.
The family as currently recognized consists of 37
species in five genera: Landoltia (one species), Lemna
(13 species), Spirodela (two species), Wolffia (11 species) and Wolffiella (ten species) (Table 1; Les & Crawford, 1999; Landolt, 2000; Les et al., 2002). The
taxonomic history of Wolffiella, as reviewed by Landolt
(1986), has been intertwined with that of Wolffia, the
other genus of the subfamily Wolffioideae. Landolt
(1986) initially circumscribed Wolffiella as comprising
nine species in three sections, with an additional species being described later (Landolt, 1992). Phylogenetic analyses of morphological and anatomical
characters (Les, Landolt & Crawford, 1997b) indicated
monophyly of the sections in Wolffiella but paraphyly
of the genus. However, a tree five steps longer could
resolve Wolffiella and the other genera as being monophyletic. A recent molecular phylogenetic study (Les
et al., 2002) provided strong support for both the
monophyly of the sections in Wolffiella and the monophyly of the genus (amended to include W. caudata as
recognized by Landolt, 1992).
In this study, we examine phylogenetic relation-
Table 1. Accessions used in analyses
Species
rbcL
matK, trnK
rpl16
Wolffiella
Section Wolffiella
W. caudata Landolt
9158 (Bolivia)
9173 (Bolivia)
W. denticulata (Hegelm.) Hegelm.
W. gladiata (Hegelm.) Hegelm.
8221 (S. Africa)
8261 (USA)
8221
8261
W. lingulata (Hegelm.) Hegelm.
7289 (Brazil)
7289
W. neotropica Landolt
8848 (Brazil)
8848
W. oblonga (Phil.) Hegelm.
8984 (Columbia)
8984
W. welwitschii (Hegelm.) Monod
7468 (Columbia)
7468
9158*
9214 (Bolivia)
8221
7173 (USA)
8768 (USA)*
7289*
7655 (Mexico)
7290 (Brazil)
8848*
7997 (Brazil)*
8072 (USA)
8393 (USA)
8984
7468*
9096 (Zimbabwe)
8640 (Tanzania)
8640
9122 (Zimbabwe)
9122
Section Rotundae
W. rotunda Landolt
9121 (Zimbabwe)
9121
9072 (Zimbabwe)*
9121
Wolffia
W. australiana (Benth.) Hartog & Plas
W. borealis (Englem.) Landolt
W. brasiliensis Wedd.
W. microscopica (Griff.) Kurz
7733
9123
8743
8359
7733
9123
8743
8359
7631 (Australia)
9123
9134 (Brazil)
8359
Section Stipitatae
W. hyalina (Del.) Monod
W. repanda (Hegelm.) Monod
(Australia)
(USA)
(Argentina)
(India)
7376 (Egypt)
8640*
9054 (Zimbabwe)
9062 (Zimbabwe)*
9104 (Botswana)
9107 (Botswana)
9122
Numbers are those of E.L. with vouchers in ZT. Geographic locality is indicated in parentheses for the first listing for each
accession.*Accession used in combined analyses (and in Les et al., 2002).
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576
WOLFFIELIA PHYLOGENETICS AND BIOGEOGRAPHY
ships in Wolffiella using cpDNA sequence data from
all species. For most species, multiple, geographically
divergent accessions were included. The phylogenetic
trees of Wolffiella obtained from different cpDNA partitions were evaluated for congruence. Aspects of
molecular evolution were evaluated for each data partition in order to better understand and to interpret
instances of incongruence. A well-supported phylogeny for the genus is presented and is used to reconstruct the biogeographical history of Wolffiella.
Alternative biogeographical hypotheses from earlier
allozyme and morphological studies are then examined to better clarify the evolution of this unusual
Lemnaceae genus.
MATERIAL AND METHODS
DNA
AMPLIFICATION AND SEQUENCING
Four regions of the chloroplast genome were used,
including two protein coding loci (rbcL, matK) and
two intron regions (rpl16, trnK – combining both the
5¢- and the 3¢ regions). Previously published data were
supplemented by sequencing additional accessions for
the rpl16 intron. PCR amplification of the rpl16
intron was performed using primers F71 (Jordan,
Courtney & Neigel, 1996) and R622 (Les et al., 2002).
PCR reactions were carried out using standard protocols, and products were cleaned using either
QIAquick PCR purification columns (Qiagen, Inc.,
Valencia, CA) or by precipitation using an equal volume of PEG/NaCl (20%/2.5 M). Sequencing of the
rpl16 intron was conducted using the amplification
primers. Cycle sequencing reactions ( 1/4 or 1/2 volumes)
were performed using the BigDye Terminator kit (PE
Applied Biosystems, Foster City, CA) and by following
the standard protocol provided for the ABI Prism 310
automated sequencer (PE Applied Biosystems, Foster
City, CA). Sequence chromatographs were edited
manually and assembled into double-stranded
contigs.
Sequences were aligned initially in Clustal W
(Thompson, Higgins & Gibson, 1994), then manually
optimized by visual inspection. Alignments were
trimmed to exclude highly variable regions (e.g. near
exon 1 of rpl16) where positional homology was difficult to establish. GenBank accession numbers for all
molecular data, including our previously published
(Les et al., 2002) and new sequences are: AY034200AY034209 (rbcL), AY034316-AY034325 (trnK 3¢
intron), AY034355-AY034364 (trnK 5¢ intron),
AY034200-AY034209 (matK), AY034277-AY034286
and AY131184-AY131197 (rpl16 intron).
We used DNA sequence data from each species of
Wolffiella; four species in the genus Wolffia (W. australiana, W. borealis, W. brasiliensis and W. microscopica;
567
Table 1) were included as outgroups. For the rpl16
intron, two or more independent accessions were
sequenced for each species except Wolffiella denticulata; when possible, additional accessions were sampled from geographically distinct regions (Table 1).
PHYLOGENETIC
ANALYSES
Phylogenetic analyses were performed using default
values in PAUP* 4.0b8 (Swofford, 1999) unless noted
otherwise. To obtain the most parsimonious (MP) tree
using equally weighted parsimony, a heuristic search
was performed with 100 random sequence additions
and tree bisection–reconnection (TBR) branch swapping. The reliability of specific taxon groupings under
parsimony was examined using 1000 bootstrap replicates with ten random sequence additions per replicate. For parsimony analyses, data were analyzed both
by treating indels as missing data and by including
indels as characters following the ‘simple indel coding’
method of Simmons & Ochoterena (2000).
To determine the appropriate evolutionary models
for maximum likelihood (ML) analyses we used the
hierarchical likelihood ratio test as implemented in
MODELTEST 3.04 (Posada & Crandall, 1998). Parameters used in ML analyses were those recommended
by MODELTEST. To compare parameters across data
partitions, the transition/transversion ratio and the
shape parameter (a) of a gamma distribution were
also estimated for all four partitions using HKY85 + G
(Hasegawa–Kishino–Yano model with gamma distributed rates) and the MP topology obtained from the
combined dataset.
To determine whether the different plastid partitions were concordant, we performed the partition
homogeneity test (incongruence length difference test
(ILD); Farris et al., 1995). Tests were performed using
only the informative sites, with 1000 replicates and 10
random sequence additions per replicate.
To estimate the degree of sequence divergence
among the four partitions (rpl16, trnK, matK and
rbcL), the sums of the branch lengths from the ML
trees for each partition were obtained. In addition, pdistances were estimated for the two non-coding partitions. Non-synonymous and synonymous p-distances
were estimated using the method of Nei & Gojobori
(1986), as implemented in MEGA 1.02 (Kumar,
Tamura & Nei, 1993). Corrected non-synonymous and
synonymous distances were estimated using the
method of Yang & Nielsen (2000), as implemented in
PAML 3.12 (Yang, 2002).
TESTS
OF BIOGEOGRAPHICAL HYPOTHESES
We used the SOWH test (Swofford et al., 1996;
Goldman, Anderson & Rodrigio, 2000) to examine
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576
568
R. T. KIMBALL ET AL.
specific biogeographical hypotheses for Wolffiella. This
approach compared a test statistic, 2d (two times the
difference in likelihood values), for the ML tree estimated from cpDNA data with that of an alternative
topology. The two alternative topologies (representing
alternative biogeographical hypotheses) were those
indicated by analyses of either morphology (Landolt,
1986, 1992) or allozymes (Crawford et al., 1997). To
determine whether an alternative topology was less
likely statistically than was the cpDNA ML topology, a
null distribution of the test statistic was generated
using 500 simulated data sets. For each topology
tested, we simulated data sets based upon the complete cpDNA data set using Seq-Gen 1.1 (Rambaut &
Grassly, 1997). Parameters and branch length information were estimated using the alternative ML
topology being tested in that specific SOWH test. For
each of the 500 simulated data sets, a heuristic search
was used to find the ML tree (as performed on the raw
data above). Parameter estimates for each simulated
data set were estimated using ML and the topology of
the alternative tree being considered, as recommended by Goldman et al. (2000). The 2d test statistic
generated from the simulated data sets was used to
establish the null distribution for this statistic. We
rejected the null hypothesis (that the ML and the
alternative topology did not differ significantly) if
fewer than 5% of the simulated data sets had 2d values greater than the observed 2d value.
RESULTS
MOLECULAR
EVOLUTION
The variability of the partitions differed, with the two
non-coding partitions having similar levels of variability and being only slightly more variable than matK
(Table 2). By summing the ML branch lengths across
the most likely tree for each partition (Table 2), which
gives a divergence rate corrected for homoplasy, the
trnK and rpl16 introns appeared to be diverging most
rapidly, with rbcL diverging much more slowly.
To compare the rate of divergence at sites that
should not be under selection, we compared pdistances among the synonymous sites in matK and
rbcL with the non-coding sequences of rpl16 and trnK.
Synonymous sites in matK have evolved the most rapidly, accumulating substitutions at 2.0, 1.6 and 1.5
times the rate of rbcL, trnK and rpl16, respectively.
Non-synonymous substitutions have also accumulated
much more rapidly in matK than in rbcL, occurring
about 4.8 times faster than the rate in rbcL. The partitions also differed in the ratio of transitions to transversions and the shape parameter of a gamma
distribution (Table 2), indicating different patterns of
molecular evolution. In general, matK, trnK and rpl16
were the most similar, while rbcL was generally quite
different.
PHYLOGENETIC
RELATIONSHIPS
Analyses including indels as characters provided similar topologies for each partition (data not shown),
with little or no improvement in resolution and consistency compared with analyses treating gaps as
missing data (Table 3). Therefore, we elected to show
our results with gaps treated as missing data.
The ILD test indicated that the four cpDNA partitions were not significantly incongruent (P = 0.11).
Some of the observed incongruence was distributed
among the outgroup taxa, and their removal increased
congruence of the different plastid partitions substantially (P = 0.40). Because significant incongruence was
not observed, we analyzed the data partitions separately and in combination.
The phylogenetic tree estimated from all four parti-
Table 2. Parameters of different molecular data partitions
Parameter
rbcL
matK
trnK
rpl16
No. of sites
% variable sites
% informative sites
Sum ML branches
Best model*
Shape parameter (a)†
ti/tv†
No. of gap characters
1348
5.8
2.9
0.08707
HKY85 + G + I
0.01
0.94
–
1548
15.9
6.7
0.21412
F81 + G
0.34
0.59
4
1017
17.1
8.4
0.26820
F81 + G
0.23
0.56
49
460
17.6
8.7
0.29567
F81 + G
0.21
0.45
20
*Determined using MODELTEST 3.04. G = incorporates site-to-site rate heterogeneity using a gamma distribution,
I = incorporates invariant sites.
†Estimated using HKY85 + G and the topology of the most parsimonious tree from the combined analysis.ti/tv = transition/
transversion ratio.
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576
WOLFFIELIA PHYLOGENETICS AND BIOGEOGRAPHY
569
Table 3. Results of analysis on different molecular data partitions
Parameter
rbcL
matK
trnK
rpl16
Gaps as missing data
No. of MP trees
CI, excluding uninformative
5
0.65
3
0.72
3
0.75
1
0.77
With gap matrix
No. of MP trees
CI, excluding uninformative
–
–
3
0.72
1
0.70
4
0.76
With rapidly evolving sites removed
No. of MP trees
CI, excluding uninformative
1
0.75
3
0.74
6
0.80
1
0.78
MP = most parsimonious.
100
85
98
99
75
100
94
100
99
94
100
--100
100
90
83
----83
74
95
--66
--95
82
75
--58
---
98
34
97
78
XX
Distribution
W. lingulata
NA, SA
W. oblonga
NA, SA
W. gladiata
NA
W. caudata
SA
W. neotropica
SA
Section
Wolffiella
W. welwitschii AF, SA
100
95
100
100
100
100
77
100
99
70
W. denticulata
AF
W. hyalina
AF
W. repanda
AF
W. rotunda
AF Section Rotundae
Section
Stipitatae
Figure 1. Most parsimonious tree using the combined sequence partitions, rooted with Wolffia (outgroup taxa not shown).
Values at nodes represent percent of 1000 parsimony bootstrap replicates. Top value is from analysis of the combined
sequence partitions, followed by analysis of rbcL, matK, trnK intron and rpl16 intron. Dashes indicate that the node was
not found, but that alternative placements of those taxa were supported by fewer than 50% of bootstrap replicates; XX
indicates that the node was not found, and that alternative placements of the taxa were supported by more than 50% of
bootstrap replicates. Distribution: AF = Africa, NA = North America, SA = South America.
tions combined gave a single well-resolved MP tree (CI
excluding uninformative sites = 0.72) with greater
than 70% bootstrap support at all nodes (Fig. 1). There
was strong support for the monophyly of the three sections recognized by Landolt (1986) based on morphology and anatomy, and also by Crawford et al. (1997)
based on allozyme analyses (Fig. 2). However, cpDNA
sequence data indicated slightly different relationships within Section Wolffiella than those suggested
by previous studies (cf. Figs 1, 2; see also Les et al.,
2002).
When the different cpDNA sequence partitions were
analysed independently, slightly different topologies
were obtained. These differences were primarily due
to a lack of resolution among particular nodes (matK,
trnK and one relationship in rpl16). In rbcL, W. caudata was placed basal to W. denticulata, though with
less than 50% bootstrap support. Only in rpl16 was a
topological difference supported, and this involved
60% bootstrap support for a clade containing
W. caudata and W. oblonga. However, within each partition, there was no conflict between the strict consensus of all MP trees and the ML tree. Therefore, only
the parsimony results are shown. Trees constructed
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576
570
R. T. KIMBALL ET AL.
Distribution
(a)
W. gladiata
NA
W. oblonga
NA, SA
W. lingulata
NA, SA
W. caudata
SA
W. denticulata
AF
W. welwitschii
AF, SA
W. neotropica
SA
W. hyalina
AF
W. repanda
AF
W. rotunda
AF
W. lingulata
NA, SA
W. oblonga
NA, SA
W. gladiata
NA
W. welwitschii
(b) 0.00
0.25
0.50
0.75
AF, SA
W. caudata
SA
W. neotropica
SA
W. denticulata
AF
W. hyalina
AF
W. repanda
AF
W. rotunda
AF
Section
Wolffiella
Section
Stipitatae
Section Rotundae
Section
Wolffiella
Section
Stipitatae
Section Rotundae
1.00
Figure 2. Previous hypotheses of relationships in Wolffiella. (a) Redrawn from Landolt (1986) with the placement of
W. caudata based on comments of Landolt (1992); (b) Phenogram (UPGMA) based on genetic identity at allozyme loci.
Scale across bottom indicates genetic identity. Redrawn from Crawford et al. (1997). Designations of geographical distributions are the same as in Fig. 1.
from individual partitions had few conflicts with the
tree estimated from combined data (Fig. 1). The phylogeny estimated from rbcL differed most from the
combined data tree (Fig. 1) in that W. caudata rather
than W. denticulata was basal within Section
Wolffiella. One other difference found in analyses of
rpl16 data was that one accession of W. oblonga
formed a clade with W. caudata rather than with
W. lingulata (Fig. 1). However, neither of these conflicting nodes was supported by a bootstrap value
greater than 50% (Fig. 1). Interestingly, the CIs of the
non-coding regions were higher than those for either
coding region (Table 3).
When comparing the number of parsimony steps
across the best tree (obtained using all four partitions
combined), a small proportion of sites in each partition
appeared to be evolving rapidly, accumulating three to
four steps across the tree. Although the proportion of
rapidly evolving sites was small, there was variation
among partitions (1.08% rpl16, 1.28% trnK, 0.32%
matK, 0.52% rbcL). These sites were not confined to
third positions in the two coding partitions, as might
have been expected. In particular, the seven rapidly
evolving sites for rbcL included both a first and a second position, resulting in non-synonymous substitutions in each case. Since rapidly evolving sites may
exhibit high levels of homoplasy, we reanalyzed the
data after removal of these sites. With the rapidly
evolving sites removed, the CI of all partitions
increased (Table 3), suggesting such sites did exhibit
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576
WOLFFIELIA PHYLOGENETICS AND BIOGEOGRAPHY
homoplasy. For rbcL, this altered the topology of the
MP tree so that it was congruent with the tree
obtained from the combined data set (Fig. 1). However,
analysis of the more slowly evolving first, second or
first and second positions of rbcL combined did not
result in a topology congruent with Figure 1. For the
other partitions, removal of rapidly evolving sites did
not result in topological changes.
The larger rpl16 alignment, which included multiple accessions for most species, was very similar in
base composition and parameters to the smaller alignment (Tables 2, 3). The alignment contained 461 sites,
99
95
60
58
60
571
of which 18% were variable and 8% were parsimony
informative. As with the smaller rpl16 alignment, the
best model was F81 + G. To compare this alignment
with the other data partitions, we used HKY85 + G.
The shape parameter was estimated to be 0.22 and the
transition/transversion ratio was 0.43. Analysis of this
larger data set produced three MP trees with a CI
(excluding uninformative sites) of 0.78. As may occur
when analyzing alignments of relatively short
sequences (Fehrer, 1996), bootstrap values were low at
many nodes (Fig. 3), even when sequence identity was
high. For example, the two accessions of W. neotropica
W. lingulata 7289 *
Brazil
W. lingulata 7655
Mexico
W. oblonga 8393
USA
W. oblonga 8984
Columbi a
W. oblonga 8072
USA
W. oblonga 7997 *
Brazil
W. caudata 9158 *
Bolivia
W. caudata 9214
Bolivia
W. gladiata 8768 *
USA
W. gladiata 7173
USA
W. neotropica 8848 * Brazil
55
93
W. neotropica 7290
Brazil
W. welwitschii 9096
Zimbabwe
93
W. welwitschii 7468 * Columbi a
W. denticulata 8221 * S. Africa
73
70
63
100
98
W. hyalina 8640 *
Tanzania
W. hyalina 7376
Egypt
W. repanda 9062 *
Zimbabwe
W. repanda 9054
Zimbabwe
W. repanda 9122
Zimbabwe
W. repanda 9104
Botswana
W. repanda 9107
Botswana
W. rotunda 9072 *
Zimbabwe
W. rotunda 9121
Zimbabwe
Figure 3. Phylogeny estimated using the complete rpl16 alignment with multiple accessions for most species, rooted with
Wolffia (outgroup taxa not shown). Asterisks identify the accession included in the combined analysis above. Geographic
locality of each accession is given. Values at nodes represent per cent of 1000 parsimony bootstrap replicates. Nodes with
less than 50% bootstrap support were collapsed.
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576
572
R. T. KIMBALL ET AL.
Table 4. Number of nucleotide differences (excluding indels) in rpl16 sequences between two groups of closely related
species
W. caudata 9158
W. caudata 9214
W. gladiata 7173
W. gladiata 8768
W. lingulata 7289
W. lingulata 7655
W. oblonga 7997
W. oblonga 8072
W. oblonga 8393
W. oblonga 8984
W. hyalina 7376
W. hyalina 8640
W. repanda 9054
W. repanda 9062
W. repanda 9104
W. repanda 9107
W. repanda 9122
W. rotunda 9072
W. rotunda 9121
W. caudata
W. gladiata
W. lingulata
W. oblonga
9158
9214
7173
8768
7289
7655
7997
8072
8393
8984
–
2
2
2
6
6
2
4
6
6
–
2
2
6
6
2
4
6
6
–
0
4
4
2
2
4
4
–
4
4
2
2
4
4
–
0
6
6
0
0
–
6
6
0
0
–
4
6
6
–
6
6
–
0
–
W. hyalina
W. repanda
7376
8640
9054
9062
–
0
1
2
6
6
1
5
5
–
1
2
6
6
1
5
5
–
1
7
7
0
6
6
–
8
8
1
7
7
were identical, yet W. neotropica was supported in
only 55% of bootstrap replicates (Fig. 3) and the two
identical accessions of W. gladiata did not even form a
clade (Fig. 3).
In addition to instances of similar rpl16 sequences
that did not cluster together, there were species with
some divergent accessions that showed a high degree
of similarity to other species (Fig. 3; Table 4). For
example, two accessions of W. oblonga (8393 and 8984)
were identical to each other and also to both accessions of W. lingulata. Yet these two accessions differed
from putative conspecifics (7997 and 8072) by six
nucleotide substitutions. Wolffiella oblonga (7997)
showed greater similarity to two accessions of W. caudata (Table 4), whereas W. oblonga (8072) was not
closely related to any other accession.
BIOGEOGRAPHY
Different biogeographical hypotheses were indicated
when relationships within Wolffiella were estimated
using different types of data (morphology, allozymes,
W. rotunda
9104
9107
9122
9072
9121
–
6
6
–
0
–
–
0
7
11
10
–
7
11
10
cpDNA). Morphological data (Fig. 2a) indicated that
Wolffiella originated either in Africa (distribution of
Sections Stipitatae and Rotundae) or South America
(distribution of W. neotropica). Whether an African or
South American origin is assumed, dispersal between
these two continents would have occurred at least
three times if the morphological tree were correct. In
contrast, both the allozyme and the cpDNA data
(Figs 1, 2b) indicated an African origin, followed by
two dispersal events between Africa and South America. The allozyme and cpDNA hypotheses differ in that
the allozyme data suggest that W. welwitschii originated in South America, and then dispersed to Africa
while the combined cpDNA data are equivocal on this
point (Fig. 1; but see Fig. 3). Regardless of the topology, there have been at least three dispersal events
between North and South America, assuming both
W. lingulata and W. oblonga are each monophyletic.
Results of the SOWH test indicated that the phylogeny estimated from the cpDNA data (Fig. 1), and the
biogeographical hypothesis it supported, were significantly more likely than those supported by either
morphological data (2d = 249.27, 2d-critical = 11.84,
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576
WOLFFIELIA PHYLOGENETICS AND BIOGEOGRAPHY
P < 0.002) or allozyme data
critical = 5.21, P < < 0.002).
(2d = 57.817,
2d-
DISCUSSION
MOLECULAR
EVOLUTION
This study is similar to other published results (e.g.
Steele & Vilgalys, 1994; Johnson & Soltis, 1995;
Manos & Steele, 1997; Xiang, Soltis & Soltis, 1998) in
showing that matK is more variable and diverges
more rapidly than rbcL for both synonymous and nonsynonymous substitutions. Although rbcL appeared to
be under greater constraint at non-synonymous sites,
it was surprising to find a rapidly evolving first and
second position that in both cases resulted in nonsynonymous substitutions in the rbcL data set. The
different amino acids encoded at these sites are those
typical of rbcL sequences (e.g. Kellogg & Juliano,
1997), and it may be that there is selection for one of
several amino acids at these sites, but little selection
for a specific one of those amino acids among the taxa
examined here. Analysis of the rbcL data set that
included rapidly evolving first and second position
sites resulted in a topology that was inconsistent with
all other analyses, suggesting that even nonsynonymous sites in rbcL may exhibit problematic
levels of homoplasy (see also Manos & Steele, 1997).
Although the degree of divergence among noncoding chloroplast regions varies greatly (e.g. Small
et al., 1998), the two regions examined here both
appeared to be diverging at relatively high rates, and
this, combined with a lesser degree of constraint,
might be expected to lead to greater homoplasy in
these regions. However, the CIs of both non-coding
regions were higher than for the coding regions, and
removal of rapidly evolving (and potentially more
homoplasious) sites had little effect on the resulting
topology, indicating that the non-coding regions were
not excessively homoplasious at the taxonomic level
examined in this study.
PHYLOGENETIC
RELATIONSHIPS IN
WOLFFIELLA
Phylogenetic relationships in Wolffiella inferred from
the present study may be compared to those hypothesized previously for the genus. In an attempt to
resolve relationships within Wolffiella, Landolt (1986)
used primitive and derived states of 26 anatomorphological characters and ranked species phylogenetically
according to an ‘index of primitivity’ for those characters (Fig. 2a). We placed the subsequently named
W. caudata as the sister to W. lingulata in Fig. 2a
because Landolt (1992) viewed it as most closely
related to that species. Landolt (1986) recognized a
basal split within the genus, with one lineage consisting of Sections Rotundae and Stipitatae and the other
573
comprising Section Wolffiella. Les et al. (1997a,b) used
41 anatomorphological characters (including many of
the same ones employed by Landolt, 1986) to conduct
a phylogenetic analysis using maximum parsimony.
That analysis showed the same basal split as Landolt
(1986) hypothesized with monophyly of the three sections. However, the genus was not shown to be monophyletic. The uniformity of the anatomorphological
traits used by Les et al. (1997a,b) precluded resolution
of relationships among species. Crawford et al. (1997)
inferred relationships within Wolffiella using allozyme
data (Fig. 2b) and recovered the same basal split in
the genus as well as clustering of the sections
(Fig. 2a,b). Our combined cpDNA sequence phylogeny
also supports the basal split as well as the sectional
relationships depicted by Landolt (1986) and supported by later studies (Crawford et al., 1997; Les
et al., 1997a,b). Of interest may be the tree based on
the larger rpl16 alignment, in which Section Stipitatae is not monophyletic (Fig. 3; see also Table 4).
Whether this result is real or due to an artefact
remains to be determined.
Species in the two lineages resulting from the basal
split in Wolffiella differ in several ecological features
(Landolt, 1986). Members of Sections Rotundae and
Stipitatae live on the surface of seasonal waters and
survive dry periods by producing seeds. The seeds germinate quickly, and rapid vegetative reproduction produces large populations that cover the surface of the
water with the onset of the wet period (Landolt, 1994).
In contrast, members of Section Wolffiella, both in the
Americas and in Africa, live submersed in permanent
waters. When overgrown by other plants, they survive
by using organic substances from the water. They are
also capable of sinking to the bottom of the water and
using nutrients released from the soil. With the exception of W. welwitschii, members of Section Wolffiella
live in permanent waters and do not rely on seeds for
reproduction (Landolt, 1986). Thus, the initial phylogenetic split in Wolffiella reflects two lineages differing by several fundamental life history and ecological
attributes.
Relationships among species in Section Wolffiella
have been difficult to infer. Les et al. (1997a,b) were
unable to clarify these relationships using anatomorphological data. Greater resolution was achieved in
our combined cpDNA phylogeny and in the allozyme
dendrogram which agreed topologically in most
respects, differing only in the placement of
W. welwitschii (cf. Figs 1, 2b). Landolt (1986) portrayed close phylogenetic relationships among
W. gladiata, W. lingulata and W. oblonga. He also
viewed the newly described W caudata as closely
related to these three species, and especially to
W. lingulata (Landolt, 1992). A close relationship
hypothesized between W. caudata and the other three
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576
574
R. T. KIMBALL ET AL.
species is concordant with our cpDNA phylogeny.
However, W. caudata is sister to the other three species in the cpDNA phylogeny whereas Landolt considered it to be most closely related to W. lingulata. The
phylogeny of Landolt (1986) also differed in the placement of W. denticulata and W. neotropica (cf. Figs 1,
2a).
The rpl16 sequences failed to group all accessions of
Wolffiella oblonga (Fig. 3), with accessions occurring
in three different regions of the tree (see also Table 4).
The clustering of two accessions of W. oblonga(8393
and 8394) with W. lingulata is concordant with allozyme data; Crawford et al. (1997) found very high allozyme identities between these taxa (the highest yet
found between any two Lemnaceae species) with no
alleles unique to either species. This result is also supported by morphology, as Landolt (1986) observed that
W. gladiata, W. oblonga and W. lingulata were ‘very
difficult to recognize’ and that distinctions between
W. oblonga and W. lingulata were ‘especially unclear’
due to extensive variability in key, defining characteristics. He further stated that it was difficult to determine whether certain collections contained both
species, or a single species displaying different developmental forms (Landolt, 1986). Given this, it is possible that W. oblongaand W. lingulata may not form
distinct species or may have diverged very recently.
Also of interest is the accession of W. oblonga (7997)
that grouped with W. caudata in the rpl16 analysis
(Fig. 3, Table 4), though the reasons for this anomalous placement are unknown. Another accession of
W. oblonga (8072) was distinct in the rpl16 analysis
(Fig. 3) and did not show a high degree of similarity to
any other accession. Too few data exist to determine
whether this accession represents a novel lineage or
whether its position is due to incomplete coalescence.
Whatever the case, it is clear that a more detailed
study of W. oblonga and other closely related species
could be fruitful for elucidating their relationships.
MOLECULAR
BIOGEOGRAPHY OF
WOLFFIELLA
Determining the biogeographical history of Lemnaceae is challenging because they are readily dispersed by virtue of their minute size. Furthermore,
there are several documented examples of species
recently introduced by humans (Landolt, 1986). These
factors have led to widespread distributions of Lemna,
Spirodela and Wolffia. In contrast, Wolffiella is
restricted to North and South America and Africa.
Landolt (1986) suggested that Wolffiella originated in
the warmer regions of South America because the
‘most primitive and probably most ancient’ species
(W. neotropica) occurred there, although his tree was
equivocal with regard to the origins of Wolffiella
(Fig. 2a). Relationships inferred from divergence at
allozyme loci and cpDNA data indicated an African
origin for Wolffiella, with dispersal to America, followed by a later dispersal of W. welwitschii back into
Africa (Figs 2b, 3). An African origin is more parsimonious than is a South American origin, as it requires
only two dispersal events between Africa and the
Americas, rather than the three required if Wolffiella
originated in South America. We assumed a synonymous substitution rate of 0.12% per Myr for rbcL,
which is similar to rates calculated for a variety of
flowering plants (e.g. Xiang et al., 2000) to estimate
the timing of the dispersal from Africa to the Americas. The divergence between W. denticulata and
W. welwitschii was 2.35% at synonymous sites, which
sets the estimated time for the divergence between
American and African species at approximately 9.8
million years.
Although it appears most likely that Wolffiella originated in Africa, it is less clear where W. welwitschii
(which is distributed in both South America and
Africa) evolved. Allozyme data (and morphology) suggest that W. welwitschii originated in South America,
and then dispersed to Africa, while the combined
cpDNA data are equivocal (Figs 1, 2). The accession of
W. welwitschii that we sequenced for the combined
data set was from South America. If the origin of the
species were in Africa with dispersal to America, then
it should be at least as divergent from the American
species as it is from W. denticulata. In contrast, if the
species originated in America with dispersal back to
Africa the American accession should be less divergent
from exclusively American species than from the
African W. denticulata (Fig. 1). The accession of
W. welwitschii from South America was more divergent from W. denticulata than from any American species of Wolffiella (about twice as divergent in matK
and rbcL), supporting an American origin for
W. welwitschii with dispersal back to Africa. Additional support for an American origin occurred in the
rpl16 data, for which both an African and American
accession were sequenced (Table 1; Fig. 3). The two
accessions of W. welwitschii differed by only two substitutions (0.43%) whereas the mean divergence
between W. welwitschii and the South American
W. neotropica was 1.1% and divergence between
W. welwitschii (whether from America or Africa) and
its closest African species, W. denticulata, was over
2.5%. In Wolffiella, the rate of rpl16 sequence divergence was 1.33 times faster than at rbcL synonymous
sites (see Results), giving an estimated time of 1.35
Myr for dispersal of W. welwitschii back to Africa.
Among the species of Wolffiella, W. welwitschii is best
adapted for dispersal because it flowers and sets seed
rather frequently, and Lemnaceae seeds are able to
survive out of water much longer than can their fronds
(Landolt, 1997).
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576
WOLFFIELIA PHYLOGENETICS AND BIOGEOGRAPHY
The distribution of Wolffiella differs markedly from
the distributions of other Lemnaceae genera because
it can be explained by only two major dispersal events.
In contrast, attempts to reconstruct distributions of
other genera using the methods employed in the
present study have given ambiguous results (D.H. Les,
unpubl. data) due to widespread distributions of several species in other genera. There are several historical and ecological factors that could account for the
fewer dispersal events in Wolffiella than in the other
genera. Molecular data (Crawford & Landolt, 1995;
Crawford et al., 1997; Les et al., 2002) showed that
species of Wolffiella are less divergent on average than
are species of other genera of Lemnaceae, suggesting
that species of Wolffiella are ‘younger’ than those of
other genera. Thus, the lack of dispersal could be a
reflection of time since origin. As indicated earlier,
Wolffiella is restricted to tropical and subtropical
regions where distances between continents are
greater than in the boreal and nemoral regions of the
northern hemisphere, and low dispersal may reflect
the greater distances between continents. The fronds
of Wolffiella are very thin, and cuticles function less
efficiently against dehydration than in other Lemnaceae, making it difficult for the fronds to be transported long distances even when covered by feathers
(Landolt, 1986). Seeds, which allow for more efficient
transport than fronds, are rare in all species except
members of Sections Rotundae and Stipitatae and in
W. welwitschii (Landolt, 1986). The specialized adaptations of Sections Rotundae and Stipitatae to seasonally dry, local pools in Africa make them poor
colonizers, so dispersal events would likely not result
in effective colonization (Landolt, 1994).
ACKNOWLEDGEMENTS
We would like to thank E. L. Braun for assisting us
with the SOWH test and allowing us to use his computer facilities. We thank two anonymous reviewers
for critically reading an earlier version of the manuscript. Research was funded in part by NSF grant
DEB-9806537 to D.H.L. and D.J.C.
REFERENCES
Crawford DJ, Landolt E. 1995. Allozyme divergence among
species of Wolffia (Lemnaceae). Plant Systematics and Evolution 197: 59–70.
Crawford DJ, Landolt E, Les DH, Tepe E. 1997. Allozyme
variation and the taxonomy of Wolffiella (Lemnaceae).
Aquatic Botany 58: 43–54.
Farris JS, Källersjo M, Kluge AG, Bult C. 1995. Testing significance of incongruence. Cladistics 10: 315–319.
Fehrer J. 1996. Conflicting character distribution within dif-
575
ferent data sets of cardueline finches: Artifact or history?
Molecular Biology and Evolution 13: 7–20.
Goldman N, Anderson JP, Rodrigo AG. 2000. Likelihoodbased tests of topologies in phylogenetics. Systematic Biology
49: 652–670.
Johnson LA, Soltis DE. 1995. Phylogenetic inference in Saxifragaceae sensu stricto and Gilia (Polemoniaceae) using
matK sequences. Annals of the Missouri Botanical Garden
82: 149–175.
Jordan WC, Courtney MW, Neigel JE. 1996. Low levels of
intraspecific genetic variation at a rapidly evolving chloroplast DNA locus in North American duckweeds (Lemnaceae).
American Journal of Botany 83: 430–439.
Kellogg EA, Juliano ND. 1997. The structure and function of
rubisco and their implications for systematic studies. American Journal of Botany 84: 413–428.
Kumar S, Tamura K, Nei M. 1993. MEGA: molecular evolution genetic analysis, Vers. 1.0. Pennsylvania: Pennsylvania
State University.
Landolt E. 1986. Biosystematic investigations in the family of
duckweeds (Lemnaceae) volume 2. The family of Lemnaceae
– a monographic study, volume 1. Veroffentlichungen des
Geobotanischen Institutes der ETH, Stiftung Rubel Zurich
71.
Landolt E. 1992. Wolffiella caudata, a new Lemnaceae species
from the Bolivian Amazon region. Berichte des Geobotanischen Institutes der ETH, Stiftung Rubel, in Zurich 58: 121–
123.
Landolt E. 1994. The Lemnaceae of Zimbabwe and Botswana.
Berichte des Geobotanischen Institutes der ETH, Stiftung
Rubel, in Zurich 60: 110–136.
Landolt E. 1997. How do Lemnaceae (duckweeds) survive dry
conditions? Bulletin of the Geobotanical Institute ETH. 63:
25–31.
Landolt E. 2000. Contribution on the Lemnaceae of Ecuador.
Fragmenta Floristica et Geobotanica 45: 221–237.
Les DH, Crawford DJ. 1999. Landoltia (Lemnaceae), a new
genus of duckweeds. Novon 9: 530–533.
Les DH, Crawford DJ, Landolt E, Aakjar R, Tepe E.
1997a. Systematics of Lemnaceae revisited. American Journal of Botany 84: 211.
Les DH, Crawford DJ, Landolt E, Gabel JD, Kimball RT.
2002. Phylogeny and systematics of Lemnaceae, the duckweed family. Systematic Botany 27: 221–240.
Les DH, Landolt E, Crawford DJ. 1997b. Systematics of
Lemnaceae: inferences from micromolecular and morphological data. Plant Systematics and Evolution 204: 161–177.
Manos PS, Steele KP. 1997. Phylogenetic analyses of ‘higher’
Hamamelididae based on plastid sequence data. American
Journal of Botany 84: 1407–1419.
Nei M, Gojobori T. 1986. Simple methods for estimating the
numbers of synonymous and nonsynonymous nucleotide
substitutions. Molecular Biology and Evolution 3: 418–
426.
Posada D, Crandall KA. 1998. MODELTEST: testing the
model of DNA substitution. Bioinformatics 14: 817–818.
Rambaut A, Grassly NC. 1997. Seq-Gen: An application for
the Monte Carlo simulation of DNA sequence evolution
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576
576
R. T. KIMBALL ET AL.
along phylogenetic trees. Computer Applications in Biosciences 13: 235–238.
Simmons MP, Ochoterena H. 2000. Gaps as characters in
sequence-based phylogenetic analyses. Systematic Biology.
49: 369–381.
Small RL, Ryburn JA, Cronn RC, Seelanan T, Wendel JF.
1998. The tortoise and the hare: choosing between noncoding
plastome and nuclear ADH sequences for phylogeny reconstruction in a recently diverged plant group. American Journal of Botany 85: 1301–1315.
Steele KP, Vilgalys R. 1994. Phylogenetic analyses of
Polemoniaceae using nucleotide sequences of the plastid
gene matK. Systematic Botany 19: 126–142.
Swofford DL. 1999. PAUP*: Phylogenetic analysis using parsimony (*and other methods), Version 4.0. Sunderland, MA:
Sinauer Associates.
Swofford DL, Olsen GJ, Waddell PJ, Hillis DM. 1996. Phylogenetic inference. In: Hillis DM, Moritz C, Marble BK, eds.
Molecular systematics, 2nd edn. Sunderland, MA: Sinauer
Associates, 404–514.
Thompson JD, Higgins DG, Gibson TJ. 1994. Clustal W:
Improving the sensitivity of progressive multiple sequence
alignment through sequence weighting, position-specific gap
penalties and weight matrix choice. Nucleic Acids Research
22: 4673–4680.
Xiang Q-Y, Soltis DE, Soltis PS. 1998. Phylogenetic relationships of Cornaceae and close relatives inferred from matK
and rbcL sequences. American Journal of Botany 85: 285–
297.
Xiang Q-Y, Soltis DE, Soltis PS, Manchester SR, Crawford DJ. 2000. Timing the eastern Asian–eastern North
American floristic disjunction: molecular clock corroborates
paleontological estimates. Molecular Phylogenetics and Evolution 15: 462–472.
Yang Z. 2002. Phylogenetic analysis by maximum likelihood
(PAML). Vers. 3.12. London: University College.
Yang Z, Nielsen R. 2000. Estimating synonymous and
nonsynonymous substitution rates under realistic evolutionary models. Molecular Biology and Evolution 17: 32–
43.
© 2003 The Linnean Society of London, Biological Journal of the Linnean Society, 2003, 79, 565–576