banner

News

Jul 11, 2023

Genomic copy number variability at the genus, species and population levels impacts in situ ecological analyses of dinoflagellates and harmful algal blooms

ISME Communications volume 3, Article number: 70 (2023) Cite this article

768 Accesses

1 Altmetric

Metrics details

The application of meta-barcoding, qPCR, and metagenomics to aquatic eukaryotic microbial communities requires knowledge of genomic copy number variability (CNV). CNV may be particularly relevant to functional genes, impacting dosage and expression, yet little is known of the scale and role of CNV in microbial eukaryotes. Here, we quantify CNV of rRNA and a gene involved in Paralytic Shellfish Toxin (PST) synthesis (sxtA4), in 51 strains of 4 Alexandrium (Dinophyceae) species. Genomes varied up to threefold within species and ~7-fold amongst species, with the largest (A. pacificum, 130 ± 1.3 pg cell−1 /~127 Gbp) in the largest size category of any eukaryote. Genomic copy numbers (GCN) of rRNA varied by 6 orders of magnitude amongst Alexandrium (102– 108 copies cell−1) and were significantly related to genome size. Within the population CNV of rRNA was 2 orders of magnitude (105 – 107 cell−1) in 15 isolates from one population, demonstrating that quantitative data based on rRNA genes needs considerable caution in interpretation, even if validated against locally isolated strains. Despite up to 30 years in laboratory culture, rRNA CNV and genome size variability were not correlated with time in culture. Cell volume was only weakly associated with rRNA GCN (20–22% variance explained across dinoflagellates, 4% in Gonyaulacales). GCN of sxtA4 varied from 0–102 copies cell−1, was significantly related to PSTs (ng cell−1), displaying a gene dosage effect modulating PST production. Our data indicate that in dinoflagellates, a major marine eukaryotic group, low-copy functional genes are more reliable and informative targets for quantification of ecological processes than unstable rRNA genes.

Genomic copy number variation (CNV) is increasingly documented in eukaryotic, bacterial and archaeal genomes [1,2,3,4,5], and represents a major source of intra-specific and population-level genetic variation. The impact of CNV on phenotypic trait expression has been characterised in flowering plants, vertebrates, yeast, and human health research including many model organisms [1,2,3,4,5]. Eukaryote CNVs can lead to increased expression and dosage, providing a potential selective advantage [1, 3, 5,6,7]. Despite its potential importance, the scale and role of CNV in most non-model organisms, including marine microbial eukaryotes, is poorly understood.

While CNV has been reported in marine microbial eukaryotes [8,9,10,11,12,13], and a few studies have indicated rRNA genes could vary in copy numbers or sequences [14, 15], it is still relatively unclear whether CNV has a significant impact on quantitative molecular ecological studies employing meta-barcoding, meta-genomics and qPCR [10]. Quantitative molecular ecology studies of marine protists generally use regions of the rRNA operon for community structure analyses due to the broad coverage of rRNA genes in reference databases, the capacity to resolve taxa, and a high genomic copy number (GCN) in eukaryotes (>102 cell−1) which aids in the detection sensitivity [9,10,11,12]. However, in animals, fungi and plants, rRNA gene copies are variably present, from 102–104 copies cell−1 [9, 10, 16,17,18], with a similar range (103–104 copies cell−1) in diatoms (Stramenopiles) [19]. Other groups of microbial eukaryotes may show greater variation, from 103- 105 copies cell−1 in ciliates (Alveolata) [11], and 102–105 in foraminifera (Rhizaria) [12]. Within most species of microbial eukaryotes, rRNA gene copy numbers are considered to be more stable [19, 20], however, relatively few studies have examined this [11, 12, 19].

Dinoflagellates encompass most harmful algal bloom (HAB) forming taxa, as well as constituting up to 50% of marine microbial eukaryotic biomass, thus are a major constituent of aquatic microbial ecosystems [13]. Genome size varies considerably in dinoflagellates (~1 Gb to >150 Gb) including some of the largest known eukaryotic genomes, larger than the largest animal (lungfish, 130 Gb) and plant (Paris japonica, 149 Gb) genomes [21,22,23,24,25]. Gene duplication and large-scale expansion appear to have occurred amongst dinoflagellate genomes, and coding genes are often present in multiple tandem repeats [26,27,28,29,30]. Genomes of coral symbiont species (Dinophyceae: Symbiodinaceae) show highly dynamic evolution, driven by gene family expansion via both tandem duplication [28, 29] and retroposition [31, 32]. Considerable genome size variation and very large genomes occur in multiple planktonic dinoflagellate orders [33], as well as in other groups of marine microbial eukaryotes such as foraminifera, ciliates, and Amoebozoa [33]. GCN of rRNA genes across much of eukaryotic life are considered broadly correlated with genome size [18]. Such large and dynamic genome sizes suggest substantial CNV may exist in these taxa.

Of marine harmful algal blooms forming taxa, those that produce Paralytic Shellfish Toxins (PSTs) are common and have significant public health and economic implications [34]. PST expression can constitute an inducible defence mechanism in marine dinoflagellates in response to the presence of copepod predators [35]. PSTs are synthesised by the cosmopolitan and common marine dinoflagellates Alexandrium species, Pyrodinium bahamense, Gymnodinium catenatum, and Centrodinum punctatum. Dinoflagellate genes associated with PST biosynthesis (sxt) [36,37,38] possess dinoflagellate features such as a unique 22 bp spliced leader sequence on transcripts, a high GC-content, and eukaryotic poly-A tails [36, 38]. A relatively low proportion of genes (~10–27%) in dinoflagellates are thought to be regulated at the transcriptional level [21, 23, 26], with many genes regulated post-transcriptionally. The role of gene dosage acting on this trait may therefore differ in dinoflagellates from that in more highly transcriptionally regulated taxa. Studies of certain species such as A. minutum and A. ostenfeldii have indicated a correlation may exist between cellular PST content and genomic copies of the PST biosynthetic gene sxtA4 [39,40,41]. Some studies have shown that PST synthesis may not be regulated at the transcriptional level [42, 43]. This gene has been found to vary in GCN across studies, as A. pacificum and A. catenella show ~180–325 copies cell−1 of sxtA4 [40, 44, 45]; while A. minutum and A. ostenfeldii showed fewer copies, at 1.5–11 copies cell−1 [39, 41].The majority of species of Alexandrium had no detectable sxtA4 copies and do not produce PSTs [46]. Thus sxtA4 is a gene with a comparatively lower copy number in dinoflagellates than those that show large scale tandem repeats [26,27,28,29,30]. If consistent across PST-producing species, GCN may constitute a useful marker for in situ ecological analyses of HABs, and potentially for other functional traits governed by genomic dosage.

Because rRNA genes, as compared to coding genes, are likely to be under different selective pressures [47], processes that lead to CNV may differ between them. To determine the impact of CNV on both a functional gene and rRNA barcoding markers, and to examine the role of genome size and time in culture on CNV, we quantified CNV of rRNA genes and sxtA4 in relation to genome size across 51 strains of PST-producing marine dinoflagellate, Alexandrium australiense, A. pacificum, A. catenella and A. minutum. Our selection of strains provided capacity to examine CNV within and between species, in strains maintained in long-term culture, and CNV variance across regions. As diversity analysis employing rRNA genes in particular becomes ubiquitous, we aimed to determine the scale of biases associated with CNV, examine its prevalence across dinoflagellates and indicate potential solutions.

Fifteen non-axenic strains of Alexandrium pacificum were established from a surface net haul collected on 22/11/18 at Mindarie Marina, Western Australia (−31.689127, 115.703103). Single cell isolation of A. pacificum was performed using drawn out glass pipettes and a Nikon Eclipse TS100 inverted microscope (100x magnification). Isolated cells were transferred into Falcon®24 well culture plates containing 1 ml of K/5 medium [48] without sodium silicate. Germanium dioxide was added (5 µg/ml) to prevent diatom growth. Plates were kept at 18 °C under a photon flux of 60–100 μmol photons PAR m−2 s −1 with a 12/12 h dark/light cycle (cool white fluorescent). After 3 weeks, the cultures were transferred into 20 ml K media in 70 mL sterile culture flasks (Thermo Fisher Scientific, Massachusetts, USA), and maintained by serial transfer every 3 weeks. In total 36 additional strains of 4 Alexandrium species (Alexandrium catenella, A. minutum, A. pacificum, A. australiense; Supplementary Table 1) were obtained from collections: the Australian National Algae Culture Collection (CS), the Cawthon Institute Culture Collection of Microalgae (CAWD), the Roscoff Culture Collection (RCC), and collections maintained at the Institute for Marine and Antarctic Studies, University of Tasmania. Strains originated from 8 different countries in Europe, Asia, Australasia and the Americas, across states and regions in Australia, and were isolated on differing dates within the past 30 years.

Isolate identity was confirmed by sequencing the D1-D3 region of large-subunit rRNA. Cells were harvested from 50 ml of culture by centrifugation and DNA extracted using the FastDNA spin kit for soil (MP Biomedicals, Santa Ana, California, USA). DNA quality was checked using a Nanodrop 2000 (Thermo Scientific, Waltham, Massachusetts) spectrophotometer. The D1-D3 region of LSU-rRNA was amplified by PCR using primers D1F [49] and D3B [50] in 25 µl reactions containing 5 µl of 5X MyTaq buffer (Bioline, London, UK), MyTaq polymerase (Bioline, London, UK) 0.5 µl, 7.5 pmol of each primer, 1 µg µl−1 BSA (Biolabs, Arundel, Australia), 1 µl of DNA and 15.5 µl of DNA-free water. PCR conditions were 94 °C for 5 min, followed by 35 cycles of: 94 °C for 30 s, 56 °C for 30 s, and 72 °C for 1 min; and a final extension step of 3 min. PCR products were verified by 1% agarose gel electrophoresis stained with GelRed (Gene Target Solutions, Dural, Australia) and purified with Zymoclean™ (Zymo Research, California, USA). Sanger sequencing of products was performed by Macrogen (South Korea), with strains assigned to A. pacificum, A. catenella, A. minutum or A. australiense based on comparisons with sequences from verified isolates of each species.

To measure genome size, CNV and PSTs, Alexandrium spp. were grown to exponential phase in GSe medium [51] at 18 °C. For genome size measurement of cell cycle synchronised strains, cells were then incubated for 48 h in darkness to induce synchronisation of cell division [52, 53]. Sub-samples were fixed with Lugol’s iodine at the point of harvest, and cell concentration determined using a Sedgewick-Rafter counting chamber (ProSciTech, Australia) and an inverted light microscope (Leica Microsystems, Wetzlar, Germany). For CNV quantification with qPCR, ~60–75 × 103 cells in triplicate from each strain were harvested by centrifugation (10 min at 1000 g). For genome size quantification using flow cytometry, ~105 cells were harvested in triplicate from each strain. At least 106 cells were harvested by centrifugation for 10 min at 1000 g for PST measurement.

Cells were washed with a 1 × PBS, fixed with 1% (w/v) paraformaldehyde for 10 min, and then washed again with 1 × PBS. Cell pellets were re-suspended in 2 mL cold methanol, stored for at least 12 h at 4 °C to remove the intracellular chlorophyll, washed twice with 1x PBS, and then stained for > 3 h in 0.1 mg mL−1 propidium iodide and 2 µg mL−1 RNAse (Merck KGaA, Darmstadt, Germany).

A CytoFLEX S Flow cytometer (Beckman Coulter, California, USA) equipped with laser excitation at 488 nm was used for the flow cytometry analysis. BD™ DNA QC Particles Chicken blood cells (3 pg DNA/nuclei; BD Biosciences, San Jose, USA) were used as a standard [54]. In total, 2-µm fluorescent beads were used as stable particles to verify instrument alignment (BD Biosciences, San Jose, USA). Triplicate samples were run at 30 µL min−1, and data acquired in linear and log modes until at least 10,000 events were measured per sample. Fluorescence emission of propidium iodide stained DNA was detected at 610 ± 10 nm. The peak ratios and coefficient of variation (CV) were quantified with CytExpert software (Beckman Coulter, California, USA). FSC channel was used as a trigger with automatic setting from the manufacturer. Gating and further analysis were only performed for peaks with CV values below 20%, and any peaks above this were rerun. The gating was performed by using FSC-A vs SSC-A gate to exclude debris and use the PI gate on histogram to remove large background noise without the DNA content. Genome size in base pairs used a conversion factor of 1 pg of DNA = 978 Mbp [55].

DNA extraction was carried out using a PowerSoil DNA Extraction kit (QIAGEN, OH, USA) according to manufacturer’s instructions. DNA was extracted in triplicate, and quality and quantity determined using a Nanodrop ND-1000 (ThermoFisher Scientific, Waltham, Massachusetts) and Qubit 2.0 Fluorometer (ThermoFisher Scientific, Waltham, Massachusetts). qPCR was carried out with a BioRad CFX384 Touch™ System (BioRad, California, USA) using species-specific qPCR assays for Alexandrium rRNA genes [56] and sxtA4 [44] (Supplementary Table 5, Supplementary Fig. 8). qPCRs were run in triplicate using the following cycling parameters: 95 °C for 10 s, 35 replicates of 95 °C for 15 s and 60 °C for 30 s. Total reaction volume was 10 μl, containing 5 μl SybrSelect™ (ThermoFisher Scientific, Massachusetts, USA), 0.5 μM each primer, 1 μl template DNA, and 3 μl PCR-grade water. Samples and Master Mix were loaded to Hard-Shell 384-well PCR plates using epMotion 5075 Liquid Handling Workstations (Eppendorf AG, Hamburg, Germany). Amplification specificity was confirmed using melt-curve analysis. Quantification cycle values were generated by CFX Manager 3.1. Standard curves of sxtA4 and rRNA genes vs quantification cycle (Cq) was developed using ten-fold serial dilution of Gblocks® fragments (Integrated DNA Technologies, Coralville, Iowa, USA). Copy number per µL DNA was determined using the formula:

Standard curves, positive controls and negative controls were included in each sample plate as the sample DNA extracted from Alexandrium strains of known concentration (cells µL−1). Copies of sxtA4 and rRNA genes per µL−1 DNA were determined relative to qPCR standard curves.

The significance of relationships between genome size, rRNA gene copies cell−1, sxtA4 copies cell−1 and total PST cell−1 were assessed using Spearman’s rank correlation and linear regression after transformation, as appropriate, as implemented in GraphPad Prism 7.04. Shapiro-Wilk tests were used to examine normal or log normal distributions. Patterns of genome size and rRNA gene CNV associated with cell culture were based on isolation dates provided by culture collections and isolators. Days from isolation date to sample extraction date were calculated. To account for different laboratory growth rates, cultivation days were converted to estimated number of generations based on published growth rates for each species at culture maintenance conditions (50–80 μmoles PAR; 12:12 L:D cycle18–20 °C). Individual strain variance from respective species means were calculated for genome size (pg) and log10 rRNA gene (copies cell−1). Individual strain deviation from species means were calculated using ([(Xi - X̅)/σ)]. The effect of extended culture periods on variability in genome size and rRNA gene GCN was examined using Levene’s test for equal variances among the following three sample groups: (1) Mindarie isolates cultivated for <20 generations; (2) strains cultivated for 100–800 generations; (3) strains cultivated for >1000 generations.

Cell pellets were freeze-dried, then extracted for PSTs and measured [53]. Briefly, samples were resuspended in 2 mL of 1 mM acetic acid and vortexed for 90 s at 100 °C for 5 min, sonicated for 5 min and filtered with 0.45 µm PVDF filter (Merck Millipore, Massachusetts, USA). Chromatographic separation (modified [57]) was performed on a Thermo Scientific™ ACCELA™ UPLC system coupled to a Thermo Scientific™ Q Exactive™ (ThermoFisher Scientific, Massachusetts, USA) mass spectrometer. Analysis was performed using an Acquity UPLC BEH Amide 130 A 1.7 µm 150 × 2.1 mm column with an injection volume of 5 µL. Mobile phases were A1 (water/formic acid/NH4OH at 500:0.075:0.3 v/v/v), B1 (acetonitrile/water/formic acid at 700:300:0.1 v/v/v). Certified standard solutions of C1,C2, GTX1, GTX2, GTX3, GTX4, GTX5, dcGTX2, dcGTX3, STX, dcSTX, NEO and dcNEO were purchased from National Research Council of Canada (NRC, Halifax, Canada). The limit of detection of the PST analysis method for all of the targeted compounds was 0.01 pg cell−1.

Genome sizes of all species were large, and varied greatly between species, from 22.5 ± 0.3 pg cell−1 to 130.9 ± 1.3 pg cell−1 (Fig. 1). The genome of A. minutum was the smallest (27.0 ± 2.0 pg cell−1, n = 16) while species of the A.tamarense species complex were much larger: A. catenella (79.2 ± 5.9 pg cell−1, n = 13), A. pacificum (72.5 ± 14 pg cell−1, n = 27) and A. australiense (87.3 ± 8.2 pg cell−1, n = 5). Strains of A. pacificum showed more than threefold variation in genome size (42.4 ± 1.5 pg cell−1 to 130.9 ± 1.3 pg cell−1) (Fig. 1). Genome size among A. pacificum strains from the same population varied by 13% above and 9% below the population mean (CV = 5.1%, Fig. 1). A. pacificum genomes from different populations, different global regions, and time in laboratory culture varied over a wider range (CV = 29.5%, Fig. 1), however, there was no clear relationship between genome size and time in laboratory culture, or significant increase or decrease in genome size variability over time in culture (Fig. 2a, Supplementary Fig. 1A, F = 2.331, df=2, p = 0.110).

Images show cells using fluorescence light microscopy, stained with Calcofluor white, from [83, 84].

a Variation in Alexandrium spp genome size with generations in laboratory culture. Values are SD from the mean genome size. Darker background shading indicates range of deviation among Mindarie (WA) Alexandrium pacificum strains cultured for <20 generations prior to analysis. b Variation in Alexandrium species rRNA copy number with increasing generations in laboratory culture. Values are SD from the mean rRNA copy number of each species. Darker background shading indicates range of deviation among Mindarie (WA) Alexandrium pacificum strains cultured for <20 generations prior to analysis.

rRNA GCN varied by 6 orders of magnitude across Alexandrium species, from 267 ± 18.8 copies cell−1 in a strain of A. minutum (RCC4877) to 1.23 × 108 ± 85 × 106 copies cell−1 in a strain of A. australiense (AT-YC-H) (Fig. 3a). In general, strains of A. australiense and A. pacificum showed the largest rRNA GCN, while those of A.minutum were the smallest and most uniform (Fig. 3a). Within-species CNV of rRNA genes was surprisingly high, particularly among isolates of A. pacificum, which varied from 1.74 × 104 ± 1.26 × 103 copies cell−1 to 1.72 × 107 ± 1.40 × 107 copies cell−1, and more than two orders magnitude (8.1 × 104 ± 1.40 × 104 copies cell−1 to 1.72 × 107 ± 1.41 × 107 copies cell−1) in a single population (Fig. 4). Increased generations in laboratory culture (Fig. 2b, Supplementary Fig. 1B) had no effect on mean rRNA GCN or CNV of Alexandrium species (F = 1.819, df=2, p = 0.177). Variability was evident in sxtA4 copies across Alexandrium species, though at a smaller scale than in rRNA gene (Fig. 3c, Supplementary Fig. 2). The sxtA4 GCN of A. pacificum strains showed the greatest variability, from 7.8 ± 1.9 copies cell−1 to 609 ± 133 copies cell−1, but were more uniform among strains from the same population (43.8 ± 9 copies cell−1 - 609 ± 133 copies cell−1), while the variability of sxtA4 GCN in A. minutum and A. australiense strains were low, from 0–8.9 ± 0.61 copies cell−1 (Fig. 3c). A. catenella strains varied from 19.9 ± 10.5 copies cell−1 - 189.12 ± 58.5 copies cell−1 (Fig. 3c). A significant positive relationship between log10 rRNA copies cell−1 and genome size was evident (Fig. 3b, F = 22.99, df=49, p < 0.0001, r2 = 0.319) across the species of Alexandrium, however at the species level this was only significant within A. pacificum (F = 6.7, df=18, p = 0.0185, r2 = 0.27). A significantly non-zero slope was found in the relationship between log10 sxtA4 copies cell−1 and genome size but explaining only a very low amount of the variance (Supplementary Fig. 2A, F = 5.3, df=43, p = 0.026, r2 = 0.109).

a CNV of rRNA (±SD) in strains of Alexandrium spp. b Relationship between genome size (pg cell−1) and rRNA copies cell−1 in Alexandrium species (F = 22.99, df=49, p < 0.0001, r2 = 0.319). c CNV of sxtA4 (±SD) in Alexandrium species. d Relationship between total PSTs (ng cell−1) and sxtA4 copies cell−1 in Alexandrium species (F = 11.73, df=18, p = 0.003, r2 = 0.395).

a Sites where strains of A. pacificum were isolated from the Pacific region including sites across Australia. b Mindarie from where strains of A. pacificum were isolated. c CNV of sxtA4 (±SD) and rRNA (±SD) in Alexandrium pacificum from all sites and from Mindarie only.

All strains produced PSTs with the exception of A. australiense ATCJ33 and two strains of A. minutum, RC4874 and CCMI1002. The range of PST congeners produced varied considerably between and within species (Fig. 5). Species of A. catenella contained the highest PST content (Figs. 5, 3d) dominated by C1, 2 and GTX1, GTX2, GTX3 and GTX4. Both GTX5, GTX6, as well as decarboxylated versions of STX and NeoSTX were mostly absent. In contrast, A. minutum strains produced a more restricted range of PSTs, with C2, GTX2 and GTX3 produced (Fig. 5). Total PSTs were more uniform among A. pacificum, with no consistently dominant PST variants. A positive relationship was significant between log10 sxtA4 copies cell−1 and log10 total PSTs ng cell−1 (Fig. 3d, F = 11.73, df=18, p = 0.003, r2 = 0.395). A. minutum produced a higher amount of PST per copy of sxtA4 in comparison to A. pacificum. Strains of A. australiense with extremely low or no detectable PSTs (ATCJ33, AADVN1, AT-YC-H), averaged <1 sxtA4 copy cell−1. In cases where less than one sxtA4 copy cell−1 was found, this was due to copy numbers below the level of quantification of the qPCR assay.

The heatmap shows the inter-species and intra-species variations of each PST congeners' concentration across Alexandrium spp.

Genomic CNV in eukaryotes, bacteria and archaea has been increasingly documented, sometimes in relation to whole genome duplication or polyploidy [1,2,3,4,5]. CNV is considered a major evolutionary process, influencing the expression of key phenotypic traits [1, 2, 4,5,6,7]. The presence of CNV is also of practical significance, impacting the way in which molecular barcoding genes such as rRNA, commonly used for community structure analyses, can be interpreted. The consistency of GCN is poorly understood in microbial eukaryotes, and the causes and consequences of CNV at the genomic level [2, 4, 5] have been rarely examined.

Estimates of CNV in other marine microbial eukaryotes suggested that variation of 1–3 orders of magnitude in rRNA, as was found in foraminifera, ciliates, haptophytes, fungi and diatoms, was considered very high [10,11,12], and was not always related to cell or genome size [10, 58]. In this study, rRNA CNV in one genus of a common harmful algae (Alexandrium spp) was found to span ~6 orders of magnitude. This should be considered the minimum variation, as we have studied comparatively few species of this genus. Previous estimates of genomic rRNA in Alexandrium spp indicated 102–103 copies cell−1 in A. minutum, [16]; from 104–106 copies cell−1 in A. pacificum [9], and A. catenella [17, 44, 45], consistent with our study. Very high rRNA GCN (106) are also known from other Gonyaulacales spp [59, 60] measured using techniques including digital qPCR. Species of Alexandrium therefore span from amongst the lowest to the highest rRNA genomic copies cell−1 of dinoflagellates (Fig. 6a). In our study, most significantly, substantial intra-specific CNV was found among individuals collected from the same population (Fig. 4), of up to 2 orders of magnitude. Recently, ~1 order of magnitude CNV among 14 strains of the haptophyte Emiliania huxleyi from different global locations was reported [10]. Using single cell qPCR, two orders of magnitude difference in rRNA gene copies was found within a species of foraminifera [12]. Our results and these recent studies indicate that intraspecific rRNA CNV in marine microbial eukaryotes may be common, suggesting that rRNA gene copies cell−1 of multiple strains is required to determine reasonably representative GCN ranges. While CNV likely impacts analyses of all marine microbial eukaryotes, not addressing this will particularly impact dinoflagellate dominated assemblages, and assuming similarity in GCN in relation to the degree of phylogenetic relatedness appears to be ineffective.

a Relationship between log10 genome size (pg cell−1) and log10 rRNA copies cell−1 in dinoflagellates from the current study and previous literature (F = 44.2, df=74, p < 0.0001, r2 = 0.374). Regression line and 95% confidence interval shown. Purple circles= Suessiales spp, red circles= Gymnodiniales spp, green circles= Prorocentrales spp, orange circles= Peridiniales spp, blue circles= Gonyaulacales spp, yellow circles=Dinophysiales spp. Black triangles = data from the present study. b Relationship between log10 cell volume (µm3) and log10 rRNA copies cell−1 in dinoflagellates, from the present and previous studies (F = 0.26.59, df=106, p < 0.0001, r2 = 0.200). c Relationship between log10 cell volume (µm3) and log10 rRNA copies cell−1 in dinoflagellates, from published studies only (F = 16.4, df=58, p = 0.0002, r2 = 0.2204). [data from: 9, 10, 13, 19, 30, 38, 54, 55, 62, 63, 71, 74, 75, 76, 77, 78, 79, 81].

Differences in GCN can be the result of genome scale processes including polyploidy, as well as processes impacting part of the genome such as aneuploidy, chromosome duplications, errors during homologous recombination and retroposition [27, 31, 32]. Mutations and selection while held in laboratory culture could result in CNV, potentially with functional implications, as has been reported in bacteria and fungi, and specific processes may particularly impact rRNA loci [3, 61]. In rRNA genes, the presence of tandem repeats that are highly transcribed, and the difficulty of replicating repetitive sequences, make rDNA inherently less stable than other genes and susceptible to CNV [47]. Over cycles of cell division, rRNA GCN in yeast and bacteria in culture showed a pattern of reduction in fungi via recombination-mediated loss [62,63,64]. As well as these inherent genetic factors, environmental factors can lead to induced CNV in rRNA genes in some microbes [58, 61, 64, 65]. In the ciliates Entodinium, Epidinium and Ophryoscolex, GCN of rRNA changes in response to nutrient availability in cultures [65]. Higher rRNA GCN has ecological implications, providing protection against mutagens [62, 63] and impacting growth and competitive ability [58, 65].

We examined CNV in relation to generations in culture and geographic regions of isolation. Our strains spanned periods in culture of several months to ~30 years, yet no significant impact was found on either variability of genome size or rRNA CNV (Fig. 2, Supplementary Fig. 1), Intraspecific genetic diversity in rRNA gene copies in A. pacificum isolates from sites distant to one another was greater in natural populations than the amount induced by up to 30 years of laboratory selection (Fig. 4). The source of such CNV may be retroposition, as it appears common in dinoflagellates [21, 27, 31], and was found to be the source of multi-copies of pcna in dinoflagellates [66], as sequences had the remains of dinoflagellate specific spliced leader sequences, which are added to mRNA, evidence of reverse transcription into the genome. In Symbiodiniaceae, chromosomes may be enriched in relation to specific biological processes, suggesting that chromosomes may be duplicated or lost as required, creating CNV [28,29,30]. Whether through a process specific to rRNA, retroposition and/or chromosome duplications, CNV appears to occur over long evolutionary time scales based on our data, as dinoflagellate mutation rates on a decadal time scale were not a significant factor in the high CNV.

It has been calculated that <0.1% of species for which genome size data are available have genomes larger than 100 Gb [24], highlighting the exceptionally large genomes of dinoflagellates [21, 22, 24, 28,29,30]. In eukaryotes, there is no correlation between genome size and coding sequences in genomes larger than 0.01 Gb, known as the C-value enigma [21, 26, 67]. Alexandrium species were in the middle to upper half of reported dinoflagellate genome sizes (Fig. 1, Supplementary Fig. 2B), indicating expansion during evolution. Cell cycle synchronised genome sizes of A. pacificum varied ~ 3 fold (Fig. 1), greater than the already large range in previous reports of strains of this species (60–104) pg cell−1 [9, 22, 40, 68, 69]. Specialised ‘ribosomal chromosomes’ were found in the former Alexandrium tamarense species complex, including A. pacificum but not in A. minutum, [68], which in our study and previous studies showed much more consistent genome sizes among isolates (22–29 pg cell−1 in previous studies, as in this study [39, 40]). A significant relationship between genome size and rRNA gene copies−1 cell within A. pacificum (Fig. 6a) may indicate that rRNA chromosome duplication contributes to genome size expansion in this species [68], but appears unimportant within the other Alexandrium species examined.

Previous studies have measured dinoflagellate genomes using a variety of methods: flow cytometry, staining with propidium iodide or other DNA dyes, and using cells of chicken or a similarly sized animal genome as a standard, as well as estimates from genome sequencing (i.e. [22, 28,29,30, 39, 66, 67, 70], Supplementary Table 3). Genome size measurements based on flow cytometry using a standard of an animal genome, which is smaller and differs structurally from a dinoflagellate genome, may lead to larger genome size estimates as compared to estimates of sequenced dinoflagellate genomes (Supplementary Table 3). However, the pattern was not consistent, and some similar estimates were determined regardless of the method (Supplementary Table 3). Dinoflagellates have unusually large genomes, and therefore standards of the appropriate size are generally not available. We ensured that the standard curve used in this measurement achieved good linearity to minimise any potential error (Supplementary Materials).

Given the ubiquity of microbial eukaryotic ecological analyses routinely targeting rRNA genes for qPCR and meta-barcoding, it can appear as though such approaches are very well characterised [9, 10, 13, 17, 20, 56, 59, 71]. Considerable and justifiable attention has focused on sources of bias that confound universal rRNA gene markers, particularly PCR primer bias, sequencing bias and statistical bias [3, 72, 73]. Primers have been developed that address primer bias [72, 73], statistical approaches have addressed other biases, however less attention has been shown to GCN bias until recently [3, 74,75,76]. CNV is of particular concern for qPCR-based assays that are increasingly applied for the detection of HAB species such as Alexandrium spp [9, 16, 17, 44, 45, 56, 59, 60]. Previous estimates of cell abundance using rRNA-targeted qPCR of HAB species indicate both over and under-estimated abundances of up to 500% [45, 60, 77]. Despite the high variability in rRNA GCN within a population and species (Fig. 4), it is possible to calibrate a qPCR result against another type of environmental cell count via detailed light microscopy or fluorescently labelled flow cytometry[44, 45, 60]. A sampling design that integrated CNV of multiple populations co-occurring in a region could determine a local rRNA GCN as a calibration proxy. Because mutation rates of rRNA GCN were low over decades (Fig. 2), if a HAB species was seeded via local cyst beds, it may not be necessary to recalibrate with every sample, as doing so would render the simplicity, speed and scalability of qPCR-based quantification obsolete.

Community profiling using molecular barcoding presents a potentially more difficult challenge than qPCR in accommodating CNV, due to unknown variability between taxa and the lack of information on GCN. GCN correction factors are an option for addressing this, and can be incorporated into barcoding analyses [11, 74,75,76]. However, GCN data is very limited and patchily distributed. Current correction approaches rely on prediction using phylogenetically related taxa [65], however GCN of bacterial and fungal rRNA is only moderately phylogenetically conserved [58, 65, 74] resulting in increasingly inaccurate GCN predictions for diverging lineages [65]. The first study attempting to apply a GCN correction factor for marine microbial rRNA genes used a single mean GCN cell−1 for each class, from a database of ~60 species, of 4919 copies cell−1 for dinoflagellates, 166 copies cell−1 for diatoms and 71,710 copies cell−1 for ciliates [77]. While this is an improvement over using GCN as a direct proxy of ASV abundance, the level of variability found in our study shows that such an approach is limited. Genome sizes, potentially a proxy for GCN [Fig. 6a], are variable and unknown for most dinoflagellates, so this approach also appears to be unfeasible. [65] consider bioinformatic-based solutions to the CNV of rRNA genes is not realistic for microbial eukaryotes, and based on our current data, we agree with that position.

Meta-barcoding of microbial eukaryotes has been referred to as semi-quantitative, as it has been suggested that the relative abundance of taxa may be proportionately representative of the community [20]. However, analyses of bias in metabarcoding of dinoflagellate mock communities using multiple primers targeted to rRNA genes, or comparisons with microscopy counts, found ASV abundances were highly skewed [78, 79]. It has been argued that a lack of correlation between cell number and rRNA genes is unimportant, as rRNA gene copies cell−1 is correlated with cell volume [13, 19, 20, 69] in marine eukaryotes, providing a stronger indication of ecosystem function than cell abundance. However, we found only a weak relationship of rRNA gene copies cell−1 with cell volume in dinoflagellates, explaining only 20% of the variance (Fig. 6b, F = 0.26.59, df=106, p < 0.0001, r2 = 0.200). This was independent of our present study, as, even when we analysed published rRNA gene copy, cell volume and genome size data [9, 10, 16, 19, 22, 40, 59, 60, 67,68,69,70, 80,81,82,83,84,85] we found 22% of variance explained (Fig. 6c, F = 16.4, df=58, p = 0.0002, r2 = 0.2204). When analysing only rRNA copies cell−1 in relation to cell volume for Gonyaulacales spp, the order including many of the most important HAB taxa, we found no significant relationship (Fig. 6c, Supplementary Fig. 2, F = 3.26, df=85, p=ns, r2 = 0.037). This may be indicative of high evolutionary rates of rRNA genes in this order, noted in relation to long branch lengths in rRNA gene-based phylogenies [86]. Dinoflagellates with the smallest genomes, such as Suessiales spp, appear to show a more straightforward relationship between genome size, cell volume and GCN (Fig. 6a–c), but polyploidy and chromosome duplication are still known from these taxa [85].

Given that ecosystem function is generally of primary interest in microbial ecological research, we suggest metabarcoding be considered a diversity presence/absence measure, and functional genes be quantified using metagenomics or gene specific assays in relation to traits of interest. Our data indicate that comparably fewer copies of the gene related to PST synthesis, sxtA4, were present in PST-producing strains (2–102 copies cell−1) in Alexandrium, with a smaller range of CNV (Fig. 3c), consistent with studies of sxtA4 in A. minutum (1.5–46 copies cell−1; [39, 40]) and A. pacificum (34–200 copies cell−1; [37, 40, 44]). Both the reduced CNV, combined with the significant correlation of sxtA4 GCN with PSTs cell−1 indicate that sxtA4 is an informative target for the potential for PST production in situ. We found a variety of PST analogues produced by Alexandrium species, with some species such as A.catenella more consistent in analogues produced than others (Fig. 5). sxtA is involved in the synthesis of the parent compound, saxitoxin, [36, 37] and tailoring enzymes in the sxt cluster appear to be responsible for analogues, suggesting that in future these genes could be quantified. An advantage of the sxtA detection is that food web dynamics can be investigated, for example, quantifying sxtA uptake in invertebrates or protists [87]. Examples of similar functional genes that have been or could be detected in situ are cell cycle related genes such as pcna involved in proliferation and growth of HAB species [66], transporters, receptors, genes involved in N and P uptake and other metabolic functions [13, 21, 29, 30, 85]. Given the potential for such genes to show a dosage response in dinoflagellates, this could indicate a promising approach to community ecology in dinoflagellate dominated ecosystems.

We have shown rRNA gene CNV of up to 6 orders of magnitude in dinoflagellates at the class, genus and species levels, and its relationship to genome size, but not generations in laboratory culture, and very weakly with cell volume. Reported significant correlations between cell volume and rRNA genomic copies in marine microbial eukaryotes were likely driven by taxa with more straightforward genome organisation, rather than dinoflagellates, ciliates and foraminifera, which taken together can constitute the majority of 18 S rRNA signal. We show a significant dosage effect of a functional gene related to a common HAB toxin, and suggest that such low copy functional genes are more stable and informative targets for eukaryotic microbial ecological profiling, using function-based methods such as metagenomics and specific trait-based molecular assays.

Data will be made available for all non-commercial purposes.

Hardigan MA, Crisovan E, Hamilton JP, Kim J, Laimbeer P, Leisner CP, et al. Genome reduction uncovers a large dispensable genome and adaptive role for copy nmber vriation in aexually popagated Solanum tuberosum. Plant Cell. 2016;28:388–405.

Article CAS PubMed PubMed Central Google Scholar

Perry GH, Dominy NJ, Claw KG, Lee AS, Fiegler H, Redon R, et al. Diet and the evolution of human amylase gene copy number variation. Nat Genet. 2007;39:1256–60.

Article CAS PubMed PubMed Central Google Scholar

Soppa J. Polyploidy and community structure. Nat Microbiol. 2017;2:16261.

Article CAS PubMed Google Scholar

Conrad DF, Pinto D, Redon R, Feuk L, Gokcumen O, Zhang Y, et al. Origins and functional impact of copy number variation in the human genome. Nature. 2010;464:704–12.

Article CAS PubMed Google Scholar

Zerulla K, Soppa J. Polyploidy in haloarchaea: advantages for growth and survival. Front Microbiol. 2014;5:274.

Article PubMed PubMed Central Google Scholar

Iantorno SA, Durrant C, Khan A, Sanders MJ, Beverley SM, Warren WC, et al. Gene expression in leishmania is regulated predominantly by gene dosage. mBio. 2017;8:e01393-17.

Article Google Scholar

Gillard GB, Gronvold L, Rosaeg LL, Holen MM, Monsen O, Koop BF, et al. Comparative regulomics supports pervasive selection on gene dosage following whole genome duplication. Genome Biol. 2021;22:103.

Article CAS PubMed PubMed Central Google Scholar

Keeling PJ, Campo JD. Marine protists are not just big bacteria. Curr Biol. 2017;27:R541–R549.

Article CAS PubMed Google Scholar

Galluzzi L, Bertozzini E, Penna A, Perini F, Garcés E, Magnani M. Analysis of rRNA gene content in the Mediterranean dinoflagellate Alexandrium catenella and Alexandrium taylori: implications for the quantitative real-time PCR-based monitoring methods. J Appl Phycol. 2010;22:1–9.

Article CAS Google Scholar

Gong W, Marchetti A. Estimation of 18S gene copy number in marine eukaryotic plankton using a next-generation sequencing approach. Front Marine Sci. 2019;6:219.

Article Google Scholar

Wang C, Zhang T, Wang Y, Katz LA, Gao F, Song W. Disentangling sources of variation in SSU rDNA sequences from single cell analyses of ciliates: impact of copy number variation and experimental error. Proc Biol Sci. 2017;284:20170425.

PubMed PubMed Central Google Scholar

Milivojevic T, Rahman SN, Raposo D, Siccha M, Kucera M, Morard R. High variability in SSU rDNA gene copy number among planktonic foraminifera revealed by single-cell qPCR. ISME Commun. 2021;1:63.

Article PubMed PubMed Central Google Scholar

de Vargas C, Audic S, Henry N, Decelle J, Mahé F, Logares R, et al. Ocean plankton. Eukaryotic plankton diversity in the sunlit ocean. Science. 2015;348:1261605.

Article PubMed Google Scholar

Thornhill DJ, Lajeunesse TC, Santos SR. Measuring rDNA diversity in eukaryotic microbial systems: how intragenomic variation, pseudogenes, and PCR artifacts confound biodiversity estimates. Mol Ecol. 2007;16:5326–40.

Article CAS PubMed Google Scholar

Egge E, Bittner L, Andersen T, Audic S, de Vargas C, Edvardsen B. 454 pyrosequencing to describe microbial eukaryotic community composition, diversity and relative abundance: a test for marine haptophytes. PLoS One. 2013;8:e74371.

Article CAS PubMed PubMed Central Google Scholar

Galluzzi L, Penna A, Bertozzini E, Vila M, Garces E, Magnani M. Development of a real-time PCR assay for rapid detection and quantification of Alexandrium minutum (a Dinoflagellate). Appl Environ Microbiol. 2004;70:1199–206.

Article CAS PubMed PubMed Central Google Scholar

Erdner DL, Percy L, Keafer B, Lewis J, Anderson DM. A quantitative real-time PCR assay for the identification and enumeration of Alexandrium cysts in marine sediments. Deep Sea Res Part 2 Top Stud Oceanogr. 2010;57:279–87.

Article CAS PubMed Central Google Scholar

Prokopowich CD, Gregory TR, Crease TJ. The correlation between rDNA copy number and genome size in eukaryotes. Genome. 2003;46:48–50.

Article CAS PubMed Google Scholar

Godhe A, Asplund ME, Harnstrom K, Saravanan V, Tyagi A, Karunasagar I. Quantification of diatom and dinoflagellate biomasses in coastal marine seawater samples by real-time PCR. Appl Environ Microbiol. 2008;74:7174–82.

Article CAS PubMed PubMed Central Google Scholar

Le Bescot N, Mahé F, Audic S, Dimier C, Garet MJ, Poulain J, et al. Global patterns of pelagic dinoflagellate diversity across protist size classes unveiled by metabarcoding. Environ Microbiol. 2016;18:609–26.

Article PubMed Google Scholar

Lin S. Genomic understanding of dinoflagellates. Res Microbiol. 2011;162:551–69.

Article CAS PubMed Google Scholar

LaJeunesse TC, Lambert G, Andersen RA, Coffroth MA, Galbraith DW. Symbiodinium (Pyrrhophyta) genome sizes (DNA Content) are smallest among dinoflagellates. J Phycol. 2005;41:880–6.

Article CAS Google Scholar

Hackett JD, Scheetz TE, Yoon HS, Soares MB, Bonaldo MF, Casavant TL, et al. Insights into a dinoflagellate genome through expressed sequence tag analysis. BMC Genomics. 2005;6:80.

Article PubMed PubMed Central Google Scholar

Hidalgo O, Pellicer J, Christenhusz M, Schneider H, Leitch AR, Leitch IJ. Is There an Upper Limit to Genome Size? Trends Plant Sci. 2017;22:567–73.

Article CAS PubMed Google Scholar

Pellicer J, Hidalgo O, Dodsworth S, Leitch IJ. Genome size diversity and its impact on the evolution of land plants. Genes (Basel). 2018;9:88.

Article PubMed Google Scholar

Murray SA, Suggett DJ, Doblin MA, Kohli GS, Seymour JR, Fabris M, et al. Unravelling the functional genetics of dinoflagellates: a review of approaches and opportunities. Perspect Phycol. 2016;3:37–52.

Google Scholar

Bachvaroff TR, Place AR. From stop to start: tandem gene arrangement, copy number and trans-splicing sites in the dinoflagellate Amphidinium carterae. PLoS One. 2008;3:e2929.

Article PubMed PubMed Central Google Scholar

Liu H, Stephens TG, Gonzalez-Pech RA, Beltran VH, Lapeyre B, Bongaerts P, et al. Symbiodinium genomes reveal adaptive evolution of functions related to coral-dinoflagellate symbiosis. Commun Biol. 2018;1:95.

Article PubMed PubMed Central Google Scholar

Lin S, Cheng S, Song B, Zhong X, Lin X, Li W, et al. The Symbiodinium kawagutii genome illuminates dinoflagellate gene expression and coral symbiosis. Science. 2015;350:691–4.

Article CAS PubMed Google Scholar

Stephens TG, Gonzalez-Pech RA, Cheng Y, Mohamed AR, Burt DW, Bhattacharya D, et al. Genomes of the dinoflagellate Polarella glacialis encode tandemly repeated single-exon genes with adaptive functions. BMC Biol. 2020;18:56.

Article CAS PubMed PubMed Central Google Scholar

Slamovits CH, Keeling PJ. Widespread recycling of processed cDNAs in dinoflagellates. Curr Biol. 2008;18:R550–2.

Article CAS PubMed Google Scholar

Song B, Chen S, Chen W. Dinoflagellates, a unique lineage for retrogene research. Front Microbiol. 2018;9:1556.

Article PubMed PubMed Central Google Scholar

Goetz EJ, Greco M, Rappaport HB, Weiner AKM, Walker LM, Bowser S, et al. Foraminifera as a model of the extensive variability in genome dynamics among eukaryotes. Bioessays. 2022;44:e2100267.

Article PubMed Google Scholar

Kouakou CRC, Poder TG. Economic impact of harmful algal blooms on human health: a systematic review. J Water Health. 2019;17:499–516.

Article PubMed Google Scholar

Ryderheim F, Selander E, Kiorboe T. Predator-induced defence in a dinoflagellate generates benefits without direct costs. ISME J. 2021;15:2107–16.

Article CAS PubMed PubMed Central Google Scholar

Hackett JD, Wisecaver JH, Brosnahan ML, Kulis DM, Anderson DM, Bhattacharya D, et al. Evolution of saxitoxin synthesis in cyanobacteria and dinoflagellates. Mol Biol Evol. 2013;30:70–8.

Article CAS PubMed Google Scholar

Stuken A, Orr RJ, Kellmann R, Murray SA, Neilan BA, Jakobsen KS. Discovery of nuclear-encoded genes for the neurotoxin saxitoxin in dinoflagellates. PLoS One. 2011;6:e20096.

Article CAS PubMed PubMed Central Google Scholar

Orr RJ, Stuken A, Murray SA, Jakobsen KS. Evolution and distribution of saxitoxin biosynthesis in dinoflagellates. Mar Drugs. 2013;11:2814–28.

Article PubMed PubMed Central Google Scholar

Stuken A, Riobo P, Franco J, Jakobsen KS, Guillou L, Figueroa RI. Paralytic shellfish toxin content is related to genomic sxtA4 copy number in Alexandrium minutum strains. Front Microbiol. 2015;6:404.

Article PubMed PubMed Central Google Scholar

Geffroy S, Lechat MM, Le Gac M, Rovillon GA, Marie D, Bigeard E, et al. From the sxtA4 gene to saxitoxin production: what controls the variability among alexandrium minutum and alexandrium pacificum strains? Front Microbiol. 2021;12:613199.

Article PubMed PubMed Central Google Scholar

Savela H, Harju K, Spoof L, Lindehoff E, Meriluoto J, Vehniainen M, et al. Quantity of the dinoflagellate sxtA4 gene and cell density correlates with paralytic shellfish toxin production in Alexandrium ostenfeldii blooms. Harmful Algae. 2016;52:1–10.

Article CAS PubMed Google Scholar

Perini F, Galluzzi L, Dell’Aversano C, Iacovo ED, Tartaglione L, Ricci F, et al. SxtA and sxtG gene expression and toxin production in the Mediterranean Alexandrium minutum (Dinophyceae). Mar Drugs. 2014;12:5258–76.

Article CAS PubMed PubMed Central Google Scholar

Wiese M, Murray SA, Alvin A, Neilan BA. Gene expression and molecular evolution of sxtA4 in a saxitoxin producing dinoflagellate Alexandrium catenella. Toxicon. 2014;92:102–12.

Article CAS PubMed Google Scholar

Murray SA, Wiese M, Stuken A, Brett S, Kellmann R, Hallegraeff G, et al. sxtA-based quantitative molecular assay to identify saxitoxin-producing harmful algal blooms in marine waters. Appl Environ Microbiol. 2011;77:7050–7.

Article CAS PubMed PubMed Central Google Scholar

Murray SA, Ruvindy R, Kohli GS, Anderson DM, Brosnahan ML. Evaluation of sxtA and rDNA qPCR assays through monitoring of an inshore bloom of Alexandrium catenella Group 1. Sci Rep. 2019;9:14532.

Article PubMed PubMed Central Google Scholar

Murray SA, Diwan R, Orr RJ, Kohli GS, John U. Gene duplication, loss and selection in the evolution of saxitoxin biosynthesis in alveolates. Mol Phylogenet Evol. 2015;92:165–80.

Article CAS PubMed Google Scholar

Salim D, Gerton JL. Ribosomal DNA instability and genome adaptability. Chromosome Res. 2019;27:73–87.

Article CAS PubMed Google Scholar

Keller MD, Selvin RC, Claus W, Guillard RRL. Media for the culture of oceanic ultraphytoplankton. J Phycol. 1987;23:633–8.

Article Google Scholar

Scholin CA, Herzog M, Sogin M, Anderson DM. Identification of group- and strain-specific genetic markers for globally distributed Alexandrium (Dinophyceae).II. Sequence analysis of a fragment of the LSU rRNA gene. J Phycol. 1994;30:999–1011.

Article CAS Google Scholar

Nunn GB, Theisen BF, Christensen B, Arctander P. Simplicity-correlated size growth of the nuclear 28S ribosomal RNA D3 expansion segment in the crustacean order Isopoda. J Mol Evol. 1996;42:211–23.

Article CAS PubMed Google Scholar

Doblin MA, Blackburn SI, Hallegraeff GM. Growth and biomass stimulation of the toxic dinoflagellate Gymnodinium catenatum (Graham) by dissolved organic substances. J Exp Mar Bio Ecol. 1999;236:33–47.

Article CAS Google Scholar

Taroncher-Oldenburg G, Kulis DM, Anderson DM. Toxin variability during the cell cycle of the dinoflagellate Alexandrium fundyense. Limnol Oceanogr. 1997;42:1178–88.

Article CAS Google Scholar

Figueroa RI, Garcés E, Bravo I. The use of flow cytometry for species identification and life-cycle studies in dinoflagellates. Deep Sea Res Part 2 Top Stud Oceanogr. 2010;57:301–7.

Article CAS Google Scholar

Vindelov LL, Christensen IJ, Nissen NI. Standardization of high-resolution flow cytometric DNA analysis by the simultaneous use of chicken and trout red blood cells as internal reference standards. Cytometry. 1983;3:328–31.

Article CAS PubMed Google Scholar

Doležel J, Bartoš J, Voglmayr H, Greilhuber J. Nuclear DNA content and genome size of trout and human. Cytometry Part A. 2003;51A:127–8.

Article Google Scholar

Ruvindy R, Bolch CJ, MacKenzie L, Smith KF, Murray SA. qPCR assays for the detection and quantification of multiple paralytic shellfish toxin-producing species of Alexandrium. Front Microbiol. 2018;9:3153.

Article PubMed PubMed Central Google Scholar

Harwood DT, Boundy M, Selwood AI, van Ginkel R. Refinement and implementation of the Lawrence method (AOAC 2005.06) in a commercial laboratory: assay performance during an Alexandrium catenella bloom event. Harmful Algae. 2013;24:20–31.

Article CAS Google Scholar

Lofgren LA, Uehling JK, Branco S, Bruns TD, Martin F, Kennedy PG. Genome-based estimates of fungal rDNA copy number variation across phylogenetic scales and ecological lifestyles. Mol Ecol. 2019;28:721–30.

Article PubMed Google Scholar

Yarimizu K, Sildever S, Hamamoto Y, Tazawa S, Oikawa H, Yamaguchi H, et al. Development of an absolute quantification method for ribosomal RNA gene copy numbers per eukaryotic single cell by digital PCR. Harmful Algae. 2021;103:102008.

Article CAS PubMed Google Scholar

Nishimura T, Hariganeya N, Tawong W, Sakanari H, Yamaguchi H, Adachi M. Quantitative PCR assay for detection and enumeration of ciguatera-causing dinoflagellate Gambierdiscus spp. (Gonyaulacales) in coastal areas of Japan. Harmful Algae. 2016;52:11–22.

Article CAS PubMed Google Scholar

Nelson JO, Watase GJ, Warsinger-Pepe N, Yamashita YM. Mechanisms of rDNA copy number maintenance. Trends Genet. 2019;35:734–42.

Article CAS PubMed PubMed Central Google Scholar

Ide S, Miyazaki T, Maki H, Kobayashi T. Abundance of ribosomal RNA gene copies maintains genome integrity. Science. 2010;327:693–6.

Article CAS PubMed Google Scholar

Kobayashi T. Regulation of ribosomal RNA gene copy number and its role in modulating genome integrity and evolutionary adaptability in yeast. Cell Mol Life Sci. 2011;68:1395–403.

Article CAS PubMed PubMed Central Google Scholar

Nemergut DR, Knelman JE, Ferrenberg S, Bilinski T, Melbourne B, Jiang L, et al. Decreases in average bacterial community rRNA operon copy number during succession. ISME J. 2016;10:1147–56.

Article CAS PubMed Google Scholar

Lavrinienko A, Jernfors T, Koskimäki JJ, Pirttilä AM, Watts PC. Does intraspecific variation in rDNA copy number affect analysis of microbial communities? Trends in Microbiology. 2021;29:19–27.

Article CAS PubMed Google Scholar

Hou Y, Ji N, Zhang H, Shi X, Han H, Lin S. Genome size-dependent pcna gene copy number in dinoflagellates and molecular evidence of retroposition as a major evolutionary mechanism. J Phycol. 2019;55:37–46.

Article CAS PubMed Google Scholar

Hou Y, Lin S. Distinct gene number-genome size relationships for eukaryotes and non-eukaryotes: gene content estimation for dinoflagellate genomes. PLoS One. 2009;4:e6978.

Article PubMed PubMed Central Google Scholar

Figueroa RI, Cuadrado A, Stüken A, Rodríguez F, Fraga S. Ribosomal DNA organization patterns within the dinoflagellate genus Alexandrium as revealed by FISH: life cycle and evolutionary implications. Protist. 2014;165:343–63.

Article CAS PubMed Google Scholar

Liu Y, Hu Z, Deng Y, Shang L, Gobler CJ, Tang YZ. Dependence of genome size and copy number of rRNA gene on cell volume in dinoflagellates. Harmful Algae. 2021;109:102108.

Article PubMed Google Scholar

Kohli GS, John U, Figueroa RI, Rhodes LL, Harwood DT, Groth M, et al. Polyketide synthesis genes associated with toxin production in two species of Gambierdiscus (Dinophyceae). BMC Genomics. 2015;16:410.

Article PubMed PubMed Central Google Scholar

Ott BM, Litaker RW, Holland WC, Delwiche CF. Using RDNA sequences to define dinoflagellate species. PLOS ONE. 2022;17:e0264143.

Article CAS PubMed PubMed Central Google Scholar

McNichol J, Berube PM, Biller SJ, Fuhrman J A Evaluating and Improving Small Subunit rRNA PCR Primer Coverage for Bacteria, Archaea, and Eukaryotes Using Metagenomes from Global Ocean Surveys. mSystems 2021; 6: https://doi.org/10.1128/msystems.00565-21.

Parada AE, Needham DM, Fuhrman JA. Every base matters: assessing small subunit rRNA primers for marine microbiomes with mock communities, time series and global field samples. Environ Microbiol. 2016;18:1403–14.

Article CAS PubMed Google Scholar

Gonzalez-de-Salceda L, Garcia-Pichel F. The allometry of cellular DNA and ribosomal gene content among microbes and its use for the assessment of microbiome community structure. Microbiome. 2021;9:173.

Article CAS PubMed PubMed Central Google Scholar

Louca S, Doebeli M, Parfrey LW. Correcting for 16S rRNA gene copy numbers in microbiome surveys remains an unsolved problem. Microbiome. 2018;6:41.

Article PubMed PubMed Central Google Scholar

Stoddard SF, Smith BJ, Hein R, Roller BRK, Schmidt TM. rrnDB: improved tools for interpreting rRNA gene abundance in bacteria and archaea and a new foundation for future development. Nucleic Acids Res. 2015;43:D593–598.

Article CAS PubMed Google Scholar

Godhe A, Cusack C, Pedersen J, Andersen P, Anderson DM, Bresnan E, et al. Intercalibration of classical and molecular techniques for identification of Alexandrium fundyense (Dinophyceae) and estimation of cell densities. Harmful Algae. 2007;6:56–72.

Article CAS Google Scholar

Smith KF, Kohli GS, Murray SA, Rhodes LL. Assessment of the metabarcoding approach for community analysis of benthic-epiphytic dinoflagellates using mock communities. N Z J Marine Freshwater Res. 2017;51:555–76.

Article CAS Google Scholar

Santi I, Kasapidis P, Karakassis I, Pitta P. A comparison of DNA metabarcoding and microscopy methodologies for the study of aquatic microbial eukaryotes. Diversity. 2021;13:180.

Article CAS Google Scholar

Martin JL, Santi I, Pitta P, John U, Gypens N. Towards quantitative metabarcoding of eukaryotic plankton: an approach to improve 18S rRNA gene copy number bias. MBMG. 2022;6:e85794.

Article Google Scholar

Kretzschmar AL, Verma A, Kohli G, Murray S. Development of a quantitative PCR assay for the detection and enumeration of a potentially ciguatoxin-producing dinoflagellate, Gambierdiscus lapillus (Gonyaulacales, Dinophyceae). PLOS ONE. 2019;14:e0224664.

Article CAS PubMed PubMed Central Google Scholar

Menden-Deuer S, Lessard EJ. Carbon to volume relationships for dinoflagellates, diatoms, and other protist plankton. Limnol Oceanogr. 2000;45:569–79.

Article CAS Google Scholar

Zhu F, Massana R, Not F, Marie D, Vaulot D. Mapping of picoeucaryotes in marine ecosystems with quantitative PCR of the 18S rRNA gene. FEMS Microbiol Ecol. 2005;52:79–92.

Article CAS PubMed Google Scholar

Vandersea MW, Kibler SR, Holland WC, Tester PA, Schultz TF, Faust MA, et al. Development of semi-quantitative PCR assays for the detection and enumeration of Gambierdiscus species (Gonyaulacales, Dinophyceae). J Phycol. 2012;48:902–15.

Article CAS PubMed Google Scholar

Nand A, Zhan Y, Salazar OR, Aranda M, Voolstra CR, Dekker J. Genetic and spatial organization of the unusual chromosomes of the dinoflagellate Symbiodinium microadriaticum. Nature Genetics. 2021;53:618–29.

Article CAS PubMed PubMed Central Google Scholar

Kretzschmar AL, Verma A, Murray S, Kahlke T, Fourment M, Darling AE. Trial by phylogenetics—Evaluating the Multi-Species Coalescent for phylogenetic inference on taxa with high levels of paralogy (Gonyaulacales, Dinophyceae). bioRxiv. https://doi.org/10.1101/683383.

Farrell H, O’Connor WA, Seebacher F, Harwood TD, Murray S. Molecular detection of the SxtA gene from saxitoxin-producing Alexandrium minutum in commercial oysters. J Shellfish Res. 2016;35:169–77.

Article Google Scholar

Download references

We thank Roscoff Culture Collection, Helene Hegaret, Laure Guillou, Marc Long, Malwenn Lassudrie, the Cawthron Collection and CSIRO for the cultures used in this study. We thank Kun Xiao for the assistance in flow cytometry. We thank Jacqui Stuart for help with the figures. This study was partially supported by a UTS HDR Scholarship to RR and an Australian Research Council FT to SM.

University of Technology Sydney, School of Life Sciences, Sydney, PO Box 123, Broadway, NSW, 2007, Australia

Rendy Ruvindy, Abanti Barua, Henna Savela & Shauna A. Murray

Institute for Marine & Antarctic Studies, University of Tasmania, Launceston, 7248, TAS, Australia

Christopher J. S. Bolch

Sydney Institute of Marine Science, Chowder Bay Rd, Mosman, NSW, Australia

Chowdhury Sarowar

Finnish Environment Institute, Marine Research Centre, Helsinki, Finland

Henna Savela

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

You can also search for this author in PubMed Google Scholar

RR and SM conceptualized and designed the study. RR, CB, SM, AB, CS generated and analyzed data, HS contributed to experimental design and sample collection and analysis. RR, SM and CB wrote the manuscript with input from all authors.

Correspondence to Shauna A. Murray.

The authors declare no competing interests.

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and Permissions

Ruvindy, R., Barua, A., Bolch, C.J.S. et al. Genomic copy number variability at the genus, species and population levels impacts in situ ecological analyses of dinoflagellates and harmful algal blooms. ISME COMMUN. 3, 70 (2023). https://doi.org/10.1038/s43705-023-00274-0

Download citation

Received: 23 December 2022

Revised: 18 June 2023

Accepted: 20 June 2023

Published: 08 July 2023

DOI: https://doi.org/10.1038/s43705-023-00274-0

Anyone you share the following link with will be able to read this content:

Sorry, a shareable link is not currently available for this article.

Provided by the Springer Nature SharedIt content-sharing initiative

SHARE