Can we characterize the microbiomes of marine ecosystems? Understanding seasonal changes and spatial variability of microbial communities is a key part of gaining insights into today’s oceans and the biogeochemical processes that microbes mediate. This knowledge also provides the baseline information against which future change can be assessed. The Baselines Initiative brings together 16S rRNA datasets that will be useful for assessing future change. This means that they must either come from multi-year sampling efforts at the same location, such as the Bermuda Atlantic Time-series Study in the North Atlantic (Western Sargasso Sea), or repeat sampling at a high spatial resolution (such as data sets from the Bay of Bengal). Three types of data are currently available, the Baselines Foundational Set, which holds near full-length Sanger 16S rRNA gene sequences, Baselines 16S V1-V2 Illumina-amplicons and Baselines 16S V1-V2 454-amplicons. If you would like us to add pointers to a dataset that meets these goals please contact We can also host pointers to 18S rRNA datasets. Baselines Release 2 is scheduled for June 2019 and will provide repeat sampling of deep ocean and deep sea sediment communities.

Baselines Foundational Set: Download the FASTA file and corresponding accession table for the Baselines Foundational Set. This contains 6,177 near-full length 16S rRNA gene sequences that have also been deposited at GenBank under the accessions KX932494 to KX938325.

Baselines Release 1 (R1) 16S Illumina Amplicons: Download the zipped quality controlled amplicon files for Baselines Release 1 16S amplicon data (paired-end Illumina V1-V2). These data are also available at BioProject PRJNA345162, PRJNA345164, and PRJNA345177.

Baselines Release 1 (R1) 16S 454 Amplicons: Eastern North Pacific 454 data is available at BioProject PRJNA258669 and Sargasso Sea BATS 454 data is available at

Metadata for all Baselines R1 samples including Baselines Foundational Set: Download the excel sheet containing available metadata for these samples and numbers of plastid and cyanobacterial sequences based on phyloassigner analysis.

Bacterial-focused V1-V2 16S analysis tool: Download the phyloassigner package and associated 16S rRNA gene alignment plus tree file for the pipeline used in Vergin et al. ISME J 2013 Vol 7:1322–1332 also used in preliminary steps of Sudek et al. Environmental Microbiology 2015 Vol 17:3692-3707 and Choi, Bachy et al. Current Biology 2017.

Plastid-focused V1-V2 16S analysis tool: Download the plastid-focused phyloassigner package containing the 16S rRNA gene alignment and treefile used to analyze Baselines Release 1 amplicon data for Choi, Bachy et al. Current Biology 2017.

Micromonas Genome Browser

What are picophytoplankton?

This is a diverse group of photosynthetic microbes which is composed of multiple different taxa within the eukaryotic Tree of Life (this online project went offline in 2021). They are grouped purely on the basis of cell size – pico being used to refer to cells less than 2 microns in diameter. Evolutionarily taxa composing this size fraction can be very different. Amongst the more commonly found marine taxa are genera within the Prasinophytes (in the eukaryotic Supergroup Archaeplastida) and the Prymnesiophytes (in the eukaryotic Supergroup Chromalveolata) as well as organisms such as Pelagomonas, a Stramenopile (also Chromalveolata). There are several excellent reviews on these organisms – for a comprehensive review from our lab please see e.g. Worden & Not in Microbial Ecology of the Oceans (ed. Kirchman, Wiley 2008).

Cultures are easily available for several lineages and genomes are available for several prasinophytes that can be purchased from a number of different culture collections.

Genomic resources for picophytoplankton

Micromonas RCC299 genome browser at JGI

Micromonas CCMP1545 genome browse at JGI

For a more detailed overview of these and other picophytoplankton genome projects go here

Updated chloroplast genome assembly and gene models for Micromonas commoda associated with Cuvelier et al. in review

Micromonas commoda CP genome assembly
Micromonas commoda CP gene model coordinates
Micromonas commoda CP protein sequences
Micromonas commoda CP gene sequences

Proteomic resources for picophytoplankton

Micromonas pusilla CCMP1545 Genomic Database for Proteomics LC-MS/MS Analysis (data files associated with Waltman et al. 2016)

Micromonas commoda RCC299 Genomic Database for Proteomics LC-MS/MS Analysis (data files associated with van Baren et al. 2016)

Getting started with picophytoplankton
(especially prasinophytes in the order Mamiellales)

Order strains (in the U.S.) from the Provasoli-Guillard National Center for Marine Algae and Microbiota

  • Micromonas RCC299, the strain sequenced for the genome project as rendered axenic and deposited by Worden lab (CCMP2709)
  • Micromonas pusilla CCMP1545, the strain sequenced for the genome project (CCMP1545)
  • Many other Mamiellales (Micromonas, Ostreococcous, Bathycoccus) as well as Pelagophytes and other taxa are available at the CCMP
  • The Roscoff Culture Collection also has a good selection of picophytoplankton cultures

Prepare culture media in artificial seawater (appropriate for Micromonas RCC299 and some other strains) using this protocol
View general culturing information and growth curves here. View fluorescent and LED light spectra here.

Extracting DNA and Other Molecular/Biochemical Protocols

Harvest marine algae and quantify thiamin cellular quotas using this protocol

Extract from cultures – with minimized shearing (i.e. appropriate for large insert libraries, or controlled shearing) using this protocol
Extract from large scale marine metagenomic samples – with minimized shearing using this protocol
Extract for environmental qPCR/single gene studies using a modification of the Qiagen DNAeasy kit protocol

Internal Lab BLAST Server and Genome Browser

To blast internal datasets (password needed) go here. For descriptions of the sequences, download this spreadsheet.

To blast publicly available relevant predicted proteomes go here. For descriptions of the sequences, download this spreadsheet.


Upper-ocean systems
Acoustical ocean ecology
Acoustic instruments
Acoustic fingerprinting
Acoustic community ecology
Acoustics in the news
Marine biogeochemistry
Ocean carbon export
Technology development
Coastal carbon cycling and ocean acidification
Carbon cycle feedbacks
Western boundary current carbon cycling
Lab news
Biological oceanography
Global modes of sea surface temperature
Krill hotspots in the California Current
Nitrate supply estimates in upwelling systems
Chemical sensors
Chemical data
Land/Ocean Biogeochemical Observatory in Elkhorn Slough
Listing of floats
SOCCOM float visualization
Periodic table of elements in the ocean
Biogeochemical-Argo Report
Profiling float
Marine microbes
Population dynamics of phytoplankton
Microbial predators
Microbe-algae interactions
Targeted metagenomics
In the news
Upcoming events and lab news
Past talks and presentations
Join the lab
Interdisciplinary field experiments
Ecogenomic Sensing
Genomic sensors
Field experiments
Harmful algal blooms (HABs)
Water quality
Environmental Sample Processor (ESP)
ESP Web Portal
In the news
Ocean observing system
Midwater research
Midwater ecology
Deep-sea squids and octopuses
Food web dynamics
Midwater time series
Respiration studies
Zooplankton biodiversity
Seafloor processes
Revealing the secrets of Sur Ridge
Exploring Sur Ridge’s coral gardens
Life at Sur Ridge
Mapping Sur Ridge
Biology and ecology
Effects of humans
Ocean acidification, warming, deoxygenation
Lost shipping container study
Effects of upwelling
Faunal patterns
Previous research
Technology development
High-CO2 / low-pH ocean
Benthic respirometer system
Climate change in extreme environments
Station M: A long-term observatory on the abyssal seafloor
Station M long-term time series
Monitoring instrumentation suite
Sargasso Sea research
Antarctic research
Geological changes
Arctic Shelf Edge
Continental Margins and Canyon Dynamics
Coordinated Canyon Experiment
CCE instruments
CCE repeat mapping data
Monterey Canyon: A Grand Canyon beneath the waves
Submarine volcanoes
Mid-ocean ridges
Magmatic processes
Volcanic processes
Explosive eruptions
Hydrothermal systems
Back arc spreading ridges
Near-ridge seamounts
Continental margin seamounts
Non-hot-spot linear chains
Eclectic seamounts topics
Margin processes
Hydrates and seeps
California borderland
Hot spot research
Hot-spot plumes
Magmatic processes
Volcanic processes
Explosive eruptions
Volcanic hazards
Hydrothermal systems
Flexural arch
Coral reefs
ReefGrow software
Eclectic topics
Submarine volcanism cruises
Volcanoes resources
Areas of study
Automated chemical sensors
Methane in the seafloor
Microscopic biology research
Open ocean biology research
Seafloor biology research
Volcanoes and seamounts
Hydrothermal vents
Methane in the seafloor
Submarine canyons
Earthquakes and landslides
Ocean acidification
Physical oceanography and climate change
Ocean circulation and algal blooms
Ocean cycles and climate change
Past research
Molecular ecology
Molecular systematics
SIMZ Project
Bone-eating worms
Gene flow and dispersal
Molecular-ecology expeditions
Ocean chemistry of greenhouse gases
Emerging science of a high CO2/low pH ocean
Research publications