Darwin Core, GBIF, and OBIS Reference Guide
Reference Guide: Darwin Core, the Unified GBIF Data Model, and OBIS
This guide introduces the Darwin Core (DwC) standard and its role in supporting data interoperability and publication for Fisheries and Oceans Canada (DFO). It also provides an overview of the Unified GBIF Data Model and the Ocean Biodiversity Information System (OBIS). The focus is on how commonly used fields in DFO fisheries datasets map to Darwin Core terms, enabling integration across internal and external systems.
What is Darwin Core?
Darwin Core is a biodiversity data standard maintained by Biodiversity Information Standards (TDWG). It provides a stable, referenceable set of terms designed to facilitate sharing information about biological diversity — particularly the occurrence of taxa in nature, along with associated metadata like location, time, method, and taxonomy.
Darwin Core is intentionally broad, supporting data from natural history collections, ecological surveys, tagging programs, and more.
Darwin Core as a Translation Layer
In the DFO Pacific Region, diverse data formats and schemas are used across programs, divisions, and systems. Darwin Core provides a common data standard that enables these internal formats to be mapped to:
- Other DFO systems
- National open data platforms
- International biodiversity repositories (e.g., GBIF, OBIS)
By treating Darwin Core as a semantic translation layer, we can harmonize our internal datasets for interoperability, discoverability, and reuse, without requiring all groups to adopt the same database design.
The Unified GBIF Data Model
The GBIF Unified Data Model extends Darwin Core to support complex biodiversity datasets, including:
- Events (e.g., sampling campaigns, cruises, transects)
- Occurrences (e.g., individual specimens, observations)
- Measurements or Facts (e.g., biological measurements, environmental conditions)
This model aligns with DFO data practices that involve nested relationships between surveys, gear deployments, catches, and individual biological samples.
OBIS: Ocean Biodiversity Information System
OBIS builds on Darwin Core and the GBIF model to support marine biodiversity data. It uses a Darwin Core Archive format with specific extensions for:
- Sampling Effort
- Environmental Parameters
- Taxonomic Verification
OBIS is the global node for marine biodiversity under the UN Ocean Decade and is highly relevant for DFO’s coastal and offshore research.
Mapping DFO Fisheries Data to Darwin Core
Below are examples of how commonly used columns in DFO data systems align with Darwin Core terms.
DFO Field / Concept | Darwin Core Term | Notes |
---|---|---|
Species_Code or Species_Name |
scientificName |
Can be resolved via WoRMS or NCBI |
Sample_ID |
occurrenceID |
A globally unique identifier for the specimen or observation |
Catch_Date |
eventDate |
Ideally formatted as ISO 8601 |
Survey_Name |
eventID |
Use hierarchical eventID s for nested events (e.g., cruise → tow → sample) |
Latitude , Longitude |
decimalLatitude , decimalLongitude |
Use WGS84 datum |
Depth_m |
minimumDepthInMeters , maximumDepthInMeters |
Use both where applicable |
Sex , Maturity , Length , Weight |
measurementType , measurementValue , measurementUnit |
Recorded under the MeasurementOrFact extension |
Gear_Type |
samplingProtocol |
Use consistent terminology for methods |
Area_Code , Subarea |
locationID , locality |
Align to spatial standards like NAFO/DFO zones |
Data_Source , Project_Name |
datasetName , recordedBy |
Identify project or responsible team |
Toward Controlled Vocabularies and Ontologies
DFO’s long-term goal is to develop controlled vocabularies and ontology modules that align with:
- Darwin Core term values
- The NERC Vocabulary Server (NVS) for environmental parameters
- World Register of Marine Species (WoRMS) for taxonomic resolution
- CF Conventions for oceanographic and climate data
- The Salmon Ontology developed by NCEAS
These efforts will allow for semantic alignment, improving both machine-readability and clarity for analysts and stakeholders.
Next Steps
- We are actively building a DFO metadata crosswalk using Darwin Core as the central translation layer
- Look for updates on term definitions, controlled value lists, and validation tools
- Join our discussions on standardizing data fields and contributing to the controlled vocabulary development
For questions, feedback, or help mapping your dataset, contact the Data Stewardship Unit.
Darwin Core is not a database format. It’s a lingua franca for biodiversity data. Use it to describe, translate, and publish your data — not to force everything into the same structure.