Skip to content

Conversation

@caufieldjh
Copy link
Collaborator

@caufieldjh caufieldjh commented Dec 18, 2025

Close #483, #484, #485

caufieldjh and others added 4 commits December 18, 2025 11:13
Adds two comprehensive biodiversity resource entries to the kg-registry:

1. **EOL TraitBank** - A searchable open digital repository for organism traits and attributes
   - 11 million+ trait records from 50+ data sources
   - 1.7 million taxa across the entire tree of life
   - Multiple access methods: web portal, Neo4j Cypher, REST APIs, bulk downloads
   - Darwin Core semantic web standards

2. **Open Tree of Life** - A comprehensive phylogenetic tree synthesizing evolutionary estimates with taxonomic data
   - 2.4 million tips representing species and infraspecific taxa
   - 1,216 published phylogenetic papers with 87,000 tip taxa
   - Interactive web browser and RESTful APIs
   - Community-curated Phylesystem repository with 4,500+ studies
   - CC0 public domain license

Both resources are validated against the kg-registry schema.

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
Adds a comprehensive resource entry for GBIF, the world's largest biodiversity data portal and international data infrastructure.

**GBIF Details:**
- 3.1 billion+ species occurrence records from 81,000+ datasets
- 2,500+ publishing institutions contributing data
- All domains of life: plants, animals, fungi, protists, bacteria, archaea
- 50% of records from citizen science (primarily iNaturalist)
- Darwin Core standardized data format

**Products included:**
- Interactive GBIF.org portal for discovery and downloads
- RESTful APIs (Registry, Species, Occurrence, Maps)
- Darwin Core Archive and Simple CSV download formats
- Integrated Publishing Toolkit (IPT) for data publication
- Comprehensive technical documentation

**Standards & Quality:**
- Darwin Core standard for biodiversity data
- 82%+ of records under CC0 or CC-BY open licenses
- 30+ data quality flags and validation checks
- Support for occurrence, checklist, and sampling-event datasets

**Impact:**
- Used by 10,000+ peer-reviewed scientific papers
- Supports IUCN Red List threat assessments
- Integral to IPCC and IPBES international reports
- ~34 research papers per week published using GBIF data

🤖 Generated with [Claude Code](https://claude.com/claude-code)

Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
@caufieldjh caufieldjh changed the title Add resources: Open Tree of Life, others Add resources: Open Tree of Life, GBIF, others Dec 18, 2025
@caufieldjh caufieldjh linked an issue Dec 18, 2025 that may be closed by this pull request
Adds a comprehensive resource entry for OBIS, the global marine biodiversity data system and largest repository of ocean biodiversity information.

**OBIS Details:**
- 161 million+ marine biodiversity records
- 160,000+ marine species covered globally
- 27 million DNA sequences (eDNA metabarcoding data)
- 600+ institutions contributing data from 56+ countries
- Depth range: surface waters to 10,900 meters (hadal zone)
- 27 regional, national, and thematic OBIS nodes

**Data Standards & Formats:**
- Darwin Core Archive (DwC-A) standardized format
- GeoParquet cloud-native format on AWS S3
- CSV exports for spreadsheet analysis
- Ecological Metadata Language (EML) for metadata
- World Register of Marine Species (WoRMS) as authoritative taxonomy

**Products included:**
- Interactive OBIS web mapper for searching and visualization
- RESTful API for programmatic data access
- robis R package for R-based analysis
- Darwin Core Archive bulk downloads
- GeoParquet cloud format on AWS
- OBIS-SEAMAP specialized marine megavertebrate database (1,526 datasets, 700+ species)
- Comprehensive OBIS manual with documentation

**Quality Control:**
- Three-step validation: initial, automated, and manual review
- Taxonomic validation against WoRMS (514,088 marine species names tracked)
- Data quality flags for accuracy and reliability
- Community feedback integration for continuous improvement

**Governance & Support:**
- UNESCO Intergovernmental Oceanographic Commission (IOC) governance
- Hosted by Flanders Marine Institute (VLIZ), Belgium
- International IODE programme participation
- CC0, CC BY, CC BY-NC license options

**Use Cases:**
- Marine conservation planning and species protection
- Climate change impact assessment on marine species
- Fisheries management and sustainable harvest
- Ecosystem monitoring and coastal zone management
- Biogeographic research and species distribution analysis
@caufieldjh caufieldjh changed the title Add resources: Open Tree of Life, GBIF, others Add resources: Open Tree of Life, GBIF, OBIS Dec 18, 2025
@caufieldjh caufieldjh linked an issue Dec 18, 2025 that may be closed by this pull request
@caufieldjh caufieldjh marked this pull request as ready for review December 18, 2025 17:01
@caufieldjh caufieldjh merged commit b95b767 into main Dec 18, 2025
2 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

2 participants