-
Notifications
You must be signed in to change notification settings - Fork 4
Add resources: Open Tree of Life, GBIF, OBIS #512
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Merged
Merged
Conversation
This file contains hidden or bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Adds two comprehensive biodiversity resource entries to the kg-registry: 1. **EOL TraitBank** - A searchable open digital repository for organism traits and attributes - 11 million+ trait records from 50+ data sources - 1.7 million taxa across the entire tree of life - Multiple access methods: web portal, Neo4j Cypher, REST APIs, bulk downloads - Darwin Core semantic web standards 2. **Open Tree of Life** - A comprehensive phylogenetic tree synthesizing evolutionary estimates with taxonomic data - 2.4 million tips representing species and infraspecific taxa - 1,216 published phylogenetic papers with 87,000 tip taxa - Interactive web browser and RESTful APIs - Community-curated Phylesystem repository with 4,500+ studies - CC0 public domain license Both resources are validated against the kg-registry schema. 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
Adds a comprehensive resource entry for GBIF, the world's largest biodiversity data portal and international data infrastructure. **GBIF Details:** - 3.1 billion+ species occurrence records from 81,000+ datasets - 2,500+ publishing institutions contributing data - All domains of life: plants, animals, fungi, protists, bacteria, archaea - 50% of records from citizen science (primarily iNaturalist) - Darwin Core standardized data format **Products included:** - Interactive GBIF.org portal for discovery and downloads - RESTful APIs (Registry, Species, Occurrence, Maps) - Darwin Core Archive and Simple CSV download formats - Integrated Publishing Toolkit (IPT) for data publication - Comprehensive technical documentation **Standards & Quality:** - Darwin Core standard for biodiversity data - 82%+ of records under CC0 or CC-BY open licenses - 30+ data quality flags and validation checks - Support for occurrence, checklist, and sampling-event datasets **Impact:** - Used by 10,000+ peer-reviewed scientific papers - Supports IUCN Red List threat assessments - Integral to IPCC and IPBES international reports - ~34 research papers per week published using GBIF data 🤖 Generated with [Claude Code](https://claude.com/claude-code) Co-Authored-By: Claude Sonnet 4.5 <[email protected]>
Adds a comprehensive resource entry for OBIS, the global marine biodiversity data system and largest repository of ocean biodiversity information. **OBIS Details:** - 161 million+ marine biodiversity records - 160,000+ marine species covered globally - 27 million DNA sequences (eDNA metabarcoding data) - 600+ institutions contributing data from 56+ countries - Depth range: surface waters to 10,900 meters (hadal zone) - 27 regional, national, and thematic OBIS nodes **Data Standards & Formats:** - Darwin Core Archive (DwC-A) standardized format - GeoParquet cloud-native format on AWS S3 - CSV exports for spreadsheet analysis - Ecological Metadata Language (EML) for metadata - World Register of Marine Species (WoRMS) as authoritative taxonomy **Products included:** - Interactive OBIS web mapper for searching and visualization - RESTful API for programmatic data access - robis R package for R-based analysis - Darwin Core Archive bulk downloads - GeoParquet cloud format on AWS - OBIS-SEAMAP specialized marine megavertebrate database (1,526 datasets, 700+ species) - Comprehensive OBIS manual with documentation **Quality Control:** - Three-step validation: initial, automated, and manual review - Taxonomic validation against WoRMS (514,088 marine species names tracked) - Data quality flags for accuracy and reliability - Community feedback integration for continuous improvement **Governance & Support:** - UNESCO Intergovernmental Oceanographic Commission (IOC) governance - Hosted by Flanders Marine Institute (VLIZ), Belgium - International IODE programme participation - CC0, CC BY, CC BY-NC license options **Use Cases:** - Marine conservation planning and species protection - Climate change impact assessment on marine species - Fisheries management and sustainable harvest - Ecosystem monitoring and coastal zone management - Biogeographic research and species distribution analysis
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Add this suggestion to a batch that can be applied as a single commit.
This suggestion is invalid because no changes were made to the code.
Suggestions cannot be applied while the pull request is closed.
Suggestions cannot be applied while viewing a subset of changes.
Only one suggestion per line can be applied in a batch.
Add this suggestion to a batch that can be applied as a single commit.
Applying suggestions on deleted lines is not supported.
You must change the existing code in this line in order to create a valid suggestion.
Outdated suggestions cannot be applied.
This suggestion has been applied or marked resolved.
Suggestions cannot be applied from pending reviews.
Suggestions cannot be applied on multi-line comments.
Suggestions cannot be applied while the pull request is queued to merge.
Suggestion cannot be applied right now. Please check back later.
Close #483, #484, #485