Skip to content

Conversation

@amyfromandi
Copy link
Collaborator

Need to verify if there is code to extract the points or geomaterialdict in another file. Reference #171 for details

@amyfromandi
Copy link
Collaborator Author

Update
geomaterialdict is explicitly ignored in gems_etl implementation. It seems like it's an irrelevant file for the partner to ingest. Also, in this code, points are not being extracted from the .gdb; looks like the code has yet to be written.

However, our get_dataframes() is already using the right G.read_file() to get the layers (both spatial and non-spatial) from the .gdb for us to ingest. The only difference is that we need to include a read_geometry parameter to tell geopandas to read spatial vs non-spatial files. This depicts whether we insert the data into the database or apply a metadata merge on it.

Lots of refactoring with this change. Will push my updates soon.

@amyfromandi
Copy link
Collaborator Author

amyfromandi commented Jul 18, 2025

  • b_interval and t_interval accurately populated based on geochronological order and valid interval_names that exist in macrostrat.intervals
  • need a lookup function to infer formal strat_name based on the name and descrip columns from DMU.
  • verify .gdb scraping tool to download all valid arizona .gdb's
  • test bulk .gdb ingestion with Arizona maps!

@amyfromandi amyfromandi merged commit c14c311 into main Jul 24, 2025
@amyfromandi amyfromandi deleted the gems_ingestion branch July 24, 2025 19:18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants