Skip to content

Neo4j squashed#10690

Draft
victorneo4j wants to merge 3 commits into
pyg-team:masterfrom
victorneo4j:neo4j-squashed
Draft

Neo4j squashed#10690
victorneo4j wants to merge 3 commits into
pyg-team:masterfrom
victorneo4j:neo4j-squashed

Conversation

@victorneo4j
Copy link
Copy Markdown

This pull request introduces new database-backed abstractions for large-graph training in PyG, along with a reference Neo4j implementation and documentation. These additions allow PyG to interact with graph topology and features stored in external databases, enabling training on graphs that do not fit in memory and supporting pluggable feature caching. The most important changes are:

New database-backed abstractions and Neo4j implementation:

  • Added DatabaseGraphStore, DatabaseFeatureStore, and DatabaseSampler abstractions for database-backed graph training, with a reference implementation for Neo4j and a runnable Cora notebook under examples/neo4j.
  • Added Neo4jFeatureStore in examples/neo4j/data/neo4j_feature_store.py, a general implementation of DatabaseFeatureStore for Neo4j, supporting configurable attribute mapping, efficient batched queries, and pluggable caching.

Documentation and tutorials:

  • Added a comprehensive tutorial (docs/source/tutorial/database_backend.rst) explaining the new database-backed stack, its components, and usage, including how to plug in different databases and cache backends.
  • Updated the main documentation index to include the new database backend tutorial.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant