Add neighbor list detection #1036

cbkerr · 2025-08-06T17:53:15Z

Detect the neighbor list of all jobs in a project, or the neighbors of an individual job. This is useful for navigating high-dimensional signac projects, especially for enabling more intuitive navigation in signac-dashboard.

The idea for abstracting this code developed out of the signac dashboard Navigator module and the need to ignore certain state point parameters when building the neighbor list.

A neighbor list also allows new kinds of analysis based on nearby jobs.

Description

Neighbor detection is based on cache lookups by calculating job ids of possible neighbor jobs. The possible neighbors are found by changing the state point parameters to values listed in the schema.
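The lookup described above can be sketched roughly as follows. This is a minimal illustration, not the code in this PR: `state_point_id`, `find_neighbors`, and the plain-dict cache and schema are hypothetical stand-ins, and hashing the sorted JSON with MD5 is an assumption about how ids are derived. The sketch records every other schema value that lands on an existing job, whereas the actual implementation may keep only the nearest values, and it assumes flat state points with hashable values.

```python
import hashlib
import json


def state_point_id(state_point):
    """Hash a state point to an id (assumed scheme: MD5 of sorted JSON)."""
    blob = json.dumps(state_point, sort_keys=True)
    return hashlib.md5(blob.encode()).hexdigest()


def find_neighbors(sp_cache, schema):
    """Map each job id to {key: {neighbor_value: neighbor_job_id}}.

    sp_cache: dict of job id -> flat state point (hypothetical cache layout).
    schema: dict of key -> list of values seen for that key in the project.
    """
    neighbor_list = {}
    for job_id, sp in sp_cache.items():
        per_key = {}
        for key, values in schema.items():
            if key not in sp:
                continue
            found = {}
            for value in values:
                if value == sp[key]:
                    continue
                # Swap in one value, then look the candidate id up in the cache.
                candidate_id = state_point_id({**sp, key: value})
                if candidate_id in sp_cache:
                    found[value] = candidate_id
            if found:
                per_key[key] = found
        neighbor_list[job_id] = per_key
    return neighbor_list


# A 2x2 grid of state points: every job has one neighbor along each key.
state_points = [{"a": a, "b": b} for a in (2, 3) for b in (5, 6)]
sp_cache = {state_point_id(sp): sp for sp in state_points}
schema = {"a": [2, 3], "b": [5, 6]}
neighbor_list = find_neighbors(sp_cache, schema)
```

No job is opened or read from disk here: each candidate costs one hash and one dictionary lookup, which is what makes the cache-based approach cheap.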

Ignoring keys changes job ids, so I create a mapping to the related "shadow" project that has these new job ids. This is explained in detail in the docstring for prepare_shadow_project.
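As a rough sketch of the shadow mapping idea (the docstring for prepare_shadow_project remains the authoritative description), the snippet below drops the ignored keys from each state point, recomputes an id, and maps that shadow id back to the real job id. The names `state_point_id` and `build_shadow_map`, the dict-based cache, and the MD5-of-sorted-JSON hash are all assumptions made for illustration. Raising an error when two jobs collapse onto the same shadow id is also an assumed behavior.

```python
import hashlib
import json


def state_point_id(state_point):
    """Hash a state point to an id (assumed scheme: MD5 of sorted JSON)."""
    blob = json.dumps(state_point, sort_keys=True)
    return hashlib.md5(blob.encode()).hexdigest()


def build_shadow_map(sp_cache, ignore):
    """Return {shadow id: real id} after dropping the ignored keys."""
    shadow_map = {}
    for real_id, sp in sp_cache.items():
        shadow_sp = {k: v for k, v in sp.items() if k not in ignore}
        shadow_id = state_point_id(shadow_sp)
        if shadow_id in shadow_map:
            # Two jobs differ only in ignored keys, so the mapping is ambiguous.
            raise ValueError(
                f"Ignoring {ignore} makes jobs {shadow_map[shadow_id]} "
                f"and {real_id} indistinguishable."
            )
        shadow_map[shadow_id] = real_id
    return shadow_map


# "2b" is fully determined by "b", so ignoring it keeps every job distinct.
state_points = [{"a": 1, "b": b, "2b": 2 * b} for b in (5, 6)]
sp_cache = {state_point_id(sp): sp for sp in state_points}
shadow_map = build_shadow_map(sp_cache, ignore=["2b"])
```

Neighbor search can then run on the shadow state points, with the resulting shadow ids translated back to real job ids through `shadow_map`.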

It was challenging to work around the conversion between "nested keys" and "dotted keys". The convention is documented only under the query syntax. The schema returns the values associated with state point parameters in dotted key format, so I convert the state points in the state point cache to dotted keys. However, we have to convert back to nested keys to compute the job id.
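To make the two key formats concrete, here is a small sketch of the round trip. The helpers `to_dotted` and `to_nested` are hypothetical names written for this example, not the functions used in the PR, and the sketch assumes that no key contains a literal dot.

```python
def to_dotted(nested, prefix=""):
    """Flatten {"x": {"n": 1}} into {"x.n": 1}."""
    flat = {}
    for key, value in nested.items():
        dotted_key = f"{prefix}{key}"
        if isinstance(value, dict) and value:
            flat.update(to_dotted(value, prefix=f"{dotted_key}."))
        else:
            flat[dotted_key] = value
    return flat


def to_nested(flat):
    """Rebuild {"x": {"n": 1}} from {"x.n": 1}."""
    nested = {}
    for dotted_key, value in flat.items():
        *parents, leaf = dotted_key.split(".")
        node = nested
        for part in parents:
            node = node.setdefault(part, {})
        node[leaf] = value
    return nested


state_point = {"a": 1, "x": {"n": "nested"}}
dotted = to_dotted(state_point)   # {"a": 1, "x.n": "nested"}
round_trip = to_nested(dotted)    # equal to state_point again
```

The dotted form lines up with the keys that the schema reports, while the nested form is what must be hashed so that the computed id matches the id of the real job.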

The ability to get neighbors of a single job is intended mainly for command line usage.

Motivation and Context

The Navigator module of dashboard has been very helpful, and the general concept of neighbors of jobs could help with analysis like identifying regions of parameter space.

Basic example:

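neighbor_list = project.get_neighbors()
for job in project:
    neighbors = neighbor_list[job.id]
    print(f"Job {job.id}")
    for key, value in job.sp.items():
        # Build the description separately: reusing the same quote type inside
        # an f-string is a syntax error before Python 3.12.
        # .get() guards against keys that have no entry in the neighbor list.
        targets = " and ".join(
            f"{key}-->{new_val} at job id {jid}"
            for new_val, jid in neighbors.get(key, {}).items()
        )
        print(f"has {key}={value} with neighbor jobs {targets}")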

Complex example: show a graph of jobs (above)

import itertools
import signac
import networkx as nx
from pyvis.network import Network

project = signac.init_project("graph")
for a,b in itertools.product([1,2,3], [5,6,7]):
    project.open_job({"a": a, "b": b, "2b": 2*b}).init()

for b in [8,9,10]:
    project.open_job({"a": 1, "b": b}).init()

# works across data types
for b in ["eight", "nine", None]:
    project.open_job({"a": 1, "b": b}).init()

# see isolated jobs
project.open_job({"a": 1, "c": True, "b": "eight"}).init()

for c,b in itertools.product([True,False], ["eight", "nine"]):
    project.open_job({"c": c, "b": b}).init()

# works on lists
for c,b in itertools.product([True,False], [[1,2], [1,5]]):
    project.open_job({"c": c, "b": b}).init()

# works on nested values
# although they appear in the neighbor list like nl[id]["x.n"] because of internal limitations with schema
for x in [{"n": "nested"}, {"n": "values"}]:
    project.open_job({"x": x}).init()

neighbor_list = project.get_neighbors(ignore = ["2b"])

DG = nx.DiGraph()
for job in project:
    DG.add_node(job.id, label = f"{job.sp}")
for job in project:
    neighbors = neighbor_list[job.id]
    for key, neighbor_vals in neighbors.items():
        for neighbor_value, neighbor_job_id in neighbor_vals.items():
            DG.add_edge(job.id, neighbor_job_id, title = f"{key}-->{neighbor_value}")

nt = Network('800px', '1000px', directed = True)
nt.from_nx(DG)
nt.show('signac_graph.html', notebook=False)

Checklist:

I am familiar with the Contributing Guidelines.
I agree with the terms of the Contributor Agreement.
My name is on the list of contributors.
The changes introduced by this pull request are covered by existing or newly introduced tests.
The package documentation and framework documentation in signac-docs are up to date with these changes.
I have updated the changelog and added any related issue and pull request numbers for future reference.

Internal functions now have to take dotted keys to work with the output of detect_schema. Allow moving between neighboring jobs of different types by sorting values within each type, then joining these in order of alphabetized type name.
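The ordering rule for mixed-type values can be illustrated with a short sketch. `sort_mixed_values` is a hypothetical helper, not the function in this PR, and it assumes that the values within one type are unique and mutually orderable.

```python
from collections import defaultdict


def sort_mixed_values(values):
    """Sort within each type, then join the groups by alphabetized type name."""
    by_type = defaultdict(list)
    for value in values:
        # type(True).__name__ is "bool", so booleans stay separate from ints.
        by_type[type(value).__name__].append(value)
    ordered = []
    for type_name in sorted(by_type):
        ordered.extend(sorted(by_type[type_name]))
    return ordered


ordered = sort_mixed_values([9, "nine", 8, None, "eight", True, 10])
# -> [None, True, 8, 9, 10, "eight", "nine"]
```

Grouping by type first avoids the `TypeError` that Python raises when comparing, for example, an `int` with a `str`, and it gives every value a well-defined previous and next value even when a parameter takes values of several types.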


joaander

Thanks! The code is clean and the intent is clear. I think the documentation should be more explicit. Other than that, it looks great.


bdice

Thanks for the continued contributions! This is an interesting feature. I haven't evaluated anything for performance but I did a quick read-through for my own curiosity. I left a few small comments but I don't plan to make a second pass of review, so I leave my approval. Feel free to accept or reject my suggestions however you see fit.

signac/__main__.py

signac/_neighbor.py

signac/job.py

signac/project.py


cbkerr added 30 commits April 29, 2025 16:28

Working prototype of neighbor list implemented within Project class

539cedc

Update docstring

b40db7b

Add idea for type of return value

290c904

Pass statepoint, not job id and dotted_sp_cache to neighbors_of_job

b91eb34

The shadow map will be applied outside this function

Add API entry points

371ddea

Prototype code

a891c8d

Merge branch 'main' into feat/neighbor-list

e542221

Add prepare_shadow_project from dashboard navigator and fix bug

a3e7d5b

Make ignore an empty list by default

500934d

Update comments

ad8395b

Import Counter

9275818

Add tests

16520d2

Avoid 0 and 1 in neighborlist test because conflated with bools

2715d46

Clean up code that gives duplicates error message

6b2a66e

Code cleanup

53c4fab

Only convert from shadow job ids if needed (if ignoring keys)

43a7872

Streamline output from search function

9bb4087

Improve function names

df9a0bb

Add old neighbor code

ae89ad8

Move neighbor code to separate module

30a0478

Remove internal neighbor code from Project class

2e04fe4

Prototype API for accessing 1 job's neighbors

bc371dd

Add shell command to print neighbors of job by id

2786117

Update docstrings

6176290

Move code paths handling ignore to neighbor module

53990bf

Improve code clarity

5b5cc84

Add test for job neighbors

778cb51

Move neighborlist tests to separate file

5394f47

Move flat_schema to internal method

4a2fab0

AlainKadar self-assigned this Aug 27, 2025

Merge branch 'main' into feat/neighbor-list

e4ecc59

joaander reviewed Sep 3, 2025


cbkerr and others added 8 commits September 3, 2025 21:02

Allow signac neighbors --ignore parsing

527a899

Don't print keys with no neighbors

e6607f9

Warn which ignored keys are not found, and only remove those

4f60c23

Ensure bad key is listed in error message

18e226e

[pre-commit.ci] auto fixes from pre-commit.com hooks

9b12687

for more information, see https://pre-commit.ci

Support older f-string syntax

5f02aab

Add neighbors in sort order

6479228

[pre-commit.ci] auto fixes from pre-commit.com hooks

521e3d6

for more information, see https://pre-commit.ci

bdice approved these changes Sep 12, 2025


cbkerr added 18 commits September 18, 2025 08:49

Optimization: Use detect_schema with exclude_const

2068eb6

By definition, these keys will not distinguish any jobs so will not show up in the neighborlist.

Add failing test for specifying ignored keys as dotted keys

bdff3d1

Support ignoring nested keys, specified in dotted key format

a08e281

Run ruff and attempt to address long lines

59da1ca

Apply suggestions from review

d0b5831

Update docstring for get_neighbors

a717722

Add note to consider not exposing job.get_neighbors()

c9d7fba

Format test file

b3bf746

Wrap lines

7f61328

Wrap a different way for older python versions

d61227c

Wrap without backslash

2516859

Clean up whitespace

565839f

Default to empty list for ignore argument on command line

fc4f90b

Make job.get_neighbors private to ensure users use the efficient one

dc8be8f

Add shell tests for neighbor

048fccd

Add copyright header

a5cdd23

Fix typo

d5ca36b

Run prek

12a608e


5 participants
