How are spatial coordinates used in pretraining and downstream tasks ?

Hi Nicheformer team,


I wanted to ask for clarification regarding how spatial coordinate data is utilized throughout the Nicheformer workflow.

## Specifically:
### During pretraining:

- Is spatial coordinate information explicitly used as part of the model input?

- Or is pretraining based only on gene expression tokens, regardless of cell positions?

### During downstream tasks:

- Are spatial coordinates ever used directly in tasks like niche classification, density regression, etc.?

- Or are the coordinates only used indirectly — for computing ground truth labels like X_niche_n (niche composition) or local density using tools like Squidpy?

## From what I understand:

- The coordinate data is used for generating ground-truth labels (like niche composition/density) via spatial graphs.

- But the model input itself (in both pretraining and downstream tasks) consists only of ranked gene token sequences, not explicit spatial information.

**Could you confirm if this interpretation is correct? And if not, could you clarify how spatial context is encoded (if at all) during model input?**
Thanks in advance ..



Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

How are spatial coordinates used in pretraining and downstream tasks ? #25

Specifically:

During pretraining:

During downstream tasks:

From what I understand:

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

How are spatial coordinates used in pretraining and downstream tasks ? #25

Description

Specifically:

During pretraining:

During downstream tasks:

From what I understand:

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions