I've replaced the attention layers in Enformer with the sparse attention layers from BigBird, but the memory usage reported by tf.config.experimental.get_memory_info is still basically the same (within 1%). Do I also need to port code from BigBird's encoder or decoder to see a decrease in memory usage?
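For context, this is roughly how I'm measuring memory (a minimal sketch; measure_peak_memory is just an illustrative helper, and the model/input construction is omitted):

```python
import tensorflow as tf

def measure_peak_memory(model, inputs, device="GPU:0"):
    """Run one forward pass and report peak device memory in MiB."""
    # Zero the peak counter so we only see this pass's allocations.
    tf.config.experimental.reset_memory_stats(device)
    _ = model(inputs, training=False)  # single forward pass
    info = tf.config.experimental.get_memory_info(device)
    return info["peak"] / 1024**2
```

I call this once with the original Enformer and once with the BigBird-attention variant, on the same input batch, and the two peak numbers come out within 1% of each other.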
Thanks!