Skip to content

Commit 8f476dc

Browse files
authored
New design functions, checkpoints, and minor updates (#8)
* Add all gRNAde checkpoints * Add partial re-design utilities and update checkpoints, version * Minor updates to tutorial * Update eval vizualisation notebooks * Add design notebook * Add logit bias utility to AR model for partial sequence design * Add RibonanzaNet self-consistency score * Add new utility functions * Update README
1 parent 8c50526 commit 8f476dc

24 files changed

Lines changed: 2442 additions & 809 deletions

README.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -1,22 +1,22 @@
1-
# 💣 gRNAde: Geometric RNA Design
1+
# 💣 gRNAde: Geometric Deep Learning for 3D RNA Inverse Design
22

3-
**gRNAde** is a geometric deep learning pipeline for 3D RNA inverse design, analogous to [ProteinMPNN](https://github.com/dauparas/ProteinMPNN) for protein design.
3+
**gRNAde** is a **g**eometric deep learning pipeline for 3D **RNA** inverse **de**sign, analogous to [ProteinMPNN](https://github.com/dauparas/ProteinMPNN) for protein design.
44

5-
gRNAde generates an RNA sequence conditioned on one or more 3D RNA backbone conformations, i.e. both single- and multi-state **fixed-backbone sequence design**.
6-
RNA backbones are featurized as geometric graphs and processed via a multi-state GNN encoder which is equivariant to 3D roto-translation of coordinates as well as conformer order, followed by conformer order-invariant pooling and sequence design.
7-
8-
![](/tutorial/fig/grnade_pipeline.png)
5+
🧬 Tutorial notebook to get started: [gRNAde 101](/tutorial/tutorial.ipynb)
96

10-
⚙️ Want to use gRNAde for your own RNA designs? Check out the tutorial notebook: [gRNAde 101](/tutorial/tutorial.ipynb)
7+
⚙️ Using gRNAde for custom RNA design scenarios: [Design notebook](/notebooks/design.ipynb)
118

12-
️ New to 3D RNA modelling? Here's a currated reading + watch list for beginners: [Resources](/tutorial/README.md)
9+
️ New to 3D RNA modelling? Here's a currated reading + watch list for beginners: [Resources](/tutorial/README.md)
1310

1411
📄 For more details on the methodology, see the accompanying paper: ['Multi-State RNA Design with Geometric Multi-Graph Neural Networks'](https://arxiv.org/abs/2305.14749)
1512
> Chaitanya K. Joshi, Arian R. Jamasb, Ramon Viñas, Charles Harris, Simon Mathis, and Pietro Liò. Multi-State RNA Design with Geometric Multi-Graph Neural Networks. *ICML Computational Biology Workshop, 2023.*
1613
>
17-
>[PDF](https://arxiv.org/pdf/2305.14749.pdf) | [Tweet](https://twitter.com/chaitjo/status/1662118334412800001) | [Slides](https://www.chaitjo.com/publication/joshi-2023-multi/gRNAde_slides_CASP_RNA_SIG.pdf) | [Tutorial](/tutorial/tutorial.ipynb)
14+
>[PDF](https://arxiv.org/abs/2305.14749.abs) | [Tweet](https://twitter.com/chaitjo/status/1662118334412800001) | [Slides](https://www.chaitjo.com/publication/joshi-2023-grnade/gRNAde_slides_CASP_RNA_SIG.pdf)
1815
16+
![](/tutorial/fig/grnade_pipeline.png)
1917

18+
gRNAde generates an RNA sequence conditioned on one or more 3D RNA backbone conformations, i.e. both single- and multi-state **fixed-backbone sequence design**.
19+
RNA backbones are featurized as geometric graphs and processed via a multi-state GNN encoder which is equivariant to 3D roto-translation of coordinates as well as conformer order, followed by conformer order-invariant pooling and sequence design.
2020

2121
## Installation
2222

@@ -173,7 +173,7 @@ Each RNA will be processed into the following format (most of the metadata is op
173173
## Citation
174174

175175
```
176-
@inproceedings{joshi2023multi,
176+
@inproceedings{joshi2023grnade,
177177
title={Multi-State RNA Design with Geometric Multi-Graph Neural Networks},
178178
author={Joshi, Chaitanya K. and Jamasb, Arian R. and Viñas, Ramon and Harris, Charles and Mathis, Simon and Liò, Pietro},
179179
booktitle={ICML 2023 Workshop on Computation Biology},

checkpoints/README.md

Lines changed: 36 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,36 @@
1+
# gRNAde checkpoints
2+
3+
Checkpoints are available for various training data splits.
4+
For general usage and best performance, we recommend using the `all` split.
5+
6+
## All RNAsolo samples (as of October 2023): `all`
7+
8+
(Sequence recovery and Self-consistency MCC are computed on Single-state benchmark from Das et al.)
9+
10+
| Model | Max. no. conformers | Max. training RNA length | Link | Sequence recovery | Self-consistency MCC |
11+
| --- | --- | --- | --- | --- | --- |
12+
| Autoregressive | 1 | 5000 | [gRNAde_ARv1_1state_all.h5](gRNAde_ARv1_1state_all.h5) | 0.7387 | 0.6296 |
13+
| Autoregressive | 2 | 5000 | [gRNAde_ARv1_2state_all.h5](gRNAde_ARv1_2state_all.h5) | 0.7907 | 0.6192 |
14+
| Autoregressive | 3 | 5000 | [gRNAde_ARv1_3state_all.h5](gRNAde_ARv1_3state_all.h5) | 0.7987 | 0.5911 |
15+
| Autoregressive | 5 | 5000 | [gRNAde_ARv1_5state_all.h5](gRNAde_ARv1_5state_all.h5) | 0.8197 | 0.6344 |
16+
| | | | |
17+
18+
## Das et al. (2011) single-state design benchmark split: `das`
19+
20+
| Model | Max. no. conformers | Max. training RNA length | Link | Sequence recovery | Self-consistency MCC |
21+
| --- | --- | --- | --- | --- | --- |
22+
| Autoregressive | 1 | 5000 | [gRNAde_ARv1_1state_das.h5](gRNAde_ARv1_1state_das.h5) | 0.5278 | 0.6304 |
23+
| Autoregressive | 2 | 5000 | [gRNAde_ARv1_2state_das.h5](gRNAde_ARv1_2state_das.h5) | 0.5140 | 0.6163 |
24+
| Autoregressive | 3 | 5000 | [gRNAde_ARv1_3state_das.h5](gRNAde_ARv1_3state_das.h5) | 0.5424 | 0.6204 |
25+
| Autoregressive | 5 | 5000 | [gRNAde_ARv1_5state_das.h5](gRNAde_ARv1_5state_das.h5) | 0.5669 | 0.6296 |
26+
| | | | |
27+
28+
## Multi-state design benchmark split: `multi`
29+
30+
| Model | Max. no. conformers | Max. training RNA length | Link | Sequence recovery | Self-consistency MCC |
31+
| --- | --- | --- | --- | --- | --- |
32+
| Autoregressive | 1 | 5000 | [gRNAde_ARv1_1state_multi.h5](gRNAde_ARv1_1state_multi.h5) | 0.4468 | 0.5809 |
33+
| Autoregressive | 3 | 5000 | [gRNAde_ARv1_2state_multi.h5](gRNAde_ARv1_2state_multi.h5) | 0.4471 | 0.5321 |
34+
| Autoregressive | 3 | 5000 | [gRNAde_ARv1_3state_multi.h5](gRNAde_ARv1_3state_multi.h5) | 0.4692 | 0.5957 |
35+
| Autoregressive | 5 | 5000 | [gRNAde_ARv1_5state_multi.h5](gRNAde_ARv1_5state_multi.h5) | 0.4844 | 0.5841 |
36+
| | | | |
8.32 MB
Binary file not shown.
8.32 MB
Binary file not shown.
8.32 MB
Binary file not shown.
8.32 MB
Binary file not shown.
8.32 MB
Binary file not shown.

0 commit comments

Comments
 (0)