README.md: 16 additions & 12 deletions
@@ -119,16 +119,18 @@ Deepmind has released the weights for their tensorflow sonnet Enformer model! I
Update: <a href="https://github.com/jstjohn">John St. John</a> did some work and found that the `enformer-official-rough` model hits the reported marks in the paper: a human Pearson R of `0.625` for validation and `0.65` for test.
+
Update: As of version 0.8.0, if you load the pretrained model with the `from_pretrained` function, it should automatically use precomputed gamma positions to address a difference between the TensorFlow and PyTorch implementations of `xlogy`. This should resolve the numerical discrepancy above. If you finetune further without using `from_pretrained`, please make sure to set `use_tf_gamma = True` when using `.from_hparams` to instantiate the `Enformer`.
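For readers unfamiliar with `xlogy`, here is a minimal, standard-library-only sketch of what the function computes and of how one might measure a discrepancy between two implementations. It is an illustration only: the README does not say where the TensorFlow and PyTorch versions diverge, so the float32 rounding used below is an assumed stand-in for "a second implementation", and the helper names `max_abs_diff` and `to_float32` are hypothetical and not part of this repository.

```python
import math
import struct


def xlogy(x: float, y: float) -> float:
    """Return x * log(y), using the usual xlogy convention of 0.0 when x == 0."""
    if x == 0.0:
        return 0.0
    return x * math.log(y)


def to_float32(value: float) -> float:
    """Round a Python float (double precision) to the nearest float32 value."""
    return struct.unpack("f", struct.pack("f", value))[0]


def max_abs_diff(a, b) -> float:
    """Largest element-wise absolute difference between two equal-length sequences."""
    return max(abs(p - q) for p, q in zip(a, b))


# Reference results in double precision.
xs = [0.0, 0.5, 2.0, 7.5]
ys = [0.0, 0.25, 3.0, 10.0]
reference = [xlogy(x, y) for x, y in zip(xs, ys)]

# Assumed stand-in for a second implementation: the same results rounded to float32.
rounded = [to_float32(v) for v in reference]

# A small but nonzero gap like this is the kind of discrepancy one would check for.
print(max_abs_diff(reference, rounded))
```

The same comparison pattern, applied to the outputs of the two frameworks instead of the rounded copy, is one way to confirm that a fix such as the precomputed gamma positions closes the gap.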
Special thanks go out to <a href="https://www.eleuther.ai/">EleutherAI</a> for providing the resources to retrain the model, during a time when the official model from DeepMind had not been released yet.
+
Thanks also go out to <a href="johahi">@johahi</a> for finding out that there are numerical differences between the torch and tensorflow implementations of `xlogy`. He provided a fix for this difference, which is adopted in this repository in `v0.8.0`.
+
## Todo
- [x] script to load weights from trained tensorflow enformer model to pytorch model