Error gap between train and valid sets #116
-
Beta Was this translation helpful? Give feedback.
Replies: 2 comments 2 replies
-
|
Hi @evenfarther, Regarding the inference speeds, I noticed you have 4 GNN layers, which is quite large, while setting |
Beta Was this translation helpful? Give feedback.
-
|
Thanks @evenfarther for the question and @cesaremalosso for answering! I just wanted to add that it seems to me that the PET hyperparameters were heavily engineered to match the parameter count of some other model. This is, in my experience, not a good way of putting two models "on the same footing". As you might have noticed, different ways of distributing the same number of parameters within the architecture can result in massively different accuracy and/or speed numbers. In my opinion, a more robust way is to e.g. roughly match the inference speed of the two models and then measure the accuracy/performance in some test/exercise |
Beta Was this translation helpful? Give feedback.

Hi @evenfarther,
PET models tend to be very expressive, and the fact that the error on your training set is lower than on your validation set means that your model's accuracy is currently limited by data rather than architecture. This suggests it could be significantly improved by adding more data to your dataset. That said, the discrepancy is not dramatic, but the absolute error magnitudes indicate there's still room for improvement, have you considered fine-tuning one of our foundation models? You should be able to achieve much better results that way. Keep in mind that PET requires quite a few epochs (at least a couple of hundreds in my experience, but it depends on the dataset) to lea…