Skip to content

Commit e2551f2

Browse files
committed
Added new largemix baselines
1 parent 01c40fc commit e2551f2

File tree

1 file changed

+34
-0
lines changed

1 file changed

+34
-0
lines changed

docs/baseline.md

Lines changed: 34 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -96,6 +96,40 @@ This is not surprising as they contain two orders of magnitude more datapoints a
9696
| | GIN | 0.1873 ± 0.0033 | **0.1701 ± 0.0142** |
9797
| | GINE | 0.1883 ± 0.0039 | **0.1771 ± 0.0010** |
9898

99+
## NEW: Largemix improved sweep - 2023/08-18
100+
101+
Unsatisfied with the prior results, we ran a bayesian search over a broader set of parameters, and including only more expressive models, namely GINE, GatedGCN and MPNN++. We further increase the number of parameters to 10M due to evidence of underfitting. We evaluate only the multitask setting.
102+
103+
We observe a significant improvement over all tasks, with a very notable r2-score increase of +0.53 (0.27 -> 0.80) compared to the best node-level property prediction on PCQM4M_N4.
104+
105+
The results are reported below over 1 seed. We are currently running more seeds of the same models.
106+
107+
| Dataset | Model | MAE ↓ | Pearson ↑ | R² ↑ |
108+
|---------------|----------------|--------|---------|--------|
109+
| **PCQM4M_G25** | GINE | 0.2250 | 0.8840 | 0.7911 |
110+
| | GatedGCN | 0.2457 | 0.8698 | 0.7688 |
111+
| | MPNN++ (sum) | 0.2269 | 0.8802 | 0.7855 |
112+
|
113+
| **PCQM4M_N4** | GINE | 0.2699 | 0.8475 | 0.7182 |
114+
| | GatedGCN | 0.3337 | 0.8102 | 0.6566 |
115+
| | MPNN++ (sum) | 0.2114 | 0.8942 | 0.8000 |
116+
117+
| Dataset | Model | BCE ↓ | AUROC ↑ | AP ↑ |
118+
|---------------|----------------|--------|---------|--------|
119+
| **PCBA_1328** | GINE | 0.0334 | 0.7879 | 0.2808 |
120+
| | GatedGCN | 0.0351 | 0.7788 | 0.2611 |
121+
| | MPNN++ (sum) | 0.0344 | 0.7815 | 0.2666 |
122+
|
123+
| **L1000_VCAP** | GINE | 0.1907 | 0.6416 | 0.4042 |
124+
| | GatedGCN | 0.1866 | 0.6395 | 0.4092 |
125+
| | MPNN++ (sum) | 0.1867 | 0.6478 | 0.4131 |
126+
|
127+
| **L1000_MCF7** | GINE | 0.1931 | 0.6352 | 0.4235 |
128+
| | GatedGCN | 0.1859 | 0.6547 | 0.4224 |
129+
| | MPNN++ (sum) | 0.1870 | 0.6593 | 0.4254 |
130+
131+
132+
99133
# UltraLarge Baseline
100134

101135
## UltraLarge test set metrics

0 commit comments

Comments
 (0)