ToyMix Baseline
These results come from a paper to be released soon. The baselines below are for the ToyMix dataset, a multi-task dataset comprising QM9, Zinc12k and Tox21. The datasets and their splits are available on this link.
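The smaller datasets (Zinc12k and Tox21) benefit from training alongside an unrelated task (QM9), whose labels are computed from DFT simulations. With GINE, for example, the Zinc12k MAE drops from 0.266 to 0.147 and the Tox21 AUROC rises from 0.783 to 0.836 when moving from the single-task to the multi-task model. QM9 itself does not gain: its MAE is slightly higher in the multi-task setting for all three models.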
| Dataset | Model | Single-Task MAE ↓ | Single-Task Pearson ↑ | Single-Task R² ↑ | Multi-Task MAE ↓ | Multi-Task Pearson ↑ | Multi-Task R² ↑ |
|---|---|---|---|---|---|---|---|
| QM9 | GCN | 0.102 ± 0.0003 | 0.958 ± 0.0007 | 0.920 ± 0.002 | 0.119 ± 0.01 | 0.955 ± 0.001 | 0.915 ± 0.001 |
| QM9 | GIN | 0.0976 ± 0.0006 | 0.959 ± 0.0002 | 0.922 ± 0.0004 | 0.117 ± 0.01 | 0.950 ± 0.002 | 0.908 ± 0.003 |
| QM9 | GINE | 0.0959 ± 0.0002 | 0.955 ± 0.002 | 0.918 ± 0.004 | 0.102 ± 0.01 | 0.956 ± 0.0009 | 0.918 ± 0.002 |
| Zinc12k | GCN | 0.348 ± 0.02 | 0.941 ± 0.002 | 0.863 ± 0.01 | 0.226 ± 0.004 | 0.973 ± 0.0005 | 0.940 ± 0.003 |
| Zinc12k | GIN | 0.303 ± 0.007 | 0.950 ± 0.003 | 0.889 ± 0.003 | 0.189 ± 0.004 | 0.978 ± 0.006 | 0.953 ± 0.002 |
| Zinc12k | GINE | 0.266 ± 0.02 | 0.961 ± 0.003 | 0.915 ± 0.01 | 0.147 ± 0.009 | 0.987 ± 0.001 | 0.971 ± 0.003 |

| Dataset | Model | Single-Task BCE ↓ | Single-Task AUROC ↑ | Single-Task AP ↑ | Multi-Task BCE ↓ | Multi-Task AUROC ↑ | Multi-Task AP ↑ |
|---|---|---|---|---|---|---|---|
| Tox21 | GCN | 0.202 ± 0.005 | 0.773 ± 0.006 | 0.334 ± 0.03 | 0.176 ± 0.001 | 0.850 ± 0.006 | 0.446 ± 0.01 |
| Tox21 | GIN | 0.200 ± 0.002 | 0.789 ± 0.009 | 0.350 ± 0.01 | 0.176 ± 0.001 | 0.841 ± 0.005 | 0.454 ± 0.009 |
| Tox21 | GINE | 0.201 ± 0.007 | 0.783 ± 0.007 | 0.345 ± 0.02 | 0.177 ± 0.0008 | 0.836 ± 0.004 | 0.455 ± 0.008 |
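
To put a number on the single-task versus multi-task comparison, the sketch below computes the relative change in the error metric (MAE for QM9 and Zinc12k, BCE for Tox21) for each dataset and model. The mean values are copied from the tables above. The `errors` dictionary and the `relative_change` helper are illustrative names and not part of any library, and the calculation ignores the reported ± spreads, so treat the percentages as a rough guide only.

```python
# Mean error values copied from the tables above.
# MAE for QM9 and Zinc12k, BCE for Tox21; lower is better for both.
# Keys are (dataset, model); values are (single_task, multi_task).
errors = {
    ("QM9", "GCN"): (0.102, 0.119),
    ("QM9", "GIN"): (0.0976, 0.117),
    ("QM9", "GINE"): (0.0959, 0.102),
    ("Zinc12k", "GCN"): (0.348, 0.226),
    ("Zinc12k", "GIN"): (0.303, 0.189),
    ("Zinc12k", "GINE"): (0.266, 0.147),
    ("Tox21", "GCN"): (0.202, 0.176),
    ("Tox21", "GIN"): (0.200, 0.176),
    ("Tox21", "GINE"): (0.201, 0.177),
}


def relative_change(single: float, multi: float) -> float:
    """Percent change in error from single-task to multi-task.

    Negative values mean the multi-task model has lower error.
    Hypothetical helper for illustration; it ignores the ± spreads.
    """
    return 100.0 * (multi - single) / single


changes = {key: relative_change(*values) for key, values in errors.items()}

for (dataset, model), pct in changes.items():
    print(f"{dataset:8s} {model:5s} {pct:+6.1f}%")
```

Negative percentages indicate that the multi-task model has the lower error. Under these numbers, every Zinc12k and Tox21 entry is negative and every QM9 entry is positive.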