Baseline Without MIL (Verdelho et al., 2025)
Results are organized into WSI-specific prediction and Patient-specific aggregation strategies for survival prediction at 12 months using WSI-only data.
WSI-Specific Models
Model |
Balanced Accuracy (%) |
GCN |
0.6873 ± 0.0061 |
GAT (1 head) |
0.6917 ± 0.0118 |
GAT (2 heads) |
0.6866 ± 0.0181 |
GraphSAGE FB (mean) |
0.7003 ± 0.0089 |
GraphSAGE FB (max) |
0.6998 ± 0.0135 |
Patient-Specific Aggregation
Aggregation |
Model |
Bacc ± Std (%) |
Majority Voting | GCN | 67.95 ± 2.08 |
GAT 1 head | 68.01 ± 3.36 |
GAT 2 head | 68.56 ± 2.42 |
GraphSAGE FB (mean) | 70.78 ± 4.46 |
GraphSAGE FB (max) | 69.38 ± 2.98 |
One Dominance | GCN | 70.23 ± 2.55 |
GAT 1 head | 69.18 ± 3.18 |
GAT 2 head | 69.70 ± 2.61 |
GraphSAGE FB (mean) | 71.04 ± 2.12 |
GraphSAGE FB (max) | 69.49 ± 5.27 |
MIL WSI | GCN | 75.05 ± 0.18 |
GAT 1 head | 72.41 ± 2.11 |
GAT 2 head | 76.42 ± 1.62 |
GraphSAGE FB (mean) | 73.46 ± 2.92 |
GraphSAGE FB (max) | 71.35 ± 2.13 |
Scientific Papers
If you want more information about this benchmark feel free to read our papers:
Tiago Mota, Maria Rita Fonseca Verdelho, Diogo José Pereira Araújo, Alceu Bissoto, Carlos Santiago, Catarina Barata,
MMIST-ccRCC: A Real-World Medical Dataset for the Development of Multi-Modal Systems, Data Curation and Augmentation in Enhancing Medical Imaging Applications Workshop (archival) @CVPR 2024.
Maria Rita Fonseca Verdelho, Alexandre Bernardino, Catarina Barata,
MIL vs. Aggregation: Evaluating Patient-Level Survival Prediction Strategies Using Graph-Based Learning