Pathology
STARC-9 (STAnford coloRectal Cancer WSI)
STARC-9 (STAnford coloRectal Cancer): A large-scale dataset with 630k samples across nine tissue types (~70k per class) collected from over 200 WSI for multi-class tissue classification task. Stanford and TCGA-CRC tile-level validation datasets.Pretrained model weights on STARC-9, including baseline, SOTA, and Pathology-specific foundation models.Open-source code repository for DeepCluster++ framework that can be used for image sample collection across domains.
Related publication: STARC-9: A Large-scale Dataset for Multi-Class Tissue Classification for CRC Histopathology. Barathi Subramanian, et al. https://arxiv.org/abs/2511.00383