AI Multimodal
Curation of Multimodal — listings under the AI tab.
Summary Table
Click a column header to sort the table.
| NAME | Main citation | YEAR |
|---|---|---|
| CONCH | Lu MY et al., Nat Med, 2024 |
2024 |
CONCH
PUBMED_LINK
FULL NAME
CONCH — Contrastive Learning from Captions for Histopathology
DESCRIPTION
CONCH (CONtrastive learning from Captions for Histopathology) is a vision-language foundation model pretrained on 1.17 million histopathology image-caption pairs. It achieves state-of-the-art performance across 14 diverse benchmarks including histology image classification, segmentation, captioning, text-to-image and image-to-text retrieval. As a multimodal model bridging visual pathology data with biomedical text, CONCH enables zero-shot transfer and minimal fine-tuning for diverse computational pathology tasks.
URL
TITLE
A visual-language foundation model for computational pathology.
Main citation
Lu MY, Chen B, Williamson DFK, Chen RJ, Liang I, Ding T, Noor G, Sang Y, Mahmood F. (2024) A visual-language foundation model for computational pathology. Nature Medicine, 30(3):863-874. doi:10.1038/s41591-024-02856-4. PMID 38480913
ABSTRACT
The accelerated adoption of digital pathology and advances in deep learning have enabled the development of robust models for various pathology tasks. However, model training is often difficult due to label scarcity. Additionally, most models in histopathology leverage only image data. We introduce CONCH, a visual-language foundation model developed using diverse sources of histopathology images, biomedical text, and over 1.17 million image-caption pairs via task-agnostic pretraining. Evaluated on 14 diverse benchmarks, CONCH achieves state-of-the-art performance on histology image classification, segmentation, captioning, and cross-modal retrieval.
DOI
10.1038/s41591-024-02856-4