OpenTME: An Open Dataset of AI-powered H&E Tumor Microenvironment Profiles from TCGA
Maaike Galama, Nina Kozar-Gillan, Christina Embacher, Todd Dembo, Cornelius Böhm + 16 more
TLDR
OpenTME is an open-access dataset providing AI-powered, cell-level tumor microenvironment profiles from 3,634 H&E images across five TCGA cancer types.
Key contributions
- Introduces OpenTME, an open dataset of AI-powered TME profiles from 3,634 H&E TCGA images.
- Profiles generated using Atlas H&E-TME, an AI application built on pathology foundation models.
- Provides over 4,500 quantitative, cell-level TME readouts per slide for 5 cancer types.
Why it matters
The tumor microenvironment is crucial for cancer research, but large-scale, consistent H&E characterization is lacking. OpenTME fills this gap by providing a rich, AI-generated dataset. This resource will accelerate biomarker discovery and advance computational methods for TME analysis.
Original Abstract
The tumor microenvironment (TME) plays a central role in cancer progression, treatment response, and patient outcomes, yet large-scale, consistent, and quantitative TME characterization from routine hematoxylin and eosin (H&E)-stained histopathology remains scarce. We introduce OpenTME, an open-access dataset of pre-computed TME profiles derived from 3,634 H&E-stained whole-slide images across five cancer types (bladder, breast, colorectal, liver, and lung cancer) from The Cancer Genome Atlas (TCGA). All outputs were generated using Atlas H&E-TME, an AI-powered application built on the Atlas family of pathology foundation models, which performs tissue quality control, tissue segmentation, cell detection and classification, and spatial neighborhood analysis, yielding over 4,500 quantitative readouts per slide at cell-level resolution. OpenTME is available for non-commercial academic research on Hugging Face. We will continue to expand OpenTME over time and anticipate it will serve as a resource for biomarker discovery, spatial biology research, and the development of computational methods for TME analysis.
📬 Weekly AI Paper Digest
Get the top 10 AI/ML arXiv papers from the week — summarized, scored, and delivered to your inbox every Monday.