Hierarchical cross-entropy loss improves atlas-scale single-cell annotation models

Apr 23, 2025·
Sebastiano Cultrera Di Montesano
,
Davide D'Ascenzo
,
Srivatsan Raghavan
,
Ava P. Amini
,
Peter S. Winter
,
Lorin Crawford
· 1 min read
Abstract
Accurately annotating cell types is essential for extracting biological insight from single-cell RNA-seq data. Although cell types are naturally organized into hierarchical ontologies, most computational models do not explicitly incorporate this structure into their training objectives. We introduce a hierarchical cross-entropy loss that aligns model objectives with biological structure. Applied to architectures ranging from linear models to transformers, this simple modification significantly improves out-of-distribution performance (12–15%) without added computational cost.
Type
Publication
bioRxiv

Accurately annotating cell types is essential for extracting biological insight from single-cell RNA-seq data. Although cell types are naturally organized into hierarchical ontologies, most computational models do not explicitly incorporate this structure into their training objectives. We introduce a hierarchical cross-entropy loss that aligns model objectives with biological structure. Applied to architectures ranging from linear models to transformers, this simple modification significantly improves out-of-distribution performance (12–15%) without added computational cost.