Back to feed
arXiv cs.AI·

VolTA-3D: Self-Supervised Learning for Brain MRI using 3D Volumetric Token Alignment

Signal
72
Hype
18
In three linesVolTA-3D is a self-supervised 3D Vision Transformer framework for brain MRI. It aligns global and local tokens in a student-teacher paradigm and enforces fine-grained structural reconstruction. Evaluated on hippocampal segmentation and classification tasks (sex, Alzheimer's), it outperforms random baselines and demonstrates improved transferability across domain shifts.
Read source
Your take?
VisionPapers

Summary generated by Claude — human-verified