We adapt the remote sensing-inspired AMBER model (Dosi et al., 2025) from multi-band image segmentation to 3D medical datacube segmentation. To address the computational bottleneck of the volumetric transformer, we propose the AMBER-AFNO architecture. This approach uses Adaptive Fourier Neural Operators (AFNO) instead of the multi-head self-attention mechanism. Unlike spatial pairwise interactions between tokens, global token mixing in the frequency domain avoids attention-weight calculations. As a result, AMBER-AFNO achieves quasi-linear computational complexity and linear memory scaling. This new way to model global context reduces reliance on dense transformers while preserving global contextual modeling capability. By using attention-free spectral operations, our design offers a compact parameterization and maintains a competitive computational complexity. We evaluate AMBER-AFNO on three public datasets: ACDC, Synapse, and BraTS. On these datasets, the model achieves state-of-the-art or near-state-of-the-art results for DSC and HD95. Compared with recent compact CNN and Transformer architectures, our approach yields higher Dice scores while maintaining a compact model size. Overall, our results show that frequency-domain token mixing with AFNO provides a fast and efficient alternative to self-attention mechanisms for 3D medical image segmentation.

Less is More: AMBER-AFNO - a New Benchmark for Lightweight 3D Medical Image Segmentation / Dosi, Andrea; Mondal, Semanto; Chandra Ghosh, Rajib; Brescia, Massimo; Longo, Giuseppe. - In: EXPERT SYSTEMS WITH APPLICATIONS. - ISSN 0957-4174. - (2026). [10.1016/j.eswa.2026.132518]

Less is More: AMBER-AFNO - a New Benchmark for Lightweight 3D Medical Image Segmentation

Andrea Dosi
;
Semanto Mondal;Rajib Chandra Ghosh;Massimo Brescia
;
Giuseppe Longo
2026

Abstract

We adapt the remote sensing-inspired AMBER model (Dosi et al., 2025) from multi-band image segmentation to 3D medical datacube segmentation. To address the computational bottleneck of the volumetric transformer, we propose the AMBER-AFNO architecture. This approach uses Adaptive Fourier Neural Operators (AFNO) instead of the multi-head self-attention mechanism. Unlike spatial pairwise interactions between tokens, global token mixing in the frequency domain avoids attention-weight calculations. As a result, AMBER-AFNO achieves quasi-linear computational complexity and linear memory scaling. This new way to model global context reduces reliance on dense transformers while preserving global contextual modeling capability. By using attention-free spectral operations, our design offers a compact parameterization and maintains a competitive computational complexity. We evaluate AMBER-AFNO on three public datasets: ACDC, Synapse, and BraTS. On these datasets, the model achieves state-of-the-art or near-state-of-the-art results for DSC and HD95. Compared with recent compact CNN and Transformer architectures, our approach yields higher Dice scores while maintaining a compact model size. Overall, our results show that frequency-domain token mixing with AFNO provides a fast and efficient alternative to self-attention mechanisms for 3D medical image segmentation.
2026
Less is More: AMBER-AFNO - a New Benchmark for Lightweight 3D Medical Image Segmentation / Dosi, Andrea; Mondal, Semanto; Chandra Ghosh, Rajib; Brescia, Massimo; Longo, Giuseppe. - In: EXPERT SYSTEMS WITH APPLICATIONS. - ISSN 0957-4174. - (2026). [10.1016/j.eswa.2026.132518]
File in questo prodotto:
File Dimensione Formato  
Dosi-et-al-2026.pdf

accesso aperto

Tipologia: Documento in Pre-print
Licenza: Creative commons
Dimensione 3.6 MB
Formato Adobe PDF
3.6 MB Adobe PDF Visualizza/Apri

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11588/1041376
Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus ND
  • ???jsp.display-item.citation.isi??? ND
social impact