Artificial intelligence and radiologists in pancreatic cancer detection using standard of care CT scans (PANORAMA): an international, paired, non-inferiority, confirmatory, observational study

Liguori, Adriano; Ponsiglione, Andrea; Stanzione, Arnaldo; Minieri, Augusto; Lettieri, Eugenia Amelia; Mannacio, Luigi; Imbriaco, Massimo; Cuocolo, Renato; Musella, Roberta
2026

Abstract

Background Pancreatic ductal adenocarcinoma (PDAC) has the worst prognosis among major cancer types, primarily due to late diagnosis on contrast-enhanced CT. Artificial intelligence (AI) can improve diagnostic performance, but robust benchmarks and reliable comparison to radiologists' performance are scarce. We established an open-source benchmark with the aim of investigating AI systems for PDAC detection on CT and compared them to radiologists' performance, at scale.

Methods In this international, paired, non-inferiority, confirmatory, observational study (PANORAMA), the AI system was trained and externally validated within an international benchmark, with a cohort of 2310 patients from four tertiary care centres in the Netherlands and the USA for training (n=2224) and tuning (n=86), and a sequestered cohort of 1130 patients from five tertiary care centres (the Netherlands, Sweden, and Norway) for testing. A multi-reader, multi-case observer study with 68 radiologists (40 centres, 12 countries; median 9·0 [IQR 6·0–14·5] years of experience) was conducted on a subset of 391 patients from the testing cohort. The reference standard was established with histopathology and at least 3 years of clinical follow-up. The primary endpoint was the mean area under the receiver operating characteristic curve (AUROC) of the AI system compared to that of radiologists at PDAC detection on CT. The study protocol and statistical plan were prespecified to test non-inferiority (considering a margin of 0·05), followed by superiority towards the AI system. This study is registered with Zenodo (https://doi.org/10.5281/zenodo.10599559) and is complete.

Findings Of the 3440 (1511 [44%] female, 1929 [56%] male; median age 67 [IQR 58–74] years) included patients (Jan 1, 2004 to Dec 31, 2023), 1103 (32%) received a positive PDAC diagnosis. In the sequestered testing cohort of 1130 patients (406 with histologically confirmed PDAC), AI achieved an AUROC of 0·92 (95% CI 0·90–0·93). In the subset of 391 patients (144 [37%] with histologically confirmed PDAC) used for the reader study, AI achieved statistically non-inferior (p<0·0001) and superior (p=0·001) performance with an AUROC of 0·92 (95% CI 0·89–0·94), compared to the pool of 68 participating radiologists, with an AUROC of 0·88 (0·85–0·91).

Interpretation AI demonstrated substantially improved PDAC detection on routine CT scans compared to radiologists on average, showing potential to detect cancer earlier and improve patient outcomes.

Funding European Union's Horizon 2020 research and innovation programme.
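
The primary endpoint and the prespecified test sequence can be illustrated with a minimal sketch. It assumes the common rank-based definition of AUROC (the probability that a randomly chosen PDAC case receives a higher suspicion score than a randomly chosen non-PDAC case) and a decision rule based on the lower confidence bound of the AUROC difference (AI minus radiologists). The abstract does not state which estimator or multi-reader, multi-case method the study used, so the function names, the decision rule, and the toy scores below are all hypothetical and are not taken from the study.

```python
from typing import Sequence


def auroc(pos_scores: Sequence[float], neg_scores: Sequence[float]) -> float:
    """Rank-based AUROC: fraction of (positive, negative) pairs in which the
    positive case scores higher, with ties counted as half a win."""
    wins = 0.0
    for p in pos_scores:
        for n in neg_scores:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos_scores) * len(neg_scores))


def is_non_inferior(diff_lower_bound: float, margin: float = 0.05) -> bool:
    """Assumed rule: non-inferiority holds when the lower confidence bound of
    (AUROC_AI - AUROC_radiologists) lies above -margin."""
    return diff_lower_bound > -margin


def is_superior(diff_lower_bound: float) -> bool:
    """Assumed rule: superiority holds when the same lower bound lies above 0."""
    return diff_lower_bound > 0.0


if __name__ == "__main__":
    # Hypothetical suspicion scores for illustration only, NOT study data.
    pdac_scores = [0.9, 0.8, 0.4]
    non_pdac_scores = [0.7, 0.3, 0.2]
    print("toy AUROC:", auroc(pdac_scores, non_pdac_scores))

    # Hypothetical lower confidence bound for the AUROC difference.
    lower = 0.01
    print("non-inferior:", is_non_inferior(lower))
    print("superior:", is_superior(lower))
```

Testing superiority only after non-inferiority has been established mirrors the order described in the Methods; the margin of 0·05 is the only numerical value in the sketch that comes from the abstract.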
Artificial intelligence and radiologists in pancreatic cancer detection using standard of care CT scans (PANORAMA): an international, paired, non-inferiority, confirmatory, observational study / Alves, Natalia; Schuurmans, Megan; Rutkowski, Dawid; Saha, Anindo; Vendittelli, Pierpaolo; Obuchowski, Nancy; Liedenbaum, Marjolein H; Haldorsen, Ingfrid S; Molven, Anders; Yakar, Derya; Geerdink, Jeroen; Van Koeverden, Sebastiaan; Riviere, Deniece M; Venderink, Wulphert; De Haas, Robbert; Kim, Namkug; Löhr, J-Matthias; Suman, Garima; Maier-Hein, Klaus H; Hahn, Horst K; Wang, Weichung; Yuille, Alan L; Kambadakone, Avinash; Fishman, Elliot K; Verbeke, Caroline; Litjens, Geert; Hermans, John J; Huisman, Henkjan; Alves, Natália; Schuurmans, Megan; Saha, Anindo; Vendittelli, Pierpaolo; Litjens, Geert; Hermans, John; Huisman, Henkjan; Riviere, Deniece M.; Venderink, Wulphert; Van Koeverden, Sebastiaan; Rutkowski, Dawid; Liedenbaum, Marjolein H.; Haldorsen, Ingfrid S.; Molven, Anders; Yakar, Derya; De Haas, Robbert J.; Geerdink, Jeroen; Veltman, Jeroen; Yuille, Alan; Kambadakone, Avinash; Verbeke, Caroline; Matos, Celso; Fishman, Elliot; Suman, Garima; Hahn, Horst K.; Maier-Hein, Klaus; Löhr, J-Matthias; Kim, Namkug; Obuchowski, Nancy; Gallinger, Steven; Wang, Weichung; Stunt, Ali; Liu, Han; Gao, Riqiang; Grbic, Sasa; Deng, Zengtian; He, Yimeng; Shi, Yu; Vétil, Rebeca; Debs, Noëlie; Abi-Nader, Clément; Bône, Alexandre; Rohé, Marc-Michel; Yu, Ching-Yuan; Ma, Jun; Fu, Tianhao; Wang, Bo; Bezuidenhout, Abraham Fourie; Huber, Adrian Thomas; Liguori, Adriano; Korchi, Amine; Ponsiglione, Andrea; Schulz, Anselm; Stanzione, Arnaldo; Minieri, Augusto; Chen, Bang-Bin; Maino, Cesare; Triantopoulou, Charikleia; Christodoulou, Dimitra; Geisel, Dominik; Koh, Dow-Mu; Boffa, Elisa; Boninsegna, Enrico; Genco, Enza; Soloff, Erik; Lettieri, Eugenia Amelia; Omboni, Federica; Castagnoli, Francesca; Prato, Francesco; Wessels, Frank; Avesani, Giacomo; Porrello, Giorgia; Brembilla, Giorgio; Morana, Giovanni; Zamboni, Giulia; Di Costanzo, Giuseppe; Juliusson, Gunnar; Jenssen, Håvard Bjørke; Zandvoort, Herman; Pijls, Jeroen; Prince, Jip; De Paepe, Katja; Petrovic, Kosta; Van Valkenhoef, Loekie; Fortuna, Luca; Mannacio, Luigi; Engelbrecht, Marc; Chincarini, Marco; Dioguardi Burgio, Marco; Zerunian, Marta; Imbriaco, Massimo; Bariani, Matilde; Bonatti, Matteo; Ronot, Maxime; Norstedt, Natalie; Kurt, Nazmi; Patel, Nirav; Sbeghen, Paolo Maria; Patel, Pawan; Bonaffini, Pietro Andrea; Mucelli, Raffaella Pozzi; Büyüktoka, Raşit Eren; Geenen, Remy; Cuocolo, Renato; Valletta, Riccardo; Musella, Roberta; Cannella, Roberto; Dwarkasing, Roy S.; Venturini, Silvia; Gourtsoyianni, Sofia; Malekzadeh, Sonaz; Tupputi, Umberto; Obmann, Verena; Liu, Vivi. - In: THE LANCET ONCOLOGY. - ISSN 1470-2045. - 27:1(2026), pp. 116-124. [10.1016/s1470-2045(25)00567-4]
Use this identifier to cite or link to this document: https://hdl.handle.net/11588/1024242