2026.06.27 Beyond performance: robustness-oriented model evaluation AI in Medicine ablation study AI in Medicine model evaluation robustness sensitivity analysis